Preface


When Wiley-VCH’s consulting editor, Ed Immergut, invited me to edit a book
on “alloy physics”, I hesitated greatly before agreeing, as I was a bit shocked by
the breadth of the topic to be covered in such a book. I therefore first suggested a 
volume on Atom Jumps in Intermetallics to reduce the area correspondingly, but 
finally agreed to Alloy Physics because it would appeal to a much broader audience 
including physicists, materials scientists, physical chemists, and metallurgists in 
academic as well as in industrial laboratories. 

Typing “alloy physics” into an Internet search engine, I suddenly realized that
indeed there seems to be a need for such a reference book. I therefore started to 
prepare a tentative outline, and found that this could be done in a straightforward 
way and in a comparatively short time. Another surprise was the great number of 
very positive referee reports on my outline — four of them arrived within just two 
days after I sent out my outline! 

The next step was to find and motivate chapter authors to take part in the proj- 
ect. The resulting book now contains 14 chapters, written by an international 
team of authors, some of whom have written their text by correspondence across 
the Atlantic. 

An essential point is that the book does not avoid overlaps between chapters: on 
the contrary, I hope that looking at fundamental problems in alloy physics from 
different aspects will make the book even more interesting and useful. 

I am very grateful to my colleague Wolfgang Püschl for continual discussions, 
comments, and suggestions, to our student David Reith for the use of one of his 
simulation pictures for the cover, and last but not least to my dear wife Heidi for 
her great patience and encouragement. 


Vienna, May 2007 Wolfgang Pfeiler 
Foreword *


The word ‘alloy’, from the old French, originally carried overtones of base metals 
contaminating noble ones, and the word is still sometimes used in this sense 
in literary metaphor. Today, however, we know that the true practical value of an 
alloy derives from the knowledge and ingenuity that has gone into the choice of 
constituents, proportions and heat-treatment, and an aristocratic alloy often 
emerges from a mix of base metals. So, ‘nobility’ in alloys is a function of under- 
standing, not of scarcity in the earth's crust. 

This important book sets out the sources of the modern understanding of 
alloys, in great depth. The book represents the intimate mingling of physics 
with metallurgy — whether that is called alloy physics or physical metallurgy is of 
no consequence. It is some time since an overview of this whole field has been 
published and the time is very ripe for this new venture. The book not only mixes 
theory and experiment in a very helpful way but it also comprehensively covers 
the world community of alloy physicists. 

Sitting at the centre of the spider’s web is a notable Austrian physicist: Wolfgang
Pfeiler’s office is in the Boltzmanngasse, named after the genial theorist
who first properly understood the entropy of mixtures. All of us in the profession 
owe Wolfgang a great debt for his skill and persuasiveness in drawing together 
such a distinguished array of contributors. 


Cambridge, March 2007 Robert W. Cahn, FRS 


* Robert Wolfgang Cahn (1924-2007) passed away on April 9th, 2007. 


Professor Robert Cahn was one of the first promoters of ‘Materials Science’ (see 
his recent book ‘The Coming of Materials Science’, Pergamon 2001) and soon 
became an international leader in this field. Aside from his own intense scientific
research he was editor of many compendia and book series and founded four 
journals. His memoirs have recently been published (‘The Art of Belonging’, 
Book Guild Publishing 2005). 
Motto


Georgii Agricolae, De re metallica libri XII, quibus Officia, Instrumenta, Machinae, 
ac omnia denique ad Metallicam spectantia, non modo luculentissime describuntur, sed 
& per effigies, suis locis insertas, adiunctis Latinis, Germanisque appellationibus ita ob 
oculos ponuntur, ut clarius tradi non possint. Eiusdem de animantibus subterraneis 
Liber, ab Autore recognitus: cum Indicibus diversis, quicquid in opere tractatum est, 
pulchre demonstrantibus. Basileae MDLVI: 

Metallicus praeterea sit oportet multarum artium & disciplinarum non ignarus: 
Primo Philosophiae, ut subterraneorum ortus & causas, naturasque noscat: Nam ad 
fodiendas venas faciliore & commodiore uia perveniet, & ex effossis uberiores capiet 
fructus. Secundo Medicinae, ut fossoribus & aliis operariis providere possit, ne in mor- 
bos, quibus prae caeteris urgentur, incidant: aut si inciderint, vel ipse eis curationes ad- 
hibere, vel ut medici adhibeant curare. Tertio Astronomiae, ut cognoscat coeli partes, 
atque ex eis venarum extensiones iudicet. Quarto Mensurarum disciplinae, ut & metiri 
queat, quam alte fodiendus sit puteus, ut pertineat ad cuniculum usque qui eo agitur, & 
certos cuique fodinae, praesertim in profundo, constituere fines terminosque. Tum nu- 
merorum disciplinae sit intelligens, ut sumptus, qui in machinas & fossiones habendi 
sunt, ad calculos revocare possit. Deinde Architecturae, ut diversas machinas substruc- 
tionesque ipse fabricari, vel magis fabricandi rationem aliis explicare queat. Postea Pic- 
turae, ut machinarum exempla deformare possit. Postremo Iuris, maxime metallici sit 
peritus, ut & alteri nihil surripiat, & sibi petat non iniquum, munusque aliis de iure 
respondendi sustineat. Itaque necesse est ut is, cui placent certae rationes & praecepta 
rei metallicae hos aliosque nostros libros studiose diligenterque legat, aut de quaque re 
consulat experientes metallicos, sed paucos inveniet gnaros totius artis. 

“Furthermore, there are many arts and sciences of which a miner should not 
be ignorant. First there is Philosophy, that he may discern the origin, cause, and 
nature of subterranean things; for then he will be able to dig out the veins easily 
and advantageously, and to obtain more abundant results from his mining. Sec- 
ondly, there is Medicine, that he may be able to look after his diggers and other 
workmen, that they do not meet with those diseases to which they are more liable 
than workmen in other occupations, or if they do meet with them, that he him- 
self may be able to heal them or may see that the doctors do so. Thirdly follows 
Astronomy, that he may know the divisions of the heavens and from them judge 
the direction of the veins. Fourthly, there is the science of Surveying that he may 
be able to estimate how deep a shaft should be sunk to reach the tunnel which is 
being driven to it, and to determine the limits and boundaries in these workings, 
especially in depth. Fifthly, his knowledge in Arithmetical Science should be such 
that he may calculate the cost to be incurred in the machinery and the working of 
the mine. Sixthly, his learning must comprise Architecture, that he himself may 
construct various machines and timber work required underground, or that he 
may be able to explain the method of the construction to others. Next, he must 
have knowledge of Drawing, that he can draw plans of his machinery. Lastly, 
there is the Law, especially that dealing with metals, that he may claim his own 
rights, that he may undertake the duty of giving others his opinion on legal mat- 
ters, that he may not take another man’s property and so make trouble for him- 
self, and that he may fulfill his obligations according to the law. 

It is therefore necessary that those who take interest in the methods and pre- 
cepts of mining and metallurgy should read these and others of our books studi- 
ously and diligently; or on every point they should consult expert mining people, 
though they will discover few who are skilled in the whole art.” 

This translation follows Hoover and Hoover (1950). 

Thus, here in the introduction to his famous work, Agricola gives us a plethora 
of skills and arts which are indispensable for anyone extracting metal ores from 
the bowels of the earth and processing them. This enumeration is more than 
symbolic for the conditions the modern alloy physicist has to fulfill. Although 
the development of science has carried his tasks to other levels of sophistication, 
as a result they are no less numerous and diverse than those of the medieval met- 
alworker. It is the intended spirit of this book to provide, by presenting solid 
knowledge and different viewpoints, the wide horizon which favors significant 
scientific progress. 


Reference 


Hoover, H.C. and Hoover, L.H. (1950) Translation of the first Latin Edition of 1556, 
Georgius Agricola: De Re Metallica, Dover Publications, Inc., New York, p. 3. 
1
Introduction 


Wolfgang Pfeiler 


1.1 
The Importance of Alloys at the Beginning of the Third Millennium 


An alloy is a “mixture” of two or more chemical elements, one of which at least is 
a metal. The alloying element can be distributed over the crystal lattice sites of the 
host element and yield a solid solution, or it can form different phases showing 
up as particles in a “matrix.” Whereas the physical properties of a solid solution 
are essentially determined by the chemical composition of the constituents, the 
properties of a multiphase alloy are determined largely by the spatial distribution 
of the second-phase particles. This possibility of “designing” the physical and
technical properties of a material by a careful selection of alloying elements and
alloying concentrations has kept alloys in the forefront of materials from early
human history up to our time at the beginning of the third millennium.

We are facing tremendous progress in materials development and design of
advanced materials, driven by technical needs in all fields of modern production. A
real revolution in materials science, however, has been observed in very recent
years: a great leap from understanding bulk materials to the study, development,
and application of nanostructured materials. In the present book we give
accounts of the state-of-the-art of alloy physics and the challenges of future re- 
search in the field together with the basic knowledge necessary to understand 
the radical changes currently happening. 

Materials today, aside from metals and alloys, include such different substances
as ceramics, high-Tc (HTc) superconductors, liquid crystals, polymers, foams,
biomimetic materials, nanotubes, nanocomposites ... and it is justified to ask:
“What is the importance of alloys in the field of modern materials today? What 
role do alloys play in the fascinating advance toward nanostructurization?” 

We will later cast a glance at the historical perspectives together with a 
short description of the development of modern alloy science. However, let us 
get started by affirming the importance of alloys at the onset of the third 
millennium. 


Conductors and superconductors In July 1996 overloaded transmission lines sag- 
ging low enough to touch trees caused a blackout of electric power which affected 
millions of people in the western part of the United States, Canada, and parts of 
Mexico. The blackout on August 14, 2003 was the biggest in US history. Roughly
50 million people lost power due to a failure in the “Lake Erie Loop.” A similar 
event occurred in Europe on September 28 of the same year: some 57 million
people lost their electrical power in Italy at the weekend when the national grid 
crashed. The minor event that a single tree fell across a line in the Alps during a 
storm started a domino effect which soon knocked out the entire Italian grid. 

A key step in the process of improving the distribution of electric power lies in 
reducing the weight/conductivity ratio of utility transmission and distribution 
lines. 

Aside from conventional conductors used for overhead transmission and distri- 
bution lines, special systems making use of superconductivity may be applied. 
The capacity of superconducting materials to handle large currents with no resis- 
tance and extremely low energy losses can be applied to electric devices such as 
motors and generators as well as to transmission in electric power lines. A super- 
conducting power system would meet the growing demand for electricity with 
fewer power plants and transmission lines than would otherwise be needed.
High-temperature superconductors which need cooling to only about 80 K seem espe-
cially promising and are just about to enter economical use for utility applica- 
tions. Possible applications are cables which carry up to five times more power 
than conventional utility cables, smaller and more efficient motors, smaller and 
lighter generators, compact transformers without oil (but with liquid nitrogen) 
as the cooling medium, fault-current limiters, and superconducting magnetic en- 
ergy storage (SMES) systems which store energy in a magnetic field created by 
the flow of direct current in a coil of superconducting material. 
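
For orientation, the energy held in such a coil follows the standard expression for the magnetic energy of an inductor. The symbols E (stored energy), L (coil inductance), and I (direct current) are introduced here for illustration only and do not appear in the original text:

```latex
E = \tfrac{1}{2} L I^{2}
```

Because the stored energy grows with the square of the current, the capacity of superconductors to carry large currents with no resistance is what makes this form of storage attractive.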


Soft and hard magnets Tailoring the ferromagnetic hysteresis curves is an ongo- 
ing demand in the field of soft and hard magnets. At one end of the spectrum 
magnetically soft materials reduce hysteresis losses, where necessary. Amorphous 
alloys, for example, show a high permeability and very low losses (especially low 
losses at elevated frequencies), but also a high magnetization at room tempera- 
ture. At the other end, using rare earth alloys we get very high values of magneti- 
zation which have been increasing exponentially during recent years. Magneti- 
cally hard materials with high coercivity are needed for magnetic storage. To 
increase information storage density, nanosized magnetic domain structures are 
being developed. Nanocrystalline soft magnetic materials are already commer- 
cially available. Nanocrystalline hard magnetic materials exhibit interesting prop- 
erties; an industrial breakthrough has not yet been achieved, however. 


Intermetallics and superalloys Intermetallic compounds, because of their long- 
range ordered structure, show a temperature range where their mechanical 
strength increases with temperature (yield stress anomaly). Together with their
advantageous corrosion properties, this makes them potential high-temperature
structural materials. The present challenge is to combine extreme
strength and hardness with sufficient ductility and surface stability, a problem
which has not yet really been solved.

At present, turbo superchargers and aircraft turbine engines are made of
two-phase nickel-based superalloys, usually strengthened by a coherent precipitation
of L1₂-ordered γ′ particles in the disordered face-centered cubic γ matrix. The
operating surface temperatures of turbine blades are close to 1150 °C, and the
average bulk metal temperature approaches 1000 °C. Ni-based superalloys are also
used in load-bearing structures at up to 90% of their melting temperature. 

Another possible use of intermetallics, when they are ferromagnetic and
nanostructured, is in high-density magnetic recording, a field in which much
effort is invested at the moment, especially in order to overcome the
superparamagnetic limit.


Shape memory alloys These are alloys which exhibit thermoelastic effects and re- 
vert to the original shape by a phase transformation when they are heated after a 
plastic deformation. The applications of shape memory alloys are essentially in 
electrical/mechanical junctions, actuators which are the most promising alterna- 
tives to hydraulic systems, actuators for microelectromechanical systems (MEMS), 
surgical tools such as NiTi bone plates which apply a steady pressure to assist the 
healing process and thus reduce recovery time, and robotic muscles (shape mem- 
ory alloys mimic human muscles and tendons very well). In addition, magnetic 
shape memory alloys (MSM) based on NiMnGa exist in which the martensitic 
transition can be triggered by an external magnetic field. These are alloys for 
which many applications can be expected. 


Nanocrystalline and amorphous alloys Glassy forms of metallic alloys have quite 
unique mechanical, electrical, and magnetic properties. It has been found in the 
recent past that for certain alloy compositions appropriate cooling rates can be ap- 
plied to form “bulk amorphous alloys.” Many of these alloys have a very high 
strength to weight ratio with excellent elastic energy storage. Other applications 
are in ferromagnets ranging from the lowest known coercivities to extremely 
high coercivities with high saturation induction values, as well as in composites
made of ferromagnetic and antiferromagnetic components (giant magnetoresistance),
and in magnetic reading heads.


Dimensionally restricted alloys Dimensionally restricted systems are currently 
very fashionable in physics and materials science and research in this area easily 
attracts funding. In electronics, for example, the ever greater complexity in
microprocessor and memory chips means an exponentially increasing chip density
(with a doubling time of about two years at the moment: Moore’s law). Individual
electronic components are therefore smaller than 100 nm, bringing these con- 
ducting elements into the low-dimensional regime. Another field where nano- 
structurization commends itself is high-density data storage and magnetic or 
magneto-optic recording. Further materials enhancement is achieved by
incorporating nanocrystalline particles to increase toughness or to improve catalyst
properties.
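
The two-year doubling quoted above for chip density can be illustrated numerically. The following is a minimal sketch, assuming an idealized exponential law with a constant doubling time; the function name `density_growth_factor` and the sample time spans are illustrative assumptions and are not taken from the text.

```python
def density_growth_factor(years: float, doubling_time_years: float = 2.0) -> float:
    """Return the factor by which chip density grows after `years`.

    Assumes ideal exponential growth with a constant doubling time
    (an idealization of Moore's law, not a measured trend).
    """
    if doubling_time_years <= 0:
        raise ValueError("doubling time must be positive")
    return 2.0 ** (years / doubling_time_years)


if __name__ == "__main__":
    # Hypothetical time spans chosen only to show the scale of the growth.
    for years in (2, 10, 20):
        print(f"after {years:2d} years: x{density_growth_factor(years):.0f}")
```

Under this assumption a single decade already corresponds to five doublings, which is why individual components reach the sub-100 nm regime so quickly.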

In addition, there is a steadily growing general interest in the basic physics of 
low-dimensional alloys. Significant differences are obtained in the physical prop- 
erties of nanosized structures as compared to the bulk, for instance a change in 
melting temperature or modification of microstructures. The changes in diffu- 
sion processes seem to be essential but are not known in detail. Other fundamen- 
tal questions are which structural defects are stable and what their role is in 
phase transformations. It is therefore important to study all the changes which 
occur when going from bulk to thin films, nanowires, or small atom clusters. 


Friction and wear The reduction of friction between moving parts becomes more 
and more important for future mechanical applications, especially in modern 
combustion engines. On the one hand there are efforts to minimize the energy 
loss during operation; on the other hand there is the trend to simultaneously 
maximize the output and minimize the engine volume, in this way increasing 
the efficiency of engines. Friction properties and the resistance to high thermal 
stresses therefore become main design factors. Besides low friction and low 
wear rate, the materials must also have high fatigue strength and good ductility 
to sustain the deformation of the different parts caused by the firing pressure 
and by the inertial forces. To close the gap between the performance of
conventional materials and the needs of the engines of the next generation, there is
an acute demand for the design of new tribological alloys.


Medical and biological applications Special alloy products are frequently used for 
medical applications, e.g. in accident surgery. Typically, these include orthopedic 
and dental implants, fracture fixation devices, suture needles and staples, surgical 
blades and saw cutting tools, dental burrs and reamers, catheter guide wire sys- 
tems and wire for diagnostic and sensor electrical leads. Another area of current 
interest is biomimetic engineering. Inspired by Nature, biomimetic engineering 
copies natural systems, utilizing molecular self-assembly as the key link between 
physics, chemistry, and biology, and thereby creating novel advanced structures. 
The driving force stems from the recognition that there are a number of areas 
where biological methods are more efficient and environmentally friendly than 
current technology, and superior to it overall. These areas are, for example, en- 
ergy storage and utilization, low-temperature fabrication of complex materials, 
linear motors and actuators, dirt-repellent surfaces, neural computation, sensors, 
and many other possible applications. Alloys play a major role in this field. 

In conclusion, alloy physics is at the hub of materials science and materials
physics. Virtually all high-technology developments involve modern alloys, and
alloy design is a driving force of scientific research in materials physics.
Within the field of technological applications, specialized alloys are involved
in all cutting-edge technologies. In addition, alloys have essential repercussions
in other fields as well, such as environmental technology to limit the growing
ecocide, modern surgery and health care, and even the social and economic
development of underdeveloped regions.


1.2 
Historical Development 


1.2.1 
Historical Perspective 


Modern anthropology dates the rise of the genus “human” at more than three 
million years ago (Homo rudolfensis and Homo habilis, both being potential ances- 
tors of Homo ergaster, who thrived about 1.5 million years ago). Looking into the 
early history of man, certain development “steps” can be distinguished, which are 
correlated with a “revolutionary” advance of skills. One example is the first han- 
dling of fire in the middle of the Pleistocene (the Ice Ages) together with the 
manufacture of stone tools. This age is therefore also called the Old Stone Age 
(the Paleolithic period) and marks the first appearance of human species closely 
resembling ourselves (Homo ergaster → Homo heidelbergensis → Homo sapiens). It
is certainly at that moment that “matter” changed into “material” - in the sense 
that a material is matter with some special use. Another “revolutionary” change 
led directly to the class of materials which is the focus of this book: the
development of metallic alloys had probably already started in the Far East, in
China and Indochina, Thailand, Vietnam, and East India, before 5000 BC. Among the
oldest known tin bronzes are the dagger of Veľke Raškovce in central Europe
(Slovakia), containing 4.5 wt.% tin, from the second half of the fifth millennium BC,
and pieces from Mundigak, Afghanistan, from the second half of the fourth
millennium BC (Trnka 2006). The reason why those processes of human development
look fast and “revolutionary” to us, while they are in fact slow and evolutionary, 
is to be found in the large timescale for counting these developments in the far 
past. Actually, a drastic reduction of the periods in which such far-reaching 
changes took place can already be recognized by comparing the Stone Age with 
the Bronze Age. Whereas the human development during the Stone Age took 
roughly a million years, the Bronze Age in Middle Europe was superseded by 
the Iron Age after no more than about 2000 years. A corresponding acceleration 
in technical development continues into present times, the replacement of elec- 
tron tubes by transistors after only 50 years being such an example. 

It is known that in early history the extended use of tools and weapons was es- 
sential for the survival of the human species, in the permanent search for food 
and in the fight against animals as well as other human species. Tools and weap- 
ons were first found accidentally in the near surroundings, then later made artifi- 
cially from specific materials. This documents the importance of materials for 
mankind, which has continued right up to the 21st century. Very early in human 
history, alloys started to play a central role among the materials used, because it 
was quickly recognized that many of the desired properties can be created by
deliberately “mixing” together a metal with other components, especially other
metals. Metal alloys, therefore, immediately started a new era of human life 
when tools of copper bronze (70-95% copper and 30-5% tin) prevailed over ear- 
lier ones made from stone or pure copper. The evolution of hardening a metal by 
alloying was accompanied by the possibility of casting complicated shapes in spe- 
cial molds made of wax embedded in clay. 

We will now give an overview of the historical perspectives of alloys, followed 
by a short description of the development of modern alloy science. 

During the final period of the Stone Age men became settled and started to live 
together in small tribes and communities. Lumps of native copper and later of 
gold, collected from ice-free mountains and out of rivers, respectively, were
probably the first metals to be discovered, and both were immediately recognized as
being excellently suited to substitute for wood and bone in artwork. The first use of
copper dates back more than 10000 years (Ergani Maden and Catal Hüyük in the 
highlands of Anatolia). An intensified search then probably brought to light the 
bright green malachite and the blue azurite. It was learned that copper got hard 
when hammered and could be softened again by heating it. Early smelting
techniques were already being applied around 6500 BC, first in East Asia (China) and
West Asia (Near East), and from the first half of the fifth millennium BC onward
in Europe also.

The very first bronze pieces used were made from “arsenic bronze,” with arsenic
accidentally contained in the copper ore. It turned out to be easier to cast, more
ductile, and harder than pure copper. It is now known that arsenic acts as a
deoxidizer, binding the harmful oxygen and thereby reducing brittleness. Later, by
intentional addition
of tin to copper, true “bronze” (tin bronze) was made and the production became 
independent of deposits with special ore. Its use spread very rapidly out of Meso- 
potamia to the Mediterranean countries, where bronze lasted for over 2000 years 
and even longer in north-western Europe (see Fig. 1.1). 

It is assumed that the knowledge about iron smelting and how to obtain steel 
was first gained in the highlands of Anatolia where the Hittites flourished from 
about 1450 to 1200 BC, but several objects of smelted iron date back further than
2000 BC (Cowen 2005). When Tutankhamen was buried in about 1400 BC he had
with him an iron dagger with a hilt and sheath of gold decorated with rock crys- 
tal. Surprisingly enough, the dagger blade had not rusted within the 3000 years 
and more until its discovery, whereas many other ancient iron objects probably 
perished as they were transformed into shapeless rust. Since the element iron 
practically always occurs as a compound, usually siderite (FeCO3) and often
together with malachite (CuCO3), most of these very old iron weapons were
made from meteoritic iron or from the black sands on the south coast of the
Black Sea containing the iron-rich magnetite (Fe3O4). Among the oldest known
furnaces for “cooking” iron from iron ore are those from Tell Hammeh, on the north
bank of the Zarqa River in Jordan, which date to the eighth century BC (Veldhuijzen
and Van der Steen, 2000). In Europe iron came into use with the development
of iron smelting and working, which began around 800 BC during the
Hallstatt culture, named after the famous cemetery near a salt mine at Hallstatt,
Austria.

Fig. 1.1 (a) “Strettweger Kultwagen,” a beautifully made artefact showing a
sacrificial procession with a cart, bearing a vessel containing some liquid for
offerings. It was produced about 600 BC and found in 1851 near Judenburg, Austria
(Egg 1996). (b) “Kultwagen von Acholshausen,” made about 1000 BC, found near
Acholshausen, Germany (Worschech 1977/78). Both objects show the importance of
coaches with spoke wheels as prestigious objects used for religious ceremonies.

The usual way to obtain useful products from the unattractive, unmelted iron 
which is left in the furnace (a spongy mass, the “bloom”, containing slag) is to 
hammer the liquid slag out of the hot lump, yielding fairly pure wrought iron. 
The result may not be better than bronze. Only in a forging process with a long 
procedure of hammering and reheating were superior steels obtained. In 
charcoal-fired forges the fresh, hot surface of the wrought iron comes into contact 
with carbon and carbon monoxide which thus “carburize” the iron surface to 
form steel, an iron-carbon alloy.

Alloying iron with more than 2% carbon considerably reduces the melting tem- 
perature with respect to iron and makes it possible to cast in molds. Whereas in 
the western hemisphere this development was discouraged by the knowledge that 
more than 1% carbon makes the iron alloy brittle, the superior furnace technology
in China already allowed iron casting in the first half of the first millennium
BC, and the technique of decarburization by reheating to 800-900 °C in air
between 400 and 300 BC (Cowen 2005).

When they came to the Near East during the Crusades, European knights 
learned to fear the extreme quality of razor-sharp Damascene blades (see Fig. 
1.2). Two production methods have been applied, both dating back to before 500 
AD. One method was to forge two ingots of high- and low-carbon steels,
respectively, into flat sheets and, after cutting them to pieces, to bond the
pieces intimately together by continued folding, twisting, and hammering of the
work piece (Fig. 1.2a). The other method started from a single high-carbon steel
ingot, called “wootz,” originally produced from special ore in India. The wootz
was then forged in many heating-cooling cycles (Fig. 1.2b).

Fig. 1.2 (a) Etched surface of a Damascus blade showing in detail the
Damascene surface pattern generated by continued folding and twisting
during forge welding of alternating sheets of high- and low-carbon
steels. (b) A reconstructed Damascus blade showing the Damascene
surface pattern (“Mohammed's ladders” and “rose patterns”)
produced from a single wootz steel ingot (Verhoeven et al. 1998).

Fig. 1.3 During cooling of the “wootz” ingot, impurity elements (e.g.
vanadium) segregate out of the solid iron and freeze in an aligned way.
Subsequent cycles of heating and cooling lead to a growth of iron
carbide particles (Fe3C, cementite) along the lines, forming the
light-colored bands in the Damascene blade. Right: Micrograph showing
light and dark bands in an original Damascene blade (top) and a
modern reconstruction (bottom) (Verhoeven 2001).

The secret of the production of Damascene blades from a wootz steel ingot was 
unfortunately lost during the 18th century, but has recently been rediscovered in 
a co-operation between a scientist and a blacksmith (Verhoeven et al. 1998;
Verhoeven 2001). It turned out that the surface patterns on the blades resulted from
the formation of bands of iron carbide (Fe3C) particles initiated by the
microsegregation of carbide-forming impurity elements present in the wootz ingots
(Fig. 1.3).


1.2.2 
The Development of Modern Alloy Science 


Whereas, as we have seen, the roots of alloy production go back into early human 
history, what we can call “scientific alloy research” started in the 19th century. It
is well known that the 19th century was very productive in several fields; the time 
of the industrial revolution was accompanied by a deep belief in progress and in 
universal feasibility, and several new branches of science (social, medical, bio- 
logical, technical) were developed. The progress in science and technology began 
to be tightly connected. The foundations were laid for more sophisticated studies 
of alloys by the development of crystallography, elasticity, electrochemistry, and 
metallography, together with a parallel advance in experimental methods (e.g., 
thermometry and measurement of electrical resistivity and hardness). 

In his two articles “On the Equilibrium of Heterogeneous Substances” (in 1876 
and 1878, respectively), J. W. Gibbs applied thermodynamics to the phase equi- 
libria of alloys, introduced the phase diagram as a basis of modern alloy science, 
and derived his famous phase rule. We can indeed connect the rise of modern 
alloy physics with this pivotal work of Gibbs. An important first step toward an 
understanding of alloy kinetics was the investigation of solid-state diffusion by 
W.C. Roberts-Austen, showing in 1896 that the diffusion coefficient of gold in 
lead was surprisingly far greater than supposed. The systematic study of binary 
phase diagrams is closely connected with G. Tammann. Generations of students 
used Tammann’s famous book Lehrbuch der Metallographie (first published in 
1914) for getting started with binary phase diagrams. The idea of the crystallinity 
of metals and alloys was already growing during the 18th and 19th centuries, but 
conclusive evidence of their crystalline structure was furnished after the discovery 
of X-rays as electromagnetic waves which could be diffracted by crystals (M. von 
Laue in 1912). Immediately afterward, W. H. Bragg and his son W. L. Bragg used 
X-rays systematically for the study of crystal structures (in 1913-1914). The deter- 
mination of crystal unit cells of many primary and intermediate phases brought 
about a real revolution. This was also the onset of the study of chemical long- 
range order (LRO), i.e., the condensation of the different sorts of atom in an alloy 
on specific sublattices of the crystal structure. The three papers by W. L. Bragg 
and E. J. Williams discussed for the first time the degree of LRO and LRO kinet- 
ics with respect to atomic interaction energies (Bragg and Williams 1934, 1935; 
Williams 1935). These papers are still great reading for anybody entering the field 
of ordering kinetics. 

In 1930, F. Laves tried a classification of crystal structures from a topological 
point of view and found special compounds which are stabilized by their high 
packing density. These “Laves phases” have a packing density higher than a 
close-packed structure of identical hard spheres. 

Besides phase diagrams, Tammann investigated extensively the mechanisms of 
phase transformations during freezing, which led to the development of a theory 
of nucleation by M. Volmer and A. Weber in 1926, later applied to solid-state 
phase transformations by R. Becker and W. Döring (in 1935). 

In 1926 W. Hume-Rothery showed that the electron/atom ratio plays a decisive 
role in the determination of the solid solubility of copper, silver, and gold alloys 
with B-subgroup metals and also for the so-called “magic numbers” of concentra- 
tions involved in the intermediate phases of these alloys. The reason lies in varia- 
tions of the band structure for different crystallographic systems, which come 
into play when metals of different electron/atom ratio are alloyed together. 
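
The electron/atom ratio can be illustrated with a short calculation. The following Python sketch assumes the conventional Hume-Rothery valence counts (Cu, Ag, Au = 1; Zn = 2; Al = 3; Sn = 4) and uses Cu-Zn compositions chosen for illustration; the function name and the example compositions are not taken from the text.

```python
from fractions import Fraction

# Conventional Hume-Rothery valence electron counts (an assumption of this sketch).
VALENCE = {"Cu": 1, "Ag": 1, "Au": 1, "Zn": 2, "Al": 3, "Sn": 4}

def electron_per_atom(composition):
    """Return e/a for a composition given as {element: number of atoms}."""
    atoms = sum(composition.values())
    electrons = sum(VALENCE[element] * count for element, count in composition.items())
    return Fraction(electrons, atoms)

# Illustrative Cu-Zn compositions (hypothetical choice of examples).
examples = {
    "CuZn": {"Cu": 1, "Zn": 1},
    "Cu5Zn8": {"Cu": 5, "Zn": 8},
    "CuZn3": {"Cu": 1, "Zn": 3},
}

for name, composition in examples.items():
    ratio = electron_per_atom(composition)
    print(f"{name}: e/a = {ratio} = {float(ratio):.3f}")
```

For these three compositions the sketch gives 3/2, 21/13, and 7/4, the values usually quoted for the electron compounds of this system.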

Much effort within physical metallurgy and alloy physics has of course been 
put into the study of plastic deformation. Great progress in this respect was the 
possibility of growing single crystals (J. Czochralski in 1917; P.W. Bridgman in 
1925), allowing the investigation of plasticity without any influence of grain 
boundaries and with a well-defined orientation. The fundamental studies of M. 
Polanyi, E. Schmid, and W. M. Boas were summarized in the famous book by 
Schmid and Boas in 1935 establishing the critical resolved shear stress as a fun- 
damental criterion for slip. In this macroscopic description of plasticity, a funda- 
mental problem remained unsolved: The experimentally observed yield stress of 
single crystals is several orders of magnitude lower than calculated from the the- 
oretical strength determined for the rigid glide of lattice planes. This led to the 
postulation of dislocations by M. Polanyi, E. Orowan, and G. I. Taylor (separately, 
in 1934), which later, after a further development of the electron microscope by E. 
Ruska (in 1931), was brilliantly confirmed by transmission electron microscopy 
(TEM) of thin crystals by Heidenreich (in 1949) and P. B. Hirsch (in 1954). 
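
The critical resolved shear stress criterion can be made concrete with Schmid's law, tau = sigma cos(phi) cos(lambda), where phi and lambda are the angles between the load axis and the slip-plane normal and slip direction, respectively. The sketch below is a minimal illustration; the fcc slip system (111)[-101], the [001] load axis, and the 100 MPa applied stress are assumed example values, not data from the text.

```python
import math

def schmid_factor(load, plane_normal, slip_direction):
    """Return cos(phi) * cos(lambda) for a uniaxial load on one slip system."""
    def cos_angle(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        return dot / (math.hypot(*u) * math.hypot(*v))
    return cos_angle(load, plane_normal) * cos_angle(load, slip_direction)

def resolved_shear_stress(applied_stress, load, plane_normal, slip_direction):
    """Schmid's law: tau = sigma * cos(phi) * cos(lambda)."""
    return applied_stress * schmid_factor(load, plane_normal, slip_direction)

# Assumed example: fcc crystal loaded along [001], slip system (111)[-101].
load_axis = (0, 0, 1)
normal = (1, 1, 1)
direction = (-1, 0, 1)

m = schmid_factor(load_axis, normal, direction)
tau = resolved_shear_stress(100.0, load_axis, normal, direction)  # hypothetical 100 MPa
print(f"Schmid factor: {m:.4f}")
print(f"Resolved shear stress: {tau:.1f} MPa")
```

Slip is expected on the system whose resolved shear stress first reaches the critical value, which is why the orientation of a single crystal matters in such experiments.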

J. Frenkel (in 1926) and C. Wagner, as well as W. Schottky (in 1930), postulated 
point defects, vacancies, and interstitials as very important for the physical prop- 
erties of metals and alloys. Vacancies turned out to be essential for diffusion, 
as was shown theoretically by H. B. Huntington and F. Seitz in 1942, later 
confirmed experimentally as fact by A. D. Smigelskas and E. O. Kirkendall (in 
1947), and critically analyzed by L. S. Darken (in 1948). The Kirkendall effect of 
marker movement during chemical diffusion of two components with different 
diffusion coefficients is generally accepted as evidence for the vacancy diffusion 
mechanism. 

The theory of phase transformations in alloys, originally dominated mainly by 
the nucleation and growth concept, was greatly developed by efforts to explain su- 
perconductivity (H. Kamerlingh-Onnes in 1911) and superfluidity (P. L. Kapitsa, 
J. F. Allen, and D. Misener in 1937). First a concept was developed on a phenom- 
enological level by L. D. Landau in 1938 to explain second-order phase transi- 
tions, but it applies to more general physical transition problems (see, e.g., Mel- 
bourne 2000). Another essential development reaches back to Gibbs and Volmer: 
spinodal decomposition, with the consequence of “uphill” diffusion. The prob- 
lem was described theoretically by J. W. Cahn and J. E. Hilliard in 1958 in a con- 
tinuum approach and by M. Hillert (in 1961; and also in his 1956 thesis) in an 
atomistic approach. Corroboration came from small-angle scattering experiments 
in the early stages of decomposing systems (see, e.g., Gerold and Kostorz (1978), 
for a first review on this matter). An atomistic approach for obtaining phase 
diagrams and information on the kinetics of phase transformations theoretically 
from statistical thermodynamics, the cluster variation method (CVM), was developed 
in 1950 by R. Kikuchi and was later extended to the path probability method 
(PPM) for describing transformation kinetics (in 1966). In 1949 N. Metropolis 
and S. Ulam first described the Monte Carlo method (MC) of random variations 
of the microstate; today it is one of the leading methods in alloy physics for 
studying structure and kinetics. 
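
The idea of random variations of the microstate can be sketched for a small binary alloy. The example below is only an illustration under stated assumptions: a two-dimensional square lattice with periodic boundaries, a nearest-neighbor pair interaction v (like pairs cost +v, unlike pairs -v, so v > 0 favors ordering), exchanges of neighboring atoms, and the Metropolis acceptance criterion, a standard choice that the text does not spell out. All names and parameter values are hypothetical.

```python
import math
import random

def total_energy(lattice, v):
    """Sum the nearest-neighbor pair energies, counting each bond once."""
    n = len(lattice)
    energy = 0.0
    for i in range(n):
        for j in range(n):
            s = lattice[i][j]
            # Right and lower neighbors only, with periodic boundaries.
            energy += v * s * (lattice[i][(j + 1) % n] + lattice[(i + 1) % n][j])
    return energy

def metropolis_exchange(lattice, v, kT, steps, rng):
    """Attempt `steps` nearest-neighbor atom exchanges; return the final energy."""
    n = len(lattice)
    energy = total_energy(lattice, v)
    for _ in range(steps):
        i, j = rng.randrange(n), rng.randrange(n)
        di, dj = rng.choice(((1, 0), (0, 1)))
        k, l = (i + di) % n, (j + dj) % n
        if lattice[i][j] == lattice[k][l]:
            continue  # exchanging identical atoms changes nothing
        lattice[i][j], lattice[k][l] = lattice[k][l], lattice[i][j]
        new_energy = total_energy(lattice, v)  # simple but O(N) per step
        delta = new_energy - energy
        if delta <= 0 or rng.random() < math.exp(-delta / kT):
            energy = new_energy  # accept the exchange
        else:
            lattice[i][j], lattice[k][l] = lattice[k][l], lattice[i][j]  # reject
    return energy

rng = random.Random(0)
n = 8
atoms = [1] * (n * n // 2) + [-1] * (n * n // 2)  # +1 = A atom, -1 = B atom
rng.shuffle(atoms)
lattice = [atoms[row * n:(row + 1) * n] for row in range(n)]

e_start = total_energy(lattice, v=1.0)
e_end = metropolis_exchange(lattice, v=1.0, kT=0.5, steps=5000, rng=rng)
print(f"energy before: {e_start}, after: {e_end}")
```

Exchanging atoms rather than flipping single sites keeps the alloy composition fixed. Recomputing the total energy at every step is chosen here for clarity; a practical code would evaluate only the local energy change.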

After C. J. Davisson and L. H. Germer in 1927 had observed the wave character 
of electrons in diffraction experiments on Ni crystals, it was only a small step to 
scattering experiments with neutrons, yielding phonon dispersion curves, diffuse 
scattering, and magnetic scattering with polarized neutrons. 

The density functional theory (DFT) by P. Hohenberg and W. Kohn (in 1964) 
and W. Kohn and L. J. Sham (in 1965) solved the fundamental problem of 
many-body interactions in a solid. It was therefore a starting point for a quantum 
mechanical ab-initio approach to realistic phase diagrams and materials parame- 
ters. The theory has been very successful in recent years at describing the ground- 
state properties of metals and alloys. Very recently the DFT was combined with 
statistical thermodynamics within the cluster expansion method (CEM) by J. M. 
Sanchez, F. Ducastelle, and D. Gratias (in 1984). When this is applied together 
with MC simulations, a quantitative study of alloy properties is also made possible 
with respect to the temperature dependence. 

The work of Cahn and Hilliard, and later S. M. Allen and J. W. Cahn (in 1979), 
laid the basis for a general modeling of multiphase systems, called phase field 
modeling (PFM), which has been used successfully to study solidification and 
phase transformation in alloys. 

Since the mid-1990s, electron microscopy as well as field ion microscopy, origi- 
nally invented by E. W. Müller in 1951, have made great advances. Methods for electron 
microscopy were developed to obtain atomic resolution (high-resolution TEM, or 
HRTEM) by special observation techniques and technical correction of aberra- 
tions. The original field ion microscope and its analytical version, the atom probe, 
have recently been upgraded to 3D tomography atom probe (A. Cerezo in 1988; 
D. Blavette in 1993): The field-evaporated part of a very fine tip is simultaneously 
imaged on a position-sensitive detector and chemically analyzed by a time-of- 
flight (TOF) measurement. If all data are carefully stored as a function of time, 
the sample can be computer-reconstructed on an atomic scale. The development 
of scanning probes for atomic-scale microscopy led to further extremely fruitful 
experimental tools: the scanning tunneling microscope (STM) invented by G. 
Binnig and H. Rohrer in 1982, and the atomic force microscope (AFM) by C. F. 
Quate, G. Binnig, and C. Gerber in 1985. Using a cantilever with magnetic coat- 
ing enables the AFM to be extended to the study of magnetic structures as the 
magnetic force microscope (MFM). 

Since their use in historical times, alloys have never left the front row of ad- 
vanced materials. Among the greatest challenges of alloy physics today are the 
following topics: 

• ductile, high-temperature, high-strength, structural materials 

• high-density, magnetic and magneto-optic, recording media 

• physical behavior with restricted dimensions 

• alloy behavior under extreme conditions 

• increasingly accurate computation of physical parameters, 
especially by ab-initio calculations 

• access to alloy kinetics by PPM, PFM, molecular dynamics 
(MD), and MC simulations supported by CEM. 


Having at hand elaborate experimental and theoretical methods together with the 
possibility of computer simulations, further developments in the field of ad- 
vanced alloys and alloy design will continue to contribute to the overall scientific 
and technological progress. 


1.3 
Atom Kinetics 


As we have seen already in the historical perspective, producing an alloy with spe- 
cific properties has always needed specific microstructural changes, such as take 
place in the carburizing process when forging iron in the flame to get steel or in 
the special segregation processes for the famous Damascene blades. It is abso- 
lutely clear that all diffusive structural changes in a crystal are ultimately brought 
about by atoms jumping between different lattice sites. If atoms are not able to 
move, no change at all is possible and the alloy remains frozen in the current 
state. For physical properties of an alloy to be meaningful they have to be mea- 
sured in a thermodynamic equilibrium state. The knowledge of atom jump pro- 
cesses is an essential prerequisite to decide if the system is in a true equilibrium 
state. Furthermore, detailed information on alloy kinetics is required for the de- 
sign and development of advanced alloys. 

Some examples will underline the importance of atom jumps for alloy physics. 
During the solidification process the homogeneity/heterogeneity/crystallinity of 
the solid alloy is almost completely determined by atomic mobility. Fast cooling 
of the melt, for instance, enables the production of metallic glasses with alloy 
properties far different from crystalline materials. 

Dislocation reactions essentially determining the mechanical properties are 
widely influenced by atom jump processes bringing about the motion of vacan- 
cies and solute atoms. 




Atom jumps, mainly into neighboring vacancies, are the underlying mecha- 
nism of nondisplacive phase transformations. Therefore, what really happens de- 
pends critically on the local atomic motion in an alloy (see, e.g., Soffa et al. 2003). 

Phase changes of the order-disorder type are of special interest due to their re- 
versibility. In this case atom jumps occur between different sublattices of the or- 
dered or at least partially ordered alloy. A fundamental understanding of the role 
of defects in intermetallic compounds can then be attempted, which will allow 
open questions on configurational kinetics of alloys to be answered. 

We have further seen that size reduction to obtain nanostructures is a current 
challenge in alloy physics. Essential changes in physical properties of alloys ac- 
company dimensional restrictions. These are due to the increase in surface/ 
volume ratio so that the atom jump processes run in a different manner than 
those in bulk alloys. 

The necessary information on atom jump processes may be obtained by a com- 
bination of experimental and theoretical methods, together with computer simu- 
lations (see, e.g., Pfeiler et al. 2004; Kozubski et al. 2004, 2006). 

Because of their importance and ubiquity in all microstructural changes, both 
desired and unwanted, it was decided to choose “atom jump processes” as a guid- 
ing principle and leitmotiv through this book. 


1.4 
The Structure of this Book 


When one looks at the Contents pages, the structure of this book (or let’s say the 
order parameter) will become apparent: from statics to dynamics and kinetics, 
from fundamental to complex structures, from phenomenology via theory and ex- 
periment to applications. 

This section gives a short overview of the chapters and their interconnections. 

Chapter 2, “Crystal Structure and Chemical Bonding,” is dedicated to a classi- 
fication of structure types and the relationship between structures and structure- 
determining parameters. A modern view of Hume-Rothery alloys and intermetal- 
lic compounds is given as well as of Laves phases and structurally more complex 
alloy systems such as magnetic materials of the CaCu5 type or quasicrystals. 

Alloy structure and structural defects depend of course on the solidification 
process. In Chapter 3, “Solidification and Grown-in Defects,” it is shown how 
diffusion, mainly on the liquid side, and convection processes close to the 
solid-liquid interface determine details of the resulting alloy structure. The generation 
of defects, formation of grain structure, twinning, and chemical segregation ef- 
fects are other topics in this chapter. 

Starting from a discussion on atomic binding and atomic interaction energies, 
a review is given in Chapter 4, “Lattice Statics and Lattice Dynamics,” on the elas- 
ticity of crystalline lattices, lattice vibrations, and thermal properties of alloys. Ex- 
amples are shown of soft phonon modes and their influence on structural phase 
transitions, and special emphasis is put on the connection of phonon spectra with 
atomic migration, allowing a determination of migration energies from lattice- 
dynamical properties. 

The importance of lattice defects for atomic migration plays a central role in 
Chapter 5, “Point Defects, Atom Jumps, and Diffusion.” The discussion starts 
with point defects and their energetics in alloys and continues with the different 
descriptions of single and multiple atom jump processes leading to atomic diffu- 
sion under various conditions. There is a fairly detailed discussion of diffusion in 
ordered intermetallics and of various theoretical approaches to obtain the macro- 
scopic diffusion behavior from atom kinetics. 

Plastic deformation of alloys due to dislocation processes is reviewed in detail 
in Chapter 6, “Dislocations and Mechanical Properties.” The mechanisms dis- 
cussed include thermally activated processes such as dislocation glide and climb, 
as well as hardening and recovery, complex deformation behavior like the famous 
yield strength anomaly and fatigue, the strength of nanocrystalline alloys and 
thin layers, and the mechanical properties of quasi-crystals. 

The mechanisms of phase transformations in alloys are reviewed in Chapter 7, 
“Phase Equilibria and Phase Transformation.” After a solid introduction on phase 
diagrams, phase transformation kinetics is described, and how it can be sup- 
pressed to get amorphous alloys is discussed. The difference between nucleation 
and growth on the one hand, and spinodal decomposition on the other, is dis- 
cussed for the cases of phase separations as well as for order-disorder transfor- 
mations. Martensitic transformations, which are massive and displacive, are pre- 
sented at the end of this important chapter together with some applications. 

Chapter 8, “Relaxation of Nonequilibrium Alloys,” first describes the relaxation 
of an alloy from a nonequilibrium state to an equilibrium structure by 
diffusion-controlled, thermally activated processes. It is shown that a realistic 
kinetic pathway can be found by linking the classical nucleation theory to the de- 
tails of atomic jumps. Secondly, driven alloys are considered, for which an exter- 
nal forcing maintains the alloy in nonequilibrium states. The theory and model- 
ing of such alloys rest on the combination of two kinetics: that resulting from the 
external forcing and that of natural relaxation to equilibrium. 

In Chapter 9, “Change of Alloy Properties under Dimensional Restrictions,” 
detailed studies of phase equilibrium and phase transformations of nanometer- 
sized alloy particles are presented and a discussion on atom movement in nano- 
particles is given. It is shown that a stable amorphous phase can form in a eutec- 
tic system for a small enough particle size (below 10 nm), which promises an 
insight into the liquid-glass transition. 

The theoretical basis of Chapter 10, “Statistical Thermodynamics and Model 
Calculations,” is the cluster variation method (CVM). The statistical thermody- 
namics on a discrete lattice and for continuous media are reviewed and it is 
shown that by combining the CVM with calculations of the total energy of the 
electronic structure, self-consistent first-principles calculations of phase equilibria 
are made possible. Results of calculations using CVM and PPM (the path proba- 
bility method) are presented and a possible approach toward first-principles phase 
field calculations is discussed. 
Chapter 11, “Ab-initio Methods and Applications,” is dedicated to a review of 
the present state of density functional theory. After an account of the theoretical 
background, the different computational methods and approaches are discussed 
critically and several examples of applications are given, such as elastic and vibra- 
tional properties, point defects and diffusion, the search for ground-state struc- 
tures, and ordering phenomena. It is shown that the very successful cluster ex- 
pansion method yields access to multiscale modeling. 

In Chapter 12, “Simulation Techniques,” advantages and disadvantages of mo- 
lecular dynamics simulations are discussed first and some interesting results are 
given. Then various Monte Carlo (MC) simulation techniques, including kinetic 
MC, are described, and their possibilities and limits are shown in several exam- 
ples. In the final part of the chapter, phase field models are explained, starting 
from their principles. These models are used to describe, e.g., the precipitation 
of a new phase, grain growth in polycrystals, and solidification. 

Two chapters on experimental methods focus on high-resolution techniques: 
Chapters 13.1, “High-Resolution Scattering Methods and Time-Resolved Diffrac- 
tion,” and 13.2, “High-Resolution Microscopy.” To study nonequilibrium alloys 
or equilibrium dynamics a high temporal experimental resolution is of great in- 
terest, which can be met under certain conditions, especially using the third 
generation of synchrotron sources. In Chapter 13.1, examples are given of high- 
resolution X-ray and neutron scattering methods, which include magnetic scatter- 
ing, coherent time-resolved scattering, elastic, quasielastic, and inelastic scatter- 
ing as well as diffuse scattering and scattering from surfaces. 

In Chapter 13.2, after a short introduction to the field of surface analysis by 
scanning probe microscopy, the basic principles and practical aspects of HRTEM 
and related techniques are provided. This is followed by several examples, partic- 
ularly emphasizing the use of in-situ HRTEM. The second part of this chapter fo- 
cuses on up-to-date atom probe tomography, giving a detailed description of the 
method. Exemplary studies are described on decomposition, diffusion in nano- 
crystalline thin films, and grain boundary diffusion. 

Chapter 14, “Materials and Process Design,” is dedicated to various applica- 
tions of alloys, and tries to convey a sound understanding of the corresponding 
effects and their up-to-date technical uses. These include soft and hard magnetic 
materials, invar alloys, magnetic media for the use in hard disk memories, spin- 
tronic materials showing giant magnetoresistance, and phase change media 
which use the phase transformation from crystalline to amorphous state for data 
storage. Last but not least the story of superconductors is related. 

It is clear that the chapters summarized above have multiple connections; 
some important concepts are discussed in several chapters from different aspects. 
An example is the formation of a new phase from a supersaturated matrix. This 
process, which is fundamental for alloys, is discussed from the viewpoint of atom 
jumps mediating the transformation process in Chapter 5, a description starting 
from the basics is presented in Chapter 7, a prospective theoretical modeling of 
the kinetic process is formulated in Chapter 8, changes in the transformation be- 
havior under the dimensional restrictions of nanometer-sized particles are illus- 
trated in Chapter 9, statistical models of phase transitions are presented in Chap- 
ter 10, the advantage of DFT together with the cluster expansion method and ki- 
netic MC simulations is documented in Chapter 11, the application of phase field 
models is discussed in Chapter 12, the experimental access to details of phase 
transformations by high-resolution scattering techniques as well as by TEM and 
atom probe tomography is presented in Chapters 13.1 and 13.2, respectively, and 
finally, possibilities of technical applications, e.g., of the phase change media for 
data storage in CDs and DVDs, are described in Chapter 14. Table 1.1 shows how 
important topics figure in different chapters. It is our explicit intention to present 
these various aspects, instead of conscientiously avoiding any overlap. We hope 
that this many-sided approach makes this book even more attractive. 

Table 1.1 Key to the topics in this book. 

In the course of this introductory chapter, we have tried to demonstrate that 
alloys still have to be counted among the most important materials of the 21st 
century, especially when considering the ongoing shift in interest from bulk ma- 
terials to nanostructures. It goes without saying that a thorough understanding of 
the physics of alloys is a precondition for anyone who wants to do serious re- 
search or to engage in technical application of alloys. The present book covers in 
much detail and considerable depth nearly all fields of alloy physics necessary to 
know when entering one of these areas either as a graduate student or post-doc. 
It should be valuable for scientists of neighboring fields or somebody already 
working in an area of alloy physics who wants more information in one of the 
specific directions covered by this compendium. 
Crystal Structure and Chemical Bonding 


Yuri Grin, Ulrich Schwarz, and Walter Steurer 


2.1 
Introduction 


For the following systematic considerations we define alloys as a class of mate- 
rials which may be multiphase systems. The components of these systems are bi- 
nary or multinary phases, e.g., intermetallic compounds. Hence, the relevant 
chemical and physical properties of alloys can be described in a slightly simplified 
manner as being dependent on the properties of each of the constituents and on 
the interactions between these constituents at the phase boundaries. 

Intermetallic compounds (phases) show physical and chemical properties which 
are often interesting for applications, so they frequently serve as important com- 
ponents for materials design. Due to the strongly application-governed develop- 
ment of knowledge on intermetallic compounds, understanding of their chemical 
nature is still incomplete. Important questions cannot be answered definitely: 
Why do some intermetallic compounds only form at distinct compositions? Why 
do others form homogeneity ranges? Why are certain structural motifs stable and 
can be found frequently among intermetallic phases? Why do others appear only 
occasionally? One of the reasons for this situation is the insufficient knowledge 
about chemical bonding in intermetallic compounds. Additionally, attempts to 
find a direct causal link between properties and chemical bonding for this group 
of inorganic substances often lead to the conclusion that there is a lack of reli- 
able information on both bonding and properties, and, consequently, on their re- 
lationship. The nature of chemical bonding in intermetallic compounds, in par- 
ticular, is still under debate (Nesper 1991; Grin 2000). 

In this chapter, we discuss the most common crystal structure motifs of inter- 
metallic compounds, factors which are suggested to govern formation and prop- 
erties, and different approaches to understanding the atomic interactions in inter- 
metallic phases. 
2.2 
Factors Governing Formation, Composition and Crystal Structure of Intermetallic 
Phases 


Compound formation, spatial arrangement of atoms, and bond constitution are 
influenced by factors which can be traced back to parameters such as the aver- 
age number of valence electrons per atom (valence electron concentration), the 
electronegativity of the constituents (electrochemical factor), and atomic size. 
With the additional condition that the two metals realize the same crystal struc- 
ture, the same factors have been used as parameters controlling solid solubility 
(Hume-Rothery and Raynor 1938). As additional parameters for compound for- 
mation, measures of the cohesive energy such as boiling point, compressibility, 
or lattice energy are employed, but these parameters have no apparent relevance 
to the realization of certain atomic arrangements. 

For crystal chemistry considerations, sets of self-consistent atomic radii were 
introduced which are dependent on the realized coordination number due to 
repulsive interactions between adjacent ions (Goldschmidt 1926). Moreover, in 
order to take into account different types of chemical bonding there are sets of 
metallic, covalent, and ionic radii (Teatum et al. 1960; Emsley 1991). Despite 
these drawbacks, the correlation of radius ratios and coordination number results 
in a crude separation of different structure types. 

Fig. 2.1 Number of known Laves phases as a function of the radius 
ratio of the constituting metals. 

Figure 2.1 shows an example of a mapping using the radius ratio as the only 
structure-determining factor for the Laves phases AB2 (Stein et al. 2004). From 
the geometrical analysis (cf. Section 3.2.4), it was expected that the ideal size ratio 
for this structural motif is 1.225:1. In reality, only a few compounds form with this 
size ratio. Moreover, maximal numbers of the phases are obtained at rA/rB = 1.15 
and 1.33. Obviously, an individual determining factor is not sufficient to predict 
the realization of a distinct crystal structure. Taking this shortcoming into ac- 
count, a later approach analyzed the dependence of the stabilization of Laves 
phases on the heteronuclear A-B and homonuclear B-B next-neighbor distances 
(Simon 1983). Similarly, later classifications of atomic arrangements employed 
several factors, a procedure which is known as structure mapping. 
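
The ideal size ratio of 1.225:1 can be recovered from a hard-sphere argument. The short derivation below is a sketch that assumes the cubic MgCu2-type Laves structure with lattice parameter a, in which like atoms touch each other; the choice of this structure type and the symbols d_AA and d_BB are assumptions of the illustration, not taken from the text.

```latex
d_{AA} = \frac{\sqrt{3}}{4}\,a = 2 r_A, \qquad
d_{BB} = \frac{\sqrt{2}}{4}\,a = 2 r_B
\quad\Longrightarrow\quad
\frac{r_A}{r_B} = \frac{d_{AA}}{d_{BB}} = \sqrt{\frac{3}{2}} \approx 1.225
```

Compounds whose radius ratios cluster near 1.15 and 1.33 therefore deviate noticeably from this purely geometric ideal.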


2.2.1 
Mappings of Crystal Structure Types 


The influence of a combination of two or more factors on the stability of certain 
atomic patterns can be studied in a systematic way by employing so-called struc- 
ture maps. Here, normally factors such as valence electron concentration, electro- 
negativity, or atomic size are used as coordinates for multidimensional diagrams 
of crystal structure types. 

Fig. 2.2 Mooser and Pearson representation of binary valence 
compounds with composition AB and eight valence electrons, as a 
function of average quantum number n̄ and electronegativity difference 
Δχ. Open symbols indicate ZnS or ZnO types of crystal structures 
(CN 4), filled symbols NaCl arrangements (CN 6). 

An early mapping of the atomic arrangements of valence compounds (Mooser 
and Pearson 1959) employed the difference in electronegativity and the averaged 
value of the principal quantum number of the constituents as coordinates (Fig. 2.2). 
Small values of the parameters favor covalent bonding and a low coordination 
number (e.g., four in zincblende and wurtzite-type arrangements), whereas large 
values correspond to ionic bonding and higher coordination numbers (e.g., six in 
sodium chloride-type arrangements or eight in CsCl-type structures). Conse- 
quently, the graphical representation revealed a satisfactory separation of the sta- 
bility fields of binary AB compounds with eight valence electrons and coordina- 
tion numbers of four and six (Fig. 2.2). However, different atomic patterns of 
intermetallic non-octet compounds are often located within narrow parameter do- 
mains and are not well separated into different domains by these coordinates 
(Zunger 1980). 
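
How a compound is placed on such a map can be sketched in a few lines. The example assumes rounded Pauling electronegativities and takes the principal quantum number of the valence shell as the period of the element; the scale actually used in the original mapping may differ, and the function name and selection of compounds are hypothetical.

```python
# Principal quantum number of the valence shell (period of the element).
VALENCE_SHELL = {"Na": 3, "Mg": 3, "Zn": 4, "Ga": 4,
                 "O": 2, "S": 3, "Cl": 3, "As": 4}

# Rounded Pauling electronegativities (assumed scale for this sketch).
ELECTRONEGATIVITY = {"Na": 0.93, "Mg": 1.31, "Zn": 1.65, "Ga": 1.81,
                     "O": 3.44, "S": 2.58, "Cl": 3.16, "As": 2.18}

def mooser_pearson_coordinates(a, b):
    """Return (average principal quantum number, electronegativity difference)."""
    n_bar = (VALENCE_SHELL[a] + VALENCE_SHELL[b]) / 2
    delta_chi = abs(ELECTRONEGATIVITY[a] - ELECTRONEGATIVITY[b])
    return n_bar, delta_chi

# Illustrative eight-electron AB compounds.
for a, b in (("Ga", "As"), ("Zn", "S"), ("Mg", "O"), ("Na", "Cl")):
    n_bar, delta_chi = mooser_pearson_coordinates(a, b)
    print(f"{a}{b}: n_bar = {n_bar:.1f}, delta_chi = {delta_chi:.2f}")
```

With these inputs the fourfold-coordinated examples (GaAs, ZnS) come out at smaller electronegativity differences than the sixfold-coordinated ones (MgO, NaCl), in line with the trend described above.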

On the basis of pseudopotential calculations, the average band gap Eh and the 
ionicity C of group 14 elements and binary semiconductors with four valence 
electrons per atom were calculated (Phillips 1968; van Vechten 1969a,b; Phillips 
1970). A plot using the square roots of the homopolar and ionic fractions of 
the band gap, respectively, results in a clear separation of fourfold and sixfold 
coordination. The positions of the stability fields reveal that CN 4 is more 
stable for dominantly covalent compounds and CN 6 for more ionic binaries 
(Fig. 2.3). 
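
For orientation, the decomposition underlying this plot can be written out. The relations below follow the usual form of the Phillips and van Vechten dielectric description; the symbols E_g (total average gap) and f_i (fractional ionicity) are introduced here for illustration and do not appear in the text.

```latex
E_g^2 = E_h^2 + C^2, \qquad
f_i = \frac{C^2}{E_h^2 + C^2}
```

In this picture a straight line through the origin of the (E_h, C) plane corresponds to constant f_i; in Phillips's analysis the boundary between fourfold and sixfold coordination lies near f_i ≈ 0.785.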

In another approach, two-dimensional structure maps of AB compounds were 
constructed on the basis of calculated pseudopotential radii and two coordinates 
[Eqs. (1, 2)] derived thereof (St. John and Bloch 1974; Zunger 1980). 

Fig. 2.3 Stability fields of tetrahedral and octahedral coordination 
in binary valence compounds as a function of the homopolar band 
gap Eh and the ionicity C. Open symbols indicate CN 4, filled symbols 
CN 6. 

Fig. 2.4 Stability fields of selected crystal structures of valence 
compounds as a function of calculated pseudopotential radii (see text). 
Open circles: diamond-type structure (CN 4); open diamonds: ZnS type 
(CN 4); open squares: ZnO type (wurtzite, CN 4); filled dots: NaCl type 
(CN 6); crosses: CsCl type (CN 8). 





$R_\sigma^{AB} = (r_p^A + r_s^A) - (r_p^B + r_s^B)$    (1)

$R_\pi^{AB} = (r_p^A - r_s^A) + (r_p^B - r_s^B)$    (2)





The sums are measures of the atomic radii of the constituents and the differences 
are a measure of the promotion energy from s to p states. The coordinate 
$R_\sigma^{AB}$ represents a size difference and the value of $R_\pi^{AB}$ the 
tendency for sp hybridization. The resulting separation of stability fields of 
structure types (Fig. 2.4) works for 437 non-octet compounds adopting 27 different 
crystal structures, but other arrangements with similar atomic coordination are 
not resolved, e.g., the NiAs and MnP types. 
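
The two map coordinates described above can be sketched in a few lines of code. This is only an illustration under stated assumptions: the size coordinate is taken as the difference of the (r_p + r_s) sums of the two atoms and the hybridization coordinate as the sum of their (r_p - r_s) differences, and the radii used are hypothetical numbers chosen for easy arithmetic, not tabulated pseudopotential radii.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OrbitalRadii:
    """s and p orbital radii of one atom (any consistent length unit)."""
    r_s: float
    r_p: float


def structure_map_coordinates(a: OrbitalRadii, b: OrbitalRadii) -> tuple[float, float]:
    """Return (R_sigma, R_pi) for a binary AB compound.

    R_sigma: difference of the (r_p + r_s) sums, a measure of size mismatch.
    R_pi:    sum of the (r_p - r_s) differences, a measure of s-p promotion.
    """
    r_sigma = (a.r_p + a.r_s) - (b.r_p + b.r_s)
    r_pi = (a.r_p - a.r_s) + (b.r_p - b.r_s)
    return r_sigma, r_pi


# Hypothetical radii, for illustration only (not literature values).
atom_a = OrbitalRadii(r_s=1.0, r_p=1.5)
atom_b = OrbitalRadii(r_s=0.5, r_p=0.75)

r_sigma, r_pi = structure_map_coordinates(atom_a, atom_b)
print(f"R_sigma = {r_sigma}, R_pi = {r_pi}")
```

Each AB compound then becomes one point (R_sigma, R_pi) in the plane, and a structure map is obtained by marking every point with the observed structure type.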

For a more efficient separation, three-dimensional mappings of crystal structure 
types have been accomplished (Villars 1983, 1984a,b). For different compositions, 
stability fields are shown in cross-sections sorted for certain ranges of the 
number of valence electrons (defined as electron per atom ratio), and the crystal 
structures of compounds are displayed using the differences of orbital 
electronegativities $\Delta\chi_{AB}$ and orbital radii $\Delta(r_s + r_p)_{AB}$ as 
coordinates (Fig. 2.5). Although a reasonable separation is achieved in general, a 
shortcoming of the mapping is that the NiAs type had to be excluded and that some 
stability fields contain more than one structure type. The incomplete separation is 
taken as evidence for the importance of the angular character of the valence 
orbitals, which is not included in Villars’ maps. Moreover, the sorting does not 
separate elemental structures ($\Delta\chi_{AB} = 0$ and $\Delta(r_s + r_p)_{AB} = 0$) 
with the same number of electrons but different crystal structures, e.g., those of 
sulfur, selenium, and chromium with six valence electrons. 

Fig. 2.5 Stability fields of selected structure types of compounds with 
composition AB and a valence electron concentration of four, as a 
function of orbital radii and electronegativity difference $\Delta\chi_{AB}$. Open 
diamonds: ZnS type (CN 4); open squares: ZnO type (wurtzite, CN 4); 
filled dots: NaCl type (CN 6). Stability fields of FeB- and CrB-type 
arrangements are indicated. 

In an attempt to overcome the shortcomings of structure mapping which are 
due to the neglect of the angular dependence, i.e., the s, p or d character of the 
involved electrons, Pettifor introduced a phenomenological coordinate in order 
to group structural data within a simple and coherent framework. By running a 
one-dimensional string through the two-dimensional periodic system of ele- 
ments, a so-called Mendeleev number (a combination of the factors atomic num- 
ber and valence electron configuration) is defined (Pettifor 1990). Plotting the sta- 
ble atomic arrangements at a given composition as a function of the Mendeleev 
number of the constituents provides a clear separation of structure types (Fig. 
2.6). Moreover, the implicit use of the electronic configuration as a coordinate 
also carries predictive power for targeted synthesis and modification of properties 
by partial substitution. With regard to regions where different arrangements com- 
pete, the diagrams provide optimized test structures for total energy calculations. 
Although the Pettifor mapping is widely accepted as optimal, even this type of 
structure plot does not reveal a clear and easy separation of some structure types. 

Fig. 2.6 Stability fields of the AB crystal structure types as a function of 
the Mendeleev number. Open diamonds: ZnS type (CN 4); open squares: 
ZnO type (wurtzite, CN 4); filled dots: NaCl type (CN 6); crosses: CsCl 
type (CN 8). Stars and open fields indicate other structure types. 


2.3 
Models of Chemical Bonding in Intermetallic Phases 


One of the main reasons for the problems associated with defining structural 
fields in different mapping systems is that the parameters involved describe 
only particular or even singular aspects of atomic interactions. The complete 
picture of chemical bonding in intermetallic phases is not developed to the same 
extent as it is for other classes of inorganic or organic compounds. 
Nevertheless, several successful models have been proposed for the description of 
particular groups of intermetallic compounds, and these are the subject of this chapter. 


2.3.1 
Models Based on the Valence (or Total) Electron Numbers 


An early, qualitative systematization of bond types (the so-called van Arkel- 
Ketelaar triangle; van Arkel 1956; Ketelaar 1958) included ionic, covalent, and me- 
tallic interactions. The van der Waals force was not considered as a physical inter- 
action although a large number of crystal structures comprising low-dimensional 
building units provide evidence of the importance of this type of interaction for 
the cohesion of condensed phases. 

Based on early concepts of chemical bonding and the octet rule (Abegg 1904; 
Drude 1904), the number of valence electrons turned out to be the essential 
quantity in elements and compounds with covalent bonding (Lewis 1916) and in the 
configuration of ionic solids (Kossel 1916). A simple and semi-quantitative 
description of homodesmic ionic bonding in strongly polar crystalline solids 
assumes a full charge transfer from the electropositive component, usually a main 
group metal, to the electronegative constituent, typically a halogen or chalcogen. 
A model of the different coordination in compounds constituted by spherically 
symmetric closed-shell ions was elaborated (Pauling 1960). The compositions are 
balanced so that the resulting ions can adopt a noble gas configuration. The re- 
sulting cohesion is based on nondirected Coulomb (electrostatic) interactions 
between charged particles which give rise to attractive and repulsive forces in a 
three-dimensionally periodic arrangement of ions. The dimensionless Madelung 
factor sums up attractive and repulsive forces of the specific spatial arrangement 
of ions. The potential energy in equilibrium conditions is a function of the ionic 
charge, the equilibrium distance, and the spatial arrangement. An additional 
characteristic repulsion due to the interaction of the electron shells (Pauli exclu- 
sion principle) is taken into account by a correction term which depends on the 
electron configuration of the ions. Typically, the calculated full lattice energy is 
about 10% smaller than the Coulomb energy alone. The concept is very useful 
for an understanding of the atomic coordination on a basis of ionic interactions 
(Hoppe 1970, 1979). One approximation inherent in this approach is the assignment 
of definite radii to atoms or ions, although their wavefunctions extend to 
infinity. Moreover, in accordance with earlier experimental data, elaborate 
quantum mechanical treatments reveal additional contributions to cohesion besides 
the ionic ones, e.g., covalent bonding and static dipole interactions. 
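
The ionic lattice energy described above, a Madelung (Coulomb) term reduced by a short-range repulsion correction, can be sketched numerically. The source does not name a specific formula; the sketch below assumes the common textbook Born-Landé form, in which the repulsion lowers the magnitude of the Coulomb energy by a factor (1 - 1/n). The NaCl inputs (Madelung constant 1.74756, nearest-neighbor distance 2.82 Å, Born exponent n = 8) are textbook values brought in for illustration, not data from this chapter.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge, C
EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m
N_AVOGADRO = 6.02214076e23   # 1/mol


def coulomb_lattice_energy(madelung: float, z_plus: int, z_minus: int,
                           r0_m: float) -> float:
    """Pure Coulomb (Madelung) energy in kJ/mol; negative means bound.

    z_plus and z_minus are the magnitudes of the ionic charges,
    r0_m is the nearest-neighbor cation-anion distance in meters.
    """
    per_pair = -madelung * z_plus * z_minus * E_CHARGE**2 / (4.0 * math.pi * EPS0 * r0_m)
    return per_pair * N_AVOGADRO / 1000.0


def born_lande_lattice_energy(madelung: float, z_plus: int, z_minus: int,
                              r0_m: float, born_exponent: float) -> float:
    """Coulomb energy corrected for closed-shell repulsion (assumed Born-Lande form)."""
    coulomb = coulomb_lattice_energy(madelung, z_plus, z_minus, r0_m)
    return coulomb * (1.0 - 1.0 / born_exponent)


# Textbook inputs for NaCl (assumptions of this sketch).
coulomb_nacl = coulomb_lattice_energy(1.74756, 1, 1, 2.82e-10)
total_nacl = born_lande_lattice_energy(1.74756, 1, 1, 2.82e-10, born_exponent=8.0)

print(f"Coulomb term : {coulomb_nacl:8.1f} kJ/mol")
print(f"with repulsion: {total_nacl:8.1f} kJ/mol")
print(f"reduction    : {100.0 * (1.0 - total_nacl / coulomb_nacl):.1f} %")
```

In this form the relative reduction is simply 1/n, so Born exponents in the usual range of about 8 to 10 give a lattice energy roughly 10% smaller in magnitude than the Coulomb term alone.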

In contrast to the undirected forces between ions in a solid, covalent bonds are 
directional between distinct neighboring atoms. The case of two-center 
two-electron bonds can easily be visualized by Lewis formulae. In the majority of 
compounds formed by main group elements, atoms in molecules or solids fulfill the 
octet rule. 

A severe conceptual problem of discussing metallic bonding is that the terms 
“covalent” or “ionic” only describe types of chemical interaction, while the term 
“metallic” can characterize either a type of bonding or a kind of electronic trans- 
port behavior, i.e., a physical property. Thus, metallic conductivity is generated by 
the presence of partially filled bands and is not strictly correlated with a certain 
bonding mechanism. An example of this situation in chemical elements is the 
carbon modification graphite, which combines covalent bonding with a 2D 
metal-type conductivity. A different aspect of the problem is highlighted by the bi- 
nary ionic compound CsI, which is characterized by filled p and empty d states 
and, thus, semiconducting properties at ambient pressure. Upon compression 
the p and d states start to overlap and partly filled bands result. As a consequence, 
a pressure-induced insulator-metal transition is observed (Asaumi and Kondo 
1981; Reichlin et al. 1986). 

In a metal, the diffuse wavefunctions of the constituting atoms in some cases 
enable a description of the chemical bonding by the model of a free-electron gas. 
In contrast to the local organization concepts for valence compounds, this model 
for metallic solids describes valence electrons as negatively charged particles 
interacting in a field constituted by a regular arrangement of positively charged 
atom cores. The resulting band dispersions (widths) are large in comparison 
with the band splittings (gaps) at special points of the Brillouin zone and partially 
filled bands result. Experimental and theoretical investigations reveal that, to a 
good approximation, a nearly unstructured electron gas with a square-root shaped 
density of states (DOS) is found for elemental sodium (Fig. 2.7). With increasing 
atomic weight, a growing contribution of d states brings about a structuring of 
the DOS. Similarly, enhancing the occupation of p states by proceeding from so- 
dium via magnesium to aluminum gives rise to a subtle structuring and a clearly 
increasing dispersion of the bands evidences the intensification of next-neighbor 
interactions (Demchyna et al. 2006). 
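
The square-root density of states of an ideal free-electron gas, which the text uses as the reference for sodium, can be evaluated directly. The sketch below assumes one valence electron per Na atom, a bcc cell with two atoms, and an approximate lattice parameter of 4.23 Å; these inputs and all function names are assumptions of the example, not values from the source.

```python
import math

HBAR = 1.054571817e-34    # reduced Planck constant, J s
M_E = 9.1093837015e-31    # electron mass, kg
EV = 1.602176634e-19      # joule per electronvolt


def fermi_energy_ev(electron_density_m3: float) -> float:
    """Free-electron Fermi energy E_F = hbar^2 (3 pi^2 n)^(2/3) / (2 m), in eV."""
    k_f = (3.0 * math.pi**2 * electron_density_m3) ** (1.0 / 3.0)
    return HBAR**2 * k_f**2 / (2.0 * M_E) / EV


def dos_per_atom(energy_ev: float, e_fermi_ev: float,
                 electrons_per_atom: float = 1.0) -> float:
    """Square-root DOS in states per eV and atom.

    Normalized so that the integral from 0 to E_F equals the number of
    valence electrons per atom.
    """
    if energy_ev <= 0.0:
        return 0.0
    return 1.5 * electrons_per_atom / e_fermi_ev * math.sqrt(energy_ev / e_fermi_ev)


# bcc sodium: two atoms, hence two valence electrons, per cubic cell.
A_NA = 4.23e-10                      # m, approximate lattice parameter (assumed)
n_na = 2.0 / A_NA**3                 # electrons per cubic meter
e_f = fermi_energy_ev(n_na)

print(f"E_F(Na)    ~ {e_f:.2f} eV")
print(f"DOS at E_F ~ {dos_per_atom(e_f, e_f):.3f} states/(eV atom)")
```

Deviations of a calculated DOS from this smooth square-root curve are what the text refers to as "structuring" by d or p states.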

With regard to electron count, the description of the bonding situation becomes 
even more complicated when we analyze band structures of transition metals and 
their compounds which are normally characterized by broad s and narrow d 
states (Burdett 1995). Here, typically one electron per metal atom is located in a 
band (or bands) constituted by predominantly s orbitals. As a consequence, tran- 
sition metals exhibit a maximum of the cohesion energy in the region of six va- 
lence electrons because one s and five d electrons are located in bonding states. In 
late transition metals like Fe, Ni or Co, the interaction of completely filled, non- 
or anti-bonding d orbitals influences the structural arrangement critically. 

Several binary intermetallic systems exhibit a similar sequence (cf. Section 2.4) 
of crystal structures as observed in the binary system Cu-Zn. The similarity of 
behavior is attributed to the systematically varying valence electron concentration, 
which changes proportionally to the composition and stabilizes certain atomic ar- 
rangements under the condition that the constituting atoms have only small size 
differences. Mixing of metals with similar radii and electronegativity, such as cop- 
per and zinc, does not require a significant amount of elastic energy, and the 
phase exhibits a crystal structure which would be realized by a hypothetical pure 
element with the same (average) electron configuration. For example, in the 
γ-brass phase, the excess charge of zinc with respect to copper is screened and the 
electron densities of the different types of atoms are adjusted. A similar process 
takes place for the core states so that the electron wavefunction is only slightly 
modified by the substitution. In these so-called electron compounds or Hume- 
Rothery phases (frequently known as alloys components), the dominant factor 
for the phase stability is the number of electrons in the Brillouin zone so that cer- 
tain magic electron numbers correspond to maxima in stability. Small deviations 
of the optimal filling can be tolerated, with the consequence that a certain range 
of slightly varying compositions also fulfill the stability criteria. Well-known 
examples of electron compounds are the α phases (fcc crystal structures with 
approximately one electron per atom), the β phases (bcc-type crystal structures 
with 1.5 electrons per atom), the γ phases (a defect variety of bcc phases with 52 
atoms in the unit cell and an electron concentration of 21/13 electrons per atom), 
and the ε phases (hcp arrangements with 7/4 electrons per atom). 

Fig. 2.7 Density of states of selected (a) alkali metals and (b) third row 
elements. For the heavy alkali metals, the DOS close to the Fermi level 
is structured by the presence of d states; subtle deviations for Mg and 
Al are attributed to the occupation of p states. 
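
The valence electron concentration that governs the Hume-Rothery phases is a simple weighted average, and a short sketch makes the characteristic ratios explicit. The valences used (Cu = 1, Zn = 2, Al = 3) are the conventional Hume-Rothery assignments, and CuZn and CuZn3 are included as textbook stand-ins for the β and ε phases; both choices are assumptions of this example rather than statements from the source.

```python
from fractions import Fraction

# Conventional Hume-Rothery valences (assumed for this sketch).
VALENCE = {"Cu": 1, "Zn": 2, "Al": 3}


def electrons_per_atom(composition: dict[str, int]) -> Fraction:
    """Valence electron concentration e/a for a formula unit, as an exact fraction."""
    electrons = sum(VALENCE[element] * count for element, count in composition.items())
    atoms = sum(composition.values())
    return Fraction(electrons, atoms)


phases = {
    "CuZn   (beta)": {"Cu": 1, "Zn": 1},
    "Cu5Zn8 (gamma)": {"Cu": 5, "Zn": 8},
    "Cu9Al4 (gamma)": {"Cu": 9, "Al": 4},
    "CuZn3  (epsilon)": {"Cu": 1, "Zn": 3},
}

for name, composition in phases.items():
    ratio = electrons_per_atom(composition)
    print(f"{name:18s} e/a = {ratio} = {float(ratio):.3f}")
```

Cu5Zn8 and Cu9Al4 differ in composition and in the partner element, yet both give the same electron concentration, which is the point of classifying them as one family of electron compounds.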


Assuming that in these electron phases the energy difference is given entirely 
by the difference in band structure energies and that the electron-electron inter- 
actions can be neglected, the effect of substitution can be described in the rigid- 
band approximation as a change in the number of occupied orbitals but not the 
band structure. Thus, a determination of the stability of atomic patterns requires 
a calculation of reasonable densities of states and band structure energies at 
certain electron concentrations. 

Early attempts to explain the occurrence of certain structural patterns at 
specific electron concentrations (Mott and Jones 1936) attribute the stabilization to a 
drop in the density of states after a pronounced maximum. According to this 
idea, this abrupt decrease is located at an energy where the free-electron sphere 
expands beyond the boundary of the Brillouin zone. Addition of electrons after 
this point causes an energy cost which is higher than that for arrangements 
where the density of states continues to rise. An advantage of this model is the 
accurate prediction of phase changes, but it has the shortcoming that the Fermi 
surface of the metal is approximated as a sphere. 

In a later attempt (Jones 1937), the energy bands of the intermetallic compounds 
were assumed to be free-electron-like except when the wave vector approaches 
Bragg reflecting planes (nearly-free-electron model). This scenario has the severe 
disadvantage that the drop in the density of states is located at concentrations 
far below the observed structure changes. 

More elaborate quantum mechanical calculations reveal that the free-electron 
model (Mott and Jones 1936) works surprisingly well. It was shown in detail that 
peaks in the DOS tend to stabilize a phase, but the maximum stability is reached 
at a significantly higher electron concentration (Paxton et al. 1997). The density 
of states is not the band structure energy itself but is related to its second 
derivative. Integration of the function (calculating the energy by filling up 
electronic levels) shifts its extrema to higher values, and integrating twice 
exchanges the positions of minima and maxima. Thus, a Fermi energy which is 
located in a dip in the density of states often leads to a maximum in stability. 
It was shown that band structure calculations taking into account sp bands 
exclusively reproduce the results of Mott and Jones, but those including d bands 
also correctly predict the structure of copper metal (one valence electron per 
atom) and confirm roughly the correctness of the earlier models. Thus, structural 
stability is largely determined by the density of states, e.g., the stability of 
γ-brass-type crystal structures is due to a lowering of the DOS around an e/a of 
1.7:1. Additionally, the structural distortion of the atomic planes from bcc gives 
rise to extra scattering of the electrons. 
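
The remark above that the density of states is related to the second derivative of the band energy can be made explicit. The following is a sketch in notation introduced here, not taken from the source: $n(E)$ is the DOS, $N(E_F)$ the number of electrons below the Fermi energy, and $U$ the band energy measured relative to $E_F$, all within the rigid-band picture.

```latex
\begin{align*}
N(E_F) &= \int_{-\infty}^{E_F} n(E)\,\mathrm{d}E, \\
U(E_F) &= \int_{-\infty}^{E_F} (E - E_F)\, n(E)\,\mathrm{d}E, \\
\frac{\mathrm{d}U}{\mathrm{d}E_F} &= -N(E_F), \qquad
\frac{\mathrm{d}^2 U}{\mathrm{d}E_F^2} = -n(E_F).
\end{align*}
```

Applied to the difference between two structures, the same relations hold for the difference quantities. Each integration smooths the fine structure of the DOS, and the minus sign in the last relation may be read as the reason why minima and maxima exchange roles between the DOS and the energy.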

Recent theoretical investigations of the γ-brass phases Cu5Zn8 and Cu9Al4 (Asahi et 
al. 2005a) and T2Zn11 (T = Ni, Pd, Co, and Fe; Asahi et al. 2005b) revealed a 
stabilization of these electron phases by pseudogap formation across the Fermi 
level caused by a combination of orbital hybridization and Fermi level-Brillouin 
zone interaction. 


2.3.2 
Quantum Mechanical Models for Metallic Structures 


The structural systematics of elemental metals as a function of the atomic number 
stimulated investigations on the stability of atomic patterns by quantum 
mechanical calculations (Skriver 1985). The study of close-packed (metallic) crystal 
structures was performed by taking advantage of the frozen potential 
approximation. Here, the potentials were not self-consistently relaxed, with the 
consequence that both core level contributions and double-counting terms of 
different atomic patterns cancel in the calculation of the structural energy 
differences. As a consequence, the only remaining quantities are the one-electron 
energies (which in general underestimate electron-electron correlations) and the 
electrostatic term. Since cohesion of elemental metals has no electrostatic 
contribution, the energy difference of the fcc and hcp or bcc structural patterns 
can be obtained solely by integrating the one-electron state densities (Fig. 2.8). 
The calculation procedure reproduced the stability of the empirically observed 
crystal structures for Na, and all transition metals except gold, ytterbium, and 
the three 3d metals exhibiting ferromagnetic behavior. However, the calculated 
energy differences are approximately five times larger than the experimental values. 

Fig. 2.8 Differences in the structural energy as calculated using 
canonical d bands. Reference line $\mu S^2 \Delta = 0$ corresponds to fcc, open 
dots to bcc, and filled dots to hcp. Broken and dotted lines are guides 
to the eye. $\Delta$ stands for the energy difference between the fcc and hcp or 
bcc structure, $\mu$ means d-band mass, $S$ is the atomic Wigner-Seitz 
radius. 

Systematics of interatomic distances within the transition metal series can be 
understood in the framework of the Friedel model (Friedel 1969; Söderlind et al. 
1994). For the nonmagnetic 4d transition metal series (Fig. 2.9, broken curve) the 
successive filling of the bonding, nonbonding, and anti-bonding states results in 
a parabolic dependence of the binding energies and the interatomic distances on 
the number of valence electrons. In the 4d series the magnetic interaction is 
relatively weak and cannot compete with the elastic energy. In contrast, magnetic 
order occurs within the 3d series. Here, the metals with four to eight d electrons 
show pronounced deviations from the expected parabolic behavior (Fig. 2.9, full 
curve) toward longer distances. The weakening of the bonds is driven by a gain 
in magnetic energy. The magnetic order shifts the majority spin states to lower 
energy and, in the case of the magnetic 3d elements, increases the occupation of 
the anti-bonding majority spin states. The corresponding depopulation of the 
minority spins has less influence since it moves the nonbonding states to the Fermi 
level. Increasing the band dispersion, e.g., by application of pressure, can 
suppress the magnetic ordering and cancel the anomaly (Rosner et al. 2006). 

Fig. 2.9 Averaged interatomic distance in 3d and 4d metals as a 
function of the number of d electrons. Full circles indicate nonmagnetic 
3d metals, full squares magnetically ordered 3d metals, and open 
circles the nonmagnetic 4d metals. 
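
The parabolic trend attributed to the Friedel model can be reproduced with a minimal calculation. The sketch assumes the usual textbook simplification of a rectangular d band of width W that holds ten electrons and is centered on the atomic d level; the source does not spell out this functional form, and the bandwidth used below is a hypothetical value.

```python
def friedel_bond_energy(n_d: float, bandwidth_ev: float) -> float:
    """Bond energy per atom (eV) for a rectangular d band holding 10 electrons.

    Filling a constant DOS of 10/W from the band bottom gives
    E(N_d) = -(W / 20) * N_d * (10 - N_d),
    i.e. a parabola with maximum bonding at a half-filled band.
    """
    if not 0.0 <= n_d <= 10.0:
        raise ValueError("d occupation must lie between 0 and 10")
    return -bandwidth_ev / 20.0 * n_d * (10.0 - n_d)


W_HYPOTHETICAL = 5.0  # eV, illustrative bandwidth (not a fitted value)

for n_d in range(11):
    energy = friedel_bond_energy(n_d, W_HYPOTHETICAL)
    print(f"N_d = {n_d:2d}   E_bond = {energy:6.2f} eV")
```

Bonding states are filled up to N_d = 5 and anti-bonding states beyond, so the bond energy is symmetric about the half-filled band. Magnetic order, as described above for the 3d metals, is not part of this simple picture.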

For elemental rare-earth metals, band structure calculations (Duthie and Pettifor 
1977) revealed that the relative volume of the ion (metal) core influences the d 
band occupancy quite critically. Within the series of metals, the d band occupancy 
changes from about 1.5 to 2.5 and it is this fraction of the total energy which 
drives the crystal structure sequence in the trivalent 4f metals. 


2.3.3 
Electronic Closed-Shell Configurations and Two-Center Two-Electron Bonds in 
Intermetallic Compounds 


While valence compounds usually comprise fully occupied valence bands and 
empty conduction bands, so-called semimetals such as the compound LiIn with 
a diamond-type In⁻ partial structure have an indirect overlap of bands (Schwarz 
et al. 1998). The band structure reveals strong similarities to those of group 14 
semiconductors, with the energy of the conduction band dropping below the top 
of the valence band. This electronic situation induces the formation of an 
equivalent number of electrons and holes, which causes a metal-like conductivity. 
Despite the presence of partially filled bands, counting rules are valid since the 
number of orbitals is still in accordance with the simple empirical rules for 
valence compounds. 


2.3.3.1 Zintl-Klemm Approach 

With respect to both experimental and theoretical work, the so-called Zintl phases 
(Zintl and Kaiser 1933; Zintl 1939; Schäfer et al. 1973) are of special interest since 
chemical bonding in this group of intermetallic compounds can be described in 
most cases in full accordance with the counting rules for ionic and covalent 
compounds, i.e., the correlation between composition and number of covalent bonds 
per atom can be predicted by means of the “8 − N” rule (cf. Section 2.3.3.2). 
Thus, in compounds like K4Si4 the valence electron of potassium is transferred to 
silicon. The resulting three-bonded Si⁻ ions with five valence electrons form 
tetrahedra (Si⁻)4 similar to those in white phosphorus. Analogously, in the Zintl 
phase NaTl a complete transfer of the valence electron of the electropositive metal 
sodium to the more electronegative thallium is assumed. The Tl⁻ anions have the 
same number of valence electrons as group 14 elements and consequently realize 
a diamond-type network of four-bonded thallium species. However, the isotypic 
compound LiCd comprises four-bonded Cd⁻ with only three valence electrons 
per Cd⁻, thus providing evidence that, even within this subset of intermetallic 
compounds, violations of the simple valence rules occur. 
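
The electron counting used for the Zintl phases above can be written as a small check. The sketch assumes full transfer of one electron per anion atom and counts only s and p valence electrons (the filled d shell of Cd is ignored); the observed bond numbers are those quoted in the text, and the function and table names are introduced here for illustration.

```python
# s/p valence electrons of the neutral atoms (d shells not counted).
VALENCE_ELECTRONS = {"Si": 4, "Tl": 3, "Cd": 2}


def expected_bonds(anion_element: str, electrons_received: int) -> int:
    """Bonds per anion atom predicted by the 8 - N rule after charge transfer."""
    n = VALENCE_ELECTRONS[anion_element] + electrons_received
    return 8 - n


# (formula, anion element, electrons received per anion atom, observed bonds per atom)
examples = [
    ("K4Si4", "Si", 1, 3),
    ("NaTl", "Tl", 1, 4),
    ("LiCd", "Cd", 1, 4),
]

for formula, element, received, observed in examples:
    predicted = expected_bonds(element, received)
    verdict = "obeys 8 - N" if predicted == observed else "violates 8 - N"
    print(f"{formula:6s} predicted {predicted}, observed {observed}: {verdict}")
```

K4Si4 and NaTl follow the rule, whereas LiCd does not: Cd⁻ with three valence electrons would need five bonds to complete an octet but forms only four.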

Syntheses and characterization of compounds like CsAu (Sommer 1943; Kienast 
et al. 1961) and Cs2Pt (Karpov et al. 2003), which are constituted by a main 
group and a noble metal, have completed the group of intermetallic compounds 
in which the constituting metals exhibit large differences in electronegativity, 
resulting in a full charge transfer from cations to anions. Experimental data reveal 
that, e.g., the auride anion Au⁻ with a filled d shell plus a filled s shell is a 
surprisingly stable species, and that CsAu exhibits semiconducting behavior (Spicer 
et al. 1959) like a valence compound. Heavy noble metals such as platinum and 
gold are characterized by high electron affinities originating from their special 
electron configuration comprising 4f and 5d shells which are just filled. 
Additional stabilization of heavy metal anions by relativistic effects is discussed 
in theoretical treatments within the local density approximation (LDA) (Karpov et 
al. 2003). However, the stability of the CsCl-type compound LiAg (Freeth and 
Raynor 1953/54) demonstrates that s²d¹⁰ configurations are also adopted by lighter 
elements with a less pronounced relativistic stabilization of the s states. 
Moreover, the compound Na16Rb7Sb7, which may be more precisely written as 
(Na⁺)16(Rb⁺)6Rb⁻(Sb³⁻)7, indicates that even a filled s subshell can be realized (von 
Schnering et al. 1995). 

The given examples expand the list of compounds where filled valence electron 
subshells are stable, and oxidation states can be attributed in a number of 
strongly polar intermetallic compounds comprising discrete building units which 
can be described with known counting rules such as 8 − N (Hume-Rothery 1930; 
see below). Thus, the identification of covalent fragments in intermetallic 
compounds appears to be an especially promising approach to advance the counting 
rules into the field of intermetallic compounds. 


2.3.3.2 Extended 8 − N Rule 

Elemental semi-metals like arsenic or tellurium are elements with partly filled p 
bands in a hypothetical primitive cubic arrangement. Symmetry breaking by specific 
distortions opens band gaps at the zone boundary, with the consequence of 
metal-insulator transitions. The number of covalent bonds in the resulting 
distorted patterns, e.g., 8 − 5 = 3 for arsenic and 8 − 6 = 2 for tellurium, is in 
accordance with the 8 − N rule (Hume-Rothery 1930), which correlates the number of 
valence electrons N with the number of atomic bonds for post-transition main 
group elements. The basic idea is that the atoms complete their octet by forming 
8 − N two-center two-electron bonds. Later, the idea was extended to binary 
semiconductors and generalized to take into account homonuclear bonding, leading 
to the so-called general valence rule [Eq. (3)], where $n_e$ is the total number of 
valence electrons, $b_a$ and $b_c$ represent the number of bonds between anions and 
between cations, respectively, and $n_a$ stands for the number of anions (Pearson 1964). 


$(n_e + b_a - b_c)/n_a = 8$    (3) 


In $b_c$, the number of unshared valence electron pairs has to be included. All 
values relate to one formula unit of the compound. Thus, for GaTe with $n_e = 9$, 
$n_a = 1$ and $b_a = 0$, we find $b_c = 1$, i.e., one cation-cation bond per formula 
unit; for PbSe with $n_e = 10$, $n_a = 1$, $b_a = 0$, we find $b_c = 2$, i.e., a 
stereochemically active electron pair. 
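
The general valence rule can be rearranged to give the cation term directly, which makes the two worked cases easy to check. In this sketch the rule is read as (n_e + b_a - b_c)/n_a = 8, with n_e the valence electrons per formula unit, n_a the number of anions, b_a the anion-anion bonding term, and b_c the cation term (cation-cation bonding plus unshared electrons on the cation); the function name is hypothetical, and GaAs is added as an ordinary octet compound for comparison.

```python
def cation_term(n_e: int, n_a: int, b_a: int = 0) -> int:
    """Solve (n_e + b_a - b_c) / n_a = 8 for b_c.

    A result of 0 means a normal valence compound; a positive result
    means electrons left in cation-cation bonds or lone pairs on the cation.
    """
    return n_e + b_a - 8 * n_a


cases = {
    "GaAs": (3 + 5, 1),   # eight electrons: plain octet compound
    "GaTe": (3 + 6, 1),   # one electron left over per formula unit
    "PbSe": (4 + 6, 1),   # two electrons left over per formula unit
}

for formula, (n_e, n_a) in cases.items():
    print(f"{formula}: n_e = {n_e:2d}, b_c = {cation_term(n_e, n_a)}")
```

For GaTe the single leftover electron per formula unit is accommodated in Ga-Ga bonding, and for PbSe the two leftover electrons form the lone pair on Pb, in line with the interpretation given in the text.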

A special case of the 8 − N rule is realized by the weakly polar binary 1:1 
Grimm-Sommerfeld compounds. These are composed of elements of groups 
N − k and N + k (k = 1, 2, 3) and exhibit properties similar to those of the 
elements in group N. Examples are Ge and the binaries GaAs, ZnS, and CuBr, or 
As and binary SnTe (Grimm and Sommerfeld 1926). 

There is no direct way to predict immediately the structural effect of the 
electron deficiency with respect to the (8 − N) electron count. Of crucial 
importance seems to be the relation between skeletal bonding, i.e., within the 
anionic part of the crystal structure, and exohedral bonding, i.e., on the 
periphery of the anionic part of the crystal structure. For example, Wade’s rules 
(Wade 1976) for boranes imply counts lower than two for both skeletal and 
exohedral bonds, with a preference for the exohedral bonds compared with the 
skeletal bonds, since in the series from closo- to arachno-pentaborane B5H5²⁻ 
(22 valence electrons), B5H9 (24 valence electrons), B5H11 (26 valence electrons), 
the number of skeletal B-B bonding electrons increases from 12 via 14 to 16 while 
the number of exohedral bonding electrons is constantly 10. For the 
electron-deficient intermetallic compound Eu3Ge5 = (Eu²⁺)3(Ge5)⁶⁻ it was shown by 
means of the electron localization function/electron density (ELF/ED) approach 
(cf. Section 2.3.3.3) that the skeletal Ge-Ge bonds have electron counts lower than 
two, whereas in the lone-pair regions the expected numbers of electrons are 
observed (Budnyk et al. 2006). 
For systems like silicides of alkali and alkaline earth metals (Nesper 2003) or 
(Eu?*)s(Gag)!- (Grin et al. 2003) it was demonstrated that the skeletal bonds 
remain as two-center two-electron bonds, whereas the electron counts for the 
exohedral part, i.e., lone pairs, are less than two. 
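
The electron bookkeeping for the pentaborane series quoted above can be verified with a few lines. The sketch assumes three valence electrons per boron, one per hydrogen, and one terminal two-electron B-H bond per boron atom as the exohedral part; the function name is introduced here for illustration.

```python
def borane_electron_counts(n_boron: int, n_hydrogen: int, charge: int = 0) -> dict[str, int]:
    """Split the valence electrons of a borane into exohedral and skeletal parts."""
    total = 3 * n_boron + n_hydrogen - charge   # negative charge adds electrons
    exohedral = 2 * n_boron                     # one terminal B-H bond per boron
    return {"total": total, "exohedral": exohedral, "skeletal": total - exohedral}


series = {
    "closo-B5H5(2-)": (5, 5, -2),
    "nido-B5H9": (5, 9, 0),
    "arachno-B5H11": (5, 11, 0),
}

for name, (n_b, n_h, charge) in series.items():
    counts = borane_electron_counts(n_b, n_h, charge)
    print(f"{name:15s} total {counts['total']}, "
          f"skeletal {counts['skeletal']}, exohedral {counts['exohedral']}")
```

The exohedral count stays at 10 across the series while every added electron pair enters the skeleton, which is the preference for exohedral bonds referred to in the text.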


2.3.3.3 Bonding Models in Direct Space 

The local environment of atoms in the crystal structures of intermetallic com- 
pounds is difficult to describe by means of traditional MO pictures as well as by 
means of the free-electron gas model. The atomic coordination is usually more 
complex than in salt-like inorganic compounds, and cannot necessarily be derived 
from the motifs corresponding to closest packing of spheres, which are 
characteristic of crystal structures of, e.g., elemental metals. 

Bonding analysis in real space is a promising way to obtain deeper insight into 
this field. Recently, some quantum chemical functions such as the electron local- 
ization function (ELF) or the localized orbital locator (LOL; Schmider and Becke 
2000) have been developed as quantum-chemical tools for analyzing and investi- 
gating bonding problems of chemical compounds in direct space. The ELF is es- 
pecially useful for investigating bond formation in compounds with typically me- 
tallic physical properties and it is widely used for describing and visualizing 
chemical bonds in solids. Originally defined within the framework of Hartree-Fock 
theory (Becke and Edgecombe 1990) and later developed for density functional 
theory (Savin et al. 1992), the ELF belongs to the so-called bonding indicators 
in real space (i.e., position space). As was shown later (Kohout 2004), the 
functions and functionals of this group trace the correlation of electronic motion 
of same-spin electrons, and a more general function of this kind is called the 
electron localizability indicator (ELI). The ELI represents, in a general sense, a 
charge distribution of electron pairs; ELI and ELF are thus suitable tools for the 
analysis of chemical bonding in the sense of the Lewis theory (Lewis 1916), where 
pair formation plays the central role. In the vicinity of the nuclei, maxima 
indicate closed atomic shells. More detailed investigations show that not only the 
number of ELI (ELF) maxima for a given atom but also the integrated electronic 
population per shell is very close to the values expected from the Aufbau 
principle (Kohout and Savin 1996). The maxima of ELI (ELF) in the valence region 
(valence shells) and/or structuring of the penultimate (outer core) shell (Kohout 
et al. 2002) provide signatures for directed (covalent) bonding in position space. 
The ELI (ELF) tools 
are especially suitable to detect directed (covalent) bonding in materials with 
bands which are not fully occupied or are strongly overlapping, a situation which 
is typical for intermetallic compounds. The analysis of the topology of ELI (ELF) 
can be combined with the consecutive integration of the electron density (ED) in 
basins, which are bounded by zero-flux surfaces in the ELI (ELF) gradient field. 
This procedure, similar to the one proposed for ED (Bader 1999), makes it possi- 
ble to assign an electron count for each basin, revealing the basic information 
about the chemical bonding. This combined application of ELI (ELF) together 
with ED offers the possibility of Zintl-like electron counts for a large group of 
intermetallic phases and of getting access to a bond definition in real space. The 
application of ELF in different kinds of bonding situations has been reviewed 
(Gatti 2005; for more details see http://www.cpfs.mpg.de/ELF). 
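
The source does not give a formula for the ELF. As an illustration, the sketch below uses the commonly quoted closed-shell form, in which the excess kinetic energy density due to Pauli repulsion is compared with that of a homogeneous electron gas of the same density (atomic units). The function names are introduced here, and the two test points are idealized limiting cases, not results for a real compound.

```python
import math


def thomas_fermi_tau(rho: float) -> float:
    """Kinetic energy density of a homogeneous electron gas with density rho."""
    return 0.3 * (3.0 * math.pi**2) ** (2.0 / 3.0) * rho ** (5.0 / 3.0)


def elf(rho: float, grad_rho_sq: float, tau: float) -> float:
    """Closed-shell ELF at one point in space (atomic units).

    rho:         total electron density
    grad_rho_sq: squared magnitude of the density gradient
    tau:         positive-definite kinetic energy density,
                 (1/2) * sum over occupied spin orbitals of |grad phi_i|^2
    """
    pauli_excess = tau - grad_rho_sq / (8.0 * rho)
    ratio = pauli_excess / thomas_fermi_tau(rho)
    return 1.0 / (1.0 + ratio**2)


# Limit 1: homogeneous electron gas (no gradient, tau equals the reference).
rho_gas = 0.03
elf_gas = elf(rho_gas, 0.0, thomas_fermi_tau(rho_gas))

# Limit 2: one doubly occupied 1s-type orbital exp(-zeta * r), where
# |grad rho| = 2 * zeta * rho and tau = zeta^2 * rho / 2, so the Pauli term vanishes.
zeta, rho_pair = 1.7, 0.5
elf_pair = elf(rho_pair, (2.0 * zeta * rho_pair) ** 2, 0.5 * zeta**2 * rho_pair)

print(f"ELF, homogeneous gas      : {elf_gas:.3f}")
print(f"ELF, single electron pair : {elf_pair:.3f}")
```

Values near 1 mark regions dominated by a single localized electron pair (core shells, bonds, lone pairs), while the electron-gas value of 0.5 is the reference against which localization in metals is judged.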

In metals, the atoms adopt high coordination numbers but have relatively few 
valence electrons. Therefore, it seems natural to expect multicenter bonding. 
There is also an alternative view (Pauling 1960) which regards metallic bonding 
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Fig. 2.10 Chemical bonding in metals and 
intermetallic compounds as evidenced by the 
ELF (ELI) approach. (a) Location of ELF 
maxima in the octahedral holes (six-center 
bonds visualized by the appropriate 
isosurface of ELF, orange) in elemental Li 
from TB-LMTO calculation. (b) Four-center 


tetrahedral holes shown by the isosurface 
(turquoise). (c) Interpenetrating graphite-like 
nets of aluminum (blue) with copper atoms 
embedded in the framework by three-center 
bonds (dark gray) in the crystal structure of 
Cu1-xAl2. (d) Two-center bonds in the silicon 
framework (red) of EuSig visualized by the 


bonds in the fcc structure of copper from 
FPLO calculations, with ELF maxima in the 


ELF isosurface (gray). 


as a partial (or unsaturated) bonding between nearest neighbors. For bcc Li (Fig. 
2.10a), ELF (ELI) bonding attractors are observed in the centers of octahedral 
holes, thus indicating six-center bonds for calculations with the Hartree-Fock or 
the tight binding-linear muffin tin orbital (TB-LMTO) method. In the full- 
potential calculation, attractors occur at the tetrahedral voids, and imply four- 
center bonds. Despite the differences, these calculations show consistently that 
bonding is multicentered. ELF (ELI) for fcc Cu (Fig. 2.10b), as computed by the 
full potential local orbital (FPLO) and TB-LMTO-ASA (atomic spheres approxima- 
tion) methods, yields a single set of identical attractor positions. Each bonding 
basin has a count of 1.05 electrons and represents a four-center bond. Since the
fcc structure has two tetrahedral holes per atom, this implies that each Cu atom
participates with 2.1 electrons in chemical bonding (Ormeci et al. 2006).

The atomic coordination in the crystal structure of the binary phase Cu1-xAl2 is
pronouncedly different from that of the constituent elements in having copper
in a tetragonal-antiprismatic coordination of aluminum. Despite this, the elec- 
tronic density of states is metal-like since the valence and conduction bands over- 
lap. The analysis of ELF reveals a larger amount of directed bonding than ex- 
pected from the constituting elements. The covalently three-bonded aluminum 
atoms form interpenetrating graphite-like nets by two-center (and nearly two- 
electron) bonds. The copper atoms are embedded in the aluminum framework 
by means of three-center Cu-Al-Cu bonds (Fig. 2.10c; Grin et al. 2006). 

Four-bonded silicon atoms in the crystal structure of the high-pressure phase 
EuSi6 form a 3D network with a reduced density which is similar to that of
diamond. The europium atoms are embedded in large cavities of the framework.
Covalent bonding within the network is confirmed by ELF calculation (Fig. 2.10d;
Wosylus et al. 2006) and, as a consequence, the calculated electronic density of 
states shows a clear (pseudo-) gap. From the ELF/ED approach, the valence elec- 
trons of Eu are transferred to the anionic network. In the electronic density of 
states (DOS) they fill bands above the pseudo-gap, causing metal-like electronic 
transport properties. In total, this compound is an example of the so-called polar
intermetallic compounds (Nesper 1990) and can be described by the charge-transfer
balance Eu^{2+}(Si^{0})_6 × 2e^-, which visualizes the organization of bonding
electrons causing metallic behavior.


2.4 
Structure Types of Intermetallic Compounds 


The crystal structures of the approximately 20000 intermetallic compounds 
known in total belong to about 1200 structure types. The structure type is defined 
crystallographically as an atomic arrangement characterized by: 
• a symmetry of the same space group
• a lattice with the same or a similar ratio of lattice parameters
  a:b:c and angles α, β, γ
• a set of occupied Wyckoff positions (Wondratschek 1983)
  (sites) within a given space group
• a distinct distribution of the sites among different
  components
• similar values of the free positional parameters for corresponding
  sites.


A more detailed analysis (Parthé et al. 1993/94) revealed the presence of subtypes 
(branches) within the structure types. Nevertheless, the definition above allows a 
classification of the whole variety of crystal structures of intermetallic compounds 
into a manageable number of basic motifs. Some of the structure types have
hundreds of representatives (e.g., BaAl4, AlB2), others are found for only a few
compounds, and some of them are unique (Villars and Calvert 1996).
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2.4.1 
Classification of the Crystal Structures of Intermetallic Compounds 


The first classification of the crystal structures of the intermetallic compounds 
was introduced in the Strukturbericht and continued in Structure Reports (Anon. 
1931-1943, 1940-1990; see References); it was based on the composition of the 
materials: A = elements, B = two-component compounds of 1:1 composition, 
C = two-component compounds with 1:2 stoichiometry, ... This system offers 
the possibility of only very restricted and formal descriptions. Thus currently it 
is not used very much; however, descriptors like B2 (CsCl type) and A15 (Cr3Si
type) can still be found in the literature for ordered bcc phases or for conventional
superconductors. For ordered alloys, labels such as L1_a (CuPt3 type), L1_0 (AuCu
type), L1_1 (CuPt type), and L1_2 (AuCu3 type) are still in use.

Early attempts to classify the crystal structures of intermetallic compounds ac- 
cording to chemical bonding were not successful because of a lack of efficient 
tools for bonding investigations (cf. Section 2.3.1). Thus, the combined applica- 
tion of structural and chemical principles was considered as an appropriate route 
for classification of the crystal structures of intermetallic compounds. 

Pauling (1960) distinguished the following groups: crystal structures of elemental
metals (Cu, Mg, α-Fe) and their derivatives based on simple atomic packings,
crystal structures with packings of atoms with different sizes (Laves phases,
BaAl4), electronic phases (β- and γ-brass), and structures of metals with
semimetals and nonmetals (Fe3C, FeB, AlB2).

The structural principle of classification can be superimposed on the chemical 
aspects (Schubert 1964). According to the structural pattern, the derivatives of the 
closest- and close-sphere packings are grouped together with the brass-like
structures. The remaining crystal structures are classified according to their 
chemical composition (structures formed by main group elements, by a main 
group element and a transition metal, and by different transition metals). 

Another way of taking chemical principles into account for the classification of 
crystal structures of intermetallic phases is to focus on the atomic environment. 
Krypiakevich (1977) classified the structure types according to the coordination 
number and the shape of the coordination polyhedra of the atoms with the small- 
est CN in the investigated structure. Seventeen groups were defined with CN 
values from 2 to 14 and with a coordination ranging from linear to rhombodode- 
cahedral. This classification offers the possibility of recognizing relationships 
between structure types which are not easily detectable on the level of structural 
patterns. 

The following description of basic structures of intermetallic compounds is per- 
formed by combining the structural approach and atomic coordination analysis. 


2.4.2 
Crystal Structures Derived from the Closest Packings of Equal Spheres 


From a chemical point of view, the easiest way to describe intermetallic com- 
pounds is to start with the crystal structures of elemental metals. Most crystal 
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Fig. 2.11 Formation of basic closest-packed structure patterns. (a) 
Closest-packed atomic layer. (b) Stacking of the second layer in position 
B. (c) Stacking of the third layer in position A (... AB... packing 
sequence). (d) Stacking of the third and fourth layers in positions C 
and A (...ABC... packing sequence). 
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Fig. 2.12 Basic stacking patterns for closest-sphere packings and their notation. 
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structures of main group and transition metals are based on the closest and close 
packings of spheres of the same size. The closest packing of the spheres in the 
plane (2D) is defined by the coordination number 6. Each atom has six ligands
forming a six-membered ring (Fig. 2.11).


The 3D closest packing is formed when the atoms of adjacent layers are in con- 
tact with three atoms of each neighboring layer. This can be realized in two differ- 
ent positions B and C when the arrangement of the starting layer is labeled A 
(Fig. 2.11). Two different positions, A and C, are available for the stacking of the 
third atomic plane when, e.g., the second one is in the position B. The resulting
stacking sequences are ...AB... and ...ABC..., respectively. This defines the
endless variety of the closest packings according to their stacking pattern. Only
two different coordination environments are present in the closest packings: 
cuboctahedral in case of the ... ABC... stacking and its hexagonal analogue in 
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Fig. 2.13 Ordering pattern in the closest-packed layers in the crystal 
structures of (a) (Cd0.5In0.5)Au3; (b) NbPd3; (c) CdMg3, AuCu3, TiNi3,
PuAl3; (d) MoPt2, TaPt2; (e) ZrAu4; (f) Ti2Ga3; (g) TiAl, AuCu,
(Nb0.75Rh0.25)Rh, α-TaRh, β'-AuCd; (h) MoNi4.
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case of the... AB... stacking. Alternatively to the stacking sequences (positional 
symbols), the so-called coordinational symbols are used for the notation of the
layer sequence: c for the layers with the cuboctahedral coordination and h for
the layers with the coordination in the shape of the hexagonal analogue of the
cuboctahedron. In this so-called Jagodzinski-Wyckoff notation (Jagodzinski
1954), the ...ABC..., ...AB..., and ...ABAC... stackings are labeled c3, h2,
and (ch)2, respectively (Fig. 2.12).
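
The translation from positional to coordinational symbols can be sketched in a few lines of Python. This is only an illustration: the function name jagodzinski is a hypothetical helper, and the rule it applies (a layer is h when the layers directly below and above occupy the same position, otherwise c) is the usual reading of the notation described above.

```python
def jagodzinski(stacking: str) -> str:
    """Return the h/c symbols for one period of a closest-packing sequence.

    A layer is 'h' when its two neighboring layers occupy the same position
    (hexagonal analogue of the cuboctahedron) and 'c' when they differ
    (cuboctahedral coordination).
    """
    n = len(stacking)
    if n < 2:
        raise ValueError("need at least two layers per period")
    symbols = []
    for k, layer in enumerate(stacking):
        below = stacking[(k - 1) % n]  # periodic continuation of the sequence
        above = stacking[(k + 1) % n]
        if layer in (below, above):
            raise ValueError("adjacent layers must occupy different positions")
        symbols.append("h" if below == above else "c")
    return "".join(symbols)


if __name__ == "__main__":
    for seq in ("ABC", "AB", "ABAC"):
        print(seq, "->", jagodzinski(seq))
```

For the three stackings named in the text the sketch yields ccc, hh, and chch, i.e., c3, h2, and (ch)2.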

When all the atoms in the crystal structure are chemically equal, the positional
or Jagodzinski symbol uniquely specifies the symmetry, axis ratio, and atomic
positions in the structure. In cases where the atoms in the crystal structure are
different, they can be ordered in different ways within the layer, depending on the
composition. Typical ordering patterns are shown in Fig. 2.13. In this way, the 
whole variety of the ordered variants of closest packing structures can be classi- 
fied and notated with stacking sequence, composition, and ordering pattern in 
the layer. 


2.4.3 
Crystal Structures Derived from the Close Packings of Equal Spheres 


The bcc structural pattern can be obtained from an fcc arrangement by compres- 
sion of the latter either along the four-fold or along the three-fold axis. In case of a 
distortion along a four-fold axis, the intermediate structure is tetragonal (Fig. 
2.14). After reducing the compressed axis length to √2/2 of the initial value, the
structure becomes bcc. The unit cell contains only two atoms; thus, the bcc struc- 
ture is one of the simplest among intermetallic compounds. Several metals and 
intermetallic phases adopt this type of crystal structure and the relatively simple 
pattern allows for a large number of derivatives. 
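
The compression can be checked numerically with a short Python sketch. It is an illustration under stated assumptions: the function name neighbor_shells is hypothetical, the fcc lattice parameter is set to 1, and the compression factor is taken as √2/2 along one four-fold axis. The sketch counts the atoms in the first two coordination shells before and after the compression.

```python
import itertools
import math


def neighbor_shells(c_scale: float, n_shells: int = 2):
    """Distances and multiplicities of the first shells around one atom.

    Starts from an fcc lattice with a = 1 and scales the z axis by c_scale.
    """
    basis = [(0, 0, 0), (0.5, 0.5, 0), (0.5, 0, 0.5), (0, 0.5, 0.5)]
    shells = {}
    for cell in itertools.product(range(-2, 3), repeat=3):
        for b in basis:
            x, y, z = (cell[k] + b[k] for k in range(3))
            d = math.sqrt(x * x + y * y + (c_scale * z) ** 2)
            if d > 1e-9:  # skip the central atom itself
                key = round(d, 6)
                shells[key] = shells.get(key, 0) + 1
    return sorted(shells.items())[:n_shells]


if __name__ == "__main__":
    print("fcc:       ", neighbor_shells(1.0))
    print("compressed:", neighbor_shells(math.sqrt(2) / 2))
```

The undistorted lattice shows the 12 + 6 shells of fcc, whereas the compressed one shows the 8 + 6 shells characteristic of bcc.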

The first route to obtain new arrangements is an ordered occupation of the 
sites by different atoms, as already shown for the closest packings. Ordered re- 


Fig. 2.14 The structure type of (c) α-Fe (bcc, broken line: unit cell)
obtained from (a) the Cu type (fcc, solid line: unit cell) by a
compression along a four-fold axis. (b) The relationship of the unit
cells.
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Fig. 2.15 Ordering patterns for (a) the basic bcc structure: structure 
types (b) CsCl; (c) Fe3Si; (d) MnCu2Al; (e) Os2Al3; (f) Cr2Al.


placement of the positions in the ratio 1:1 without increasing the unit cell size 
leads to the formation of the cubic CsCl (or B2) type of structure (Fig. 2.15), 
which is often observed in metallic construction materials. Ordered occupation 
of the sites in the ratio 1:3 leads to the cubic structure of the BiF3 (or Fe3Si,
D0_3) type with a unit cell which is eight times larger than that of the CsCl type
(16 atoms per unit cell). This crystal structure is characteristic of, e.g., the Ni3Al
phase (superalloy). If the positions in the bcc motif are occupied in an ordered
manner by three different kinds of atoms in the ratio 1:2:1, the cubic structure
of the AlCu2Mn type (L2_1) forms. The unit cell volume is also eight times that
of the CsCl arrangement. The phases showing this type of crystal structure are
called Heusler phases. These materials are currently becoming the focus of
research because of their promising electronic properties (spintronics). Numerous
derivatives of the bcc motif with ordered occupation (superstructures) are known,
and some have lower symmetry than cubic. As examples, we show the tetragonal
structures of the Cr2Al and Os2Al3 types with three and five times the unit cell
volume of the initial bcc structure, respectively (Fig. 2.15).

Fig. 2.16 Formation of structural patterns derived from the bcc packing
(panel labels: α-Fe (bcc); ordering + distortion + vacancy).


Formation of superstructures is not the only way to obtain structurally new
materials derived from the bcc pattern (Fig. 2.16). Ordered occupation may lead
to a local distortion resulting in a symmetry reduction to orthorhombic in the
case of α-VIr. Alternatively, the formation of a superstructure can be combined with
the occurrence of vacancies: bcc → AlCu2Mn → MgAgAs. The phases with the
crystal structure of the latter type can be found in the literature under the name 
“semi-Heusler phases,” which reflects solely their structural relationship. 

The technically important γ-brass family can be obtained from the simple bcc
structure by formation of a superstructure (also with partial occupation of sites
by two kinds of atoms) with a unit cell volume which is 27 times that of bcc.
Two of the 54 positions are unoccupied and the neighboring atoms are shifted
toward the vacancy (Fig. 2.16). Various compositions of the γ-brass phases originate
from a mixed occupation of some positions (gray in Cu9Al4, Fig. 2.16, bottom)
depending on the constituent elements (cf. Hume-Rothery phases, Section 2.3.1).
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2.4.4 
Crystal Structures Derived from the Packings of the Spheres of Different Sizes 


Laves phases form the largest group of intermetallic compounds with the ideal 
composition AB2. The crystal structures of the Laves phases can be topologically
derived from the closest packings. Eight of the 16 atoms in the unit cell are
substituted by four larger atoms corresponding to the composition 16B − 8B +
4A = 4AB2. In case of the packing ...ABAC... (structure type α-Nd, (ch)2),
the replacement takes place in neighboring layers (Fig. 2.17). Pairs of vertex-
condensed tetrahedra are replaced by two A atoms, and a pattern of the structure
type MgZn2 forms. In a similar way, the structural motif of the Laves phase
MgCu2 can be derived from the closest packing ...ABCABC... (structure type
Cu, (c3)2). According to this building principle an optimal radius ratio of
1:2^(1/3) = 1:1.26 (Nowotny and Mornheim 1939) or 1:√1.5 (approximately 1:1.225,
Laves and Witte 1935; Dehlinger and Schulze 1940) is calculated for the
components B and A. The importance of radius ratios for the packing of atoms of
different sizes seems to be underlined by the experimental observation of the solid
noble gas compound NeHe2 with MgZn2-type crystal structure at high pressures
(Loubeyre et al. 1993), but only a few of the known Laves phases have a radius
ratio in the interval obtained from these geometrical considerations (Stein et al.
2004; Fig. 2.1). One of the extreme outliers is KAu2, which exhibits a radius
ratio r(K)/r(Au) = 1.648.

Fig. 2.17 (a) The structure type α-Nd (closest packing (ch)2) and its
transformation to (c) a crystal structure of the MgZn2 type by (b)
substitution of the four dark gray smaller atoms by one larger atom. (d)
The structure type of Cu (closest packing (c3)2) and its transformation
to (f) the structure of the MgCu2 type by (e) substitution of the four
dark gray smaller atoms by one larger atom.
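
The two geometric ideals for the Laves phases and the position of an outlier relative to them can be evaluated with a minimal Python sketch. The names RATIO_CUBE_ROOT, RATIO_SQRT, and within_ideal_interval are hypothetical, and the tolerance is an arbitrary illustrative choice rather than a value taken from the literature.

```python
import math

# The two ideal radius ratios r(A)/r(B) obtained from geometrical considerations.
RATIO_CUBE_ROOT = 2 ** (1 / 3)   # about 1.26
RATIO_SQRT = math.sqrt(1.5)      # about 1.225


def within_ideal_interval(ratio: float, tolerance: float = 0.02) -> bool:
    """True if ratio lies between the two ideals, widened by an assumed tolerance."""
    low = min(RATIO_CUBE_ROOT, RATIO_SQRT) - tolerance
    high = max(RATIO_CUBE_ROOT, RATIO_SQRT) + tolerance
    return low <= ratio <= high


if __name__ == "__main__":
    print(f"2^(1/3)   = {RATIO_CUBE_ROOT:.3f}")
    print(f"sqrt(1.5) = {RATIO_SQRT:.3f}")
    # r(K)/r(Au) = 1.648 is the value quoted in the text for the outlier.
    print("outlier within interval:", within_ideal_interval(1.648))
```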


2.4.5 
Selected Crystal Structures with Complex Structural Patterns 


The intermetallic compound SmCo5 is the first and best-known representative of
a family of magnetic materials with general composition RX5. The crystal
structure of these compounds belongs to the CaCu5 type. This structural pattern
cannot be derived from the closest- or close-packed structures, but from the
structures of the Laves phases. The structures of the MgZn2 and CaCu5 types can be
described as frameworks of vertex-condensed tetrahedra X4 with R atoms
embedded in their cavities (Fig. 2.18a,b). The relationship between the two structure
types becomes clearly visible when the atomic layers perpendicular to the six-fold
axis are compared (Fig. 2.18c,d). Both patterns comprise common Kagome nets
(Wells 1968; O’Keeffe and Hyde 1996) with composition X3 (within a unit cell).
The mixed layers of R and X atoms have the same topology in both structures
with occupied positions (0,0), (1/3,2/3) and (2/3,1/3). However, the
compositions are different: R2X in MgZn2 and RX2 in CaCu5. The overall compositions
follow as R2X + X3 = 2RX2 and RX2 + X3 = RX5 for MgZn2 and CaCu5, respectively.

Despite the apparent simplicity, the atomic pattern of CaCu5 offers several
possibilities for formation of new structural motifs. The presence of the same
Kagome net in the structure types of CaCu5 and the Laves phases (e.g., MgCu2)
allows a combination of both structural motifs along the hexagonal axis. The
Kagome net serves as the common interface. The new structural patterns obtained
in this way are called intergrowth structures (Parthé et al. 1985; Grin 1992). The
composition of the resulting intergrowth structures is simply the sum of the
compositions of the prototypes: R2X4 + RX5 = 3RX3 for the CeNi3 and PuNi3
structure types and R2X4 + 2RX5 = 2R2X7 for the Ce2Ni7 and β-Gd2Co7 structure
types (Fig. 2.19).

Another route to the formation of new structural motifs derived from the
CaCu5 type is a double replacement of an R atom by an X2 pair. This leads to the
crystal structures of the Th2Ni17 and Th2Zn17 types (3RX5 − R + X2 = R2X17) or
the ThMn12 type (2RX5 − R + X2 = RX12).
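
The composition bookkeeping for intergrowth and substitution derivatives of the CaCu5 type can be reproduced with a small Python sketch. The helpers combine and reduced and the constants LAVES and CACU5 are hypothetical names introduced only for this illustration.

```python
from collections import Counter
from functools import reduce
from math import gcd

LAVES = {"R": 2, "X": 4}   # Laves-type slab, R2X4
CACU5 = {"R": 1, "X": 5}   # CaCu5-type slab, RX5


def combine(*terms):
    """Sum (multiplier, composition) pairs; negative multipliers remove atoms."""
    total = Counter()
    for mult, comp in terms:
        for element, count in comp.items():
            total[element] += mult * count
    return {el: n for el, n in total.items() if n != 0}


def reduced(comp):
    """Return (Z, formula unit), where Z is the common divisor of all counts."""
    z = reduce(gcd, comp.values())
    return z, {el: n // z for el, n in comp.items()}


if __name__ == "__main__":
    print(reduced(combine((1, LAVES), (1, CACU5))))                    # 3 x RX3
    print(reduced(combine((1, LAVES), (2, CACU5))))                    # 2 x R2X7
    print(reduced(combine((3, CACU5), (-1, {"R": 1}), (1, {"X": 2}))))  # R2X17
    print(reduced(combine((2, CACU5), (-1, {"R": 1}), (1, {"X": 2}))))  # RX12
```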

Fig. 2.18 Crystal structures of (a) MgZn2 (RX2) and (b) CaCu5 (RX5) as
frameworks of vertex-condensed tetrahedra X4 with embedded R
atoms. Atomic layers normal to the hexagonal axis in (c) MgZn2 and
(d) CaCu5. Dotted lines show projection of the unit cell.

The crystal structure of a large group of superconducting materials (e.g.,
Nb3Ge, which for a long time had the highest known transition temperature of
23 K, in the era before oxide-based superconductors) belongs to the Cr3Si (A15)
type (Fig. 2.20). Similarly to the CaCu5 motif, the Cr3Si pattern cannot be derived
directly from the closest or close packings. In a first approximation the structure
can be described as composed of layers perpendicular to the axes of the cubic
lattice (Fig. 2.20b). The mixed Cr/Si layer is a defect closest-packed one. Between
these layers, additional Cr atoms are located in front of the vacancies, forming a




46 | 2 Crystal Structure and Chemical Bonding 





Fig. 2.19 Crystal structures of binary compounds in the Ce-Ni system
as intergrowth derivatives of CaCu5 and Laves phases (panel labels: CeNi3,
Ce2Ni7).


very loose network (Fig. 2.20b). On the other hand, the crystal structure of Cr3Si
can be understood as Si atoms and Cr2 pairs located in the positions of the fcc
pattern (0,0,0) and (½,½,0), respectively (Fig. 2.20c).
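
The pair description can be illustrated with a short Python sketch. It assumes the standard setting of the A15 structure (space group Pm-3n with Cr on the Wyckoff position 6c), which is not quoted in the text; the names CR_6C and pair_midpoints are hypothetical.

```python
from fractions import Fraction as F

# Cr positions (Wyckoff 6c of Pm-3n, assumed standard setting), listed pairwise.
CR_6C = [
    (F(1, 4), F(0), F(1, 2)), (F(3, 4), F(0), F(1, 2)),
    (F(1, 2), F(1, 4), F(0)), (F(1, 2), F(3, 4), F(0)),
    (F(0), F(1, 2), F(1, 4)), (F(0), F(1, 2), F(3, 4)),
]


def pair_midpoints(positions):
    """Midpoints of consecutive position pairs (the centers of the Cr2 pairs)."""
    mids = []
    for p, q in zip(positions[0::2], positions[1::2]):
        mids.append(tuple((a + b) / 2 for a, b in zip(p, q)))
    return mids


if __name__ == "__main__":
    for mid in pair_midpoints(CR_6C):
        print(tuple(str(c) for c in mid))
```

Under these assumptions the three pair centers coincide with the face-centering positions (½,0,½), (½,½,0), and (0,½,½) of the cubic cell.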

The general complexity of the structural pattern increases further when the
crystal structure of Nd2Fe14B is considered (Fig. 2.21). This compound represents
the currently most efficient hard magnetic materials. The tetragonal crystal
structure can be understood as a stacking of two different types of structural slabs
along the four-fold axis. The first one is centered at z = 0, the second at z =}. 
In both segments we observe atomic arrangements originating from the CaCu5
motif (broken-line parallelogram). Additionally, fragments of the Cr3Si pattern 
are present (dotted square). Thus, the whole crystal structure can be described as 
a three-dimensional arrangement of fragments of simpler crystal structures like 
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Fig. 2.20 The crystal structure of the Cr3Si type (a) as a stacking of 
defect closest-packed layers (b) with additional embedded Cr atoms. 
Derivation of the Cr3Si pattern from fcc by double substitution on the
(½,½,0) position (c). Dotted lines show the unit cells.

Fig. 2.21 The crystal structure of the hard magnetic material Nd2Fe14B
as a derivative of the crystal structure motifs of CaCu5 and Cr3Si.

CaCu5 or Cr3Si. A further increase in the structural complexity is observed on
proceeding to the quasicrystalline phases and their approximants.


2.5 
Quasicrystals 


2.5.1 
Introduction 


All known single-phase solids in thermodynamic equilibrium are crystalline. 
This means that the symmetry of their crystal structures can be described by an 
n-dimensional (nD) space group in direct space (cf. Steurer and Haibach 1999) 
or, fully equivalently, by the corresponding symmetry group in 3D reciprocal 
space (Mermin 1992). For the standard case of 3D translationally periodic crystal 
structures, n = 3. For the known aperiodic crystals, which can all be described
geometrically as 3D sections of nD hypercrystals, 4 ≤ n ≤ 6 applies. Crystallinity
leads to an intrinsic anisotropy of physical properties. Faceted crystals are one of 
its manifestations (Fig. 2.22). 

A universal definition of aperiodic crystals directly related to a diffraction exper- 
iment was provided by the IUCr Commission on Aperiodic Crystals (IUCr 1992): 
“.. by crystal is meant any solid having an essentially discrete diffraction dia- 
gram, and by aperiodic crystal is meant any crystal in which three dimensional lat- 
tice periodicity can be considered to be absent.” “Essentially discrete” means that 
the diffraction intensities can even be densely distributed as long as they are 
Bragg reflections. The known aperiodic crystals comprise the classes of quasicrys- 
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Fig. 2.22 (a) SEM photograph of decagonal Al-Co-Ni with 
decaprismatic growth morphology; (b) the diffraction pattern and star 
of reciprocal lattice vectors. 
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tals (QCs), incommensurately modulated crystals (IMCs), and composite crystals 
(CCs) (cf. Yamamoto 1996). All three of these classes possess the kind of topolog- 
ical long-range order which leads to Bragg reflections in a diffraction experiment 
(Fig. 2.22), although their structural building units are arranged aperiodically in 
3D space. 

3D lattice periodicity is compatible with n^c = one-, two-, three-, four- and
six-fold rotational symmetry, consequently called crystallographic symmetry. If one
of the lattice points is a point of global n^c-fold symmetry, then all other lattice
points are as well. Rotation around any lattice point brings the lattice n^c times
into coincidence with itself. In contrast, a 3D aperiodic crystal structure cannot
contain more than a single axis of global noncrystallographic n^nc-fold symmetry.
Typical, however, is the existence of an infinite number of axes with only local
n^nc-fold symmetry (i.e., symmetry of patches of tiles or of atomic clusters) as
well as global n^nc-fold orientational symmetry (i.e., the edges in a tiling are
oriented only along n^nc symmetrically equivalent directions).

Noncrystallographic n^nc-fold symmetries can become crystallographic in
higher-dimensional lattices. For 5-, 8-, 10- and 12-fold rotational symmetry this is 
the case in 4D lattices and for 7-, 9-, 14- and 18-fold symmetry in 6D lattices (Her- 
mann 1949; Hiller 1985). This property underlies the higher-dimensional 
approach (Wolff 1974; Janssen 1986; cf. Steurer and Haibach 1999), a powerful 
and elegant way to describe aperiodic crystals. The information on symmetry 
and metrics of an aperiodic structure is coded in the parameters of the nD lattice 
and its orientation to the 3D physical space. The full information on atomic posi- 
tions, site occupancy, stoichiometry, and all interatomic distances is coded in the 
(n — 3)D atomic surfaces and their positions in the nD unit cell (Fig. 2.23, below). 
The nD embedding space consists of the physical or parallel space (par-space) and 
the perpendicular space (perp-space) orthogonal to it, V = V∥ ⊕ V⊥. The point





Fig. 2.23 (a) Fibonacci sequence and (b) its reciprocal space in the 2D description. 




symmetry group of the nD hypercrystal has to be isomorphous with the point 
group of the 3D aperiodic crystal. 
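
The lattice dimensions quoted above for the different rotational symmetries can be reproduced with a short Python sketch. It is a sketch under an assumption: the additive rule over prime-power factors used here is the generalized crystallographic restriction as commonly stated (cf. Hiller 1985) and is not derived in this chapter. The function names are hypothetical.

```python
def prime_power_factors(n: int):
    """Return [(p, p**k), ...] for the prime-power factorization of n."""
    factors = []
    p = 2
    while p * p <= n:
        if n % p == 0:
            q = 1
            while n % p == 0:
                n //= p
                q *= p
            factors.append((p, q))
        p += 1
    if n > 1:
        factors.append((n, n))
    return factors


def min_lattice_dimension(n: int) -> int:
    """Smallest lattice dimension compatible with an n-fold rotation (assumed rule)."""
    if n == 1:
        return 0
    if n == 2:
        return 1
    dim = 0
    for p, q in prime_power_factors(n):
        if q == 2:          # a single factor of 2 needs no additional dimension
            continue
        dim += q - q // p   # Euler totient of the prime power q = p**k
    return dim


if __name__ == "__main__":
    for n in (1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 14, 18):
        print(f"{n:2d}-fold: {min_lattice_dimension(n)}D")
```

Only n = 1, 2, 3, 4, and 6 give a dimension of at most 3, while 5, 8, 10, and 12 require 4D and 7, 9, 14, and 18 require 6D lattices, in line with the values given in the text.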

Special degrees of freedom that are characteristic of aperiodic crystals can be 
easily visualized by this approach. Moving the hypercrystal along the perp-space 
generates an infinite number of locally isomorphous (homometric and isoener- 
getic) quasiperiodic structures. Shearing it along the perp-space generates peri- 
odic rational approximants. Periodic or random fluctuations of the perp-space co- 
ordinates lead to phason modes or phason fluctuations, respectively (Socolar et al. 
1986; Socolar 1988; Bancel 1989; Janssen and Radulescu 2004). 


2.5.2 
Quasiperiodic Structures in Direct and Reciprocal Space 


The simplest example of a QC is the 1D Fibonacci sequence (FS). It can be gen- 
erated either by substitution rules or as a cut of a 2D hypercrystal (Fig. 2.23). The 
substitution rule σ: S → L, L → LS replaces the letter S by L and the letter L by
the word LS (Table 2.1) (cf. Luck et al. 1993). An infinite FS remains invariant
under this substitution rule. The Fibonacci numbers F_{n+1} and F_n, with
F_n = F_{n-1} + F_{n-2}, give the frequency of L and S, respectively, in the chain.
Their ratio [Eq. (4)] approaches the golden mean τ for n going to infinity.

\lim_{n \to \infty} \frac{F_{n+1}}{F_n} = \tau = \frac{1 + \sqrt{5}}{2} = 2\cos(\pi/5)    (4)

The number τ is the positive solution of the algebraic equation τ² − τ − 1 = 0 and
it is therefore a quadratic irrationality. If we assign to L and S long and short
atomic distances with the ratio τ [Eq. (5)], then we get a quasiperiodic structure,
which is invariant under scaling with τ^n.

\frac{L}{S} = \frac{LS}{L} = \frac{LSL}{LS} = \frac{LSLLS}{LSL} = \dots = \tau    (5)


Table 2.1 Generation of the Fibonacci sequence by the substitution rule
σ: the frequency of the letters L and S is given by the respective
Fibonacci numbers F_{n+1} and F_n.

n    Word             Number of S (F_n)   Number of L (F_{n+1})   F_{n+1}/F_n
0    L                0                   1                       1/0 (undefined)
1    LS               1                   1                       1/1 = 1
2    LSL              1                   2                       2/1 = 2
3    LSLLS            2                   3                       3/2 = 1.5
4    LSLLSLSL         3                   5                       5/3 = 1.666...
5    LSLLSLSLLSLLS    5                   8                       8/5 = 1.6
∞                                                                 τ = 1.618...
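
The generation of the words in Table 2.1 and the convergence of the letter ratio can be sketched in Python as follows. The function name fibonacci_words is a hypothetical helper introduced only for this illustration.

```python
import math

TAU = (1 + math.sqrt(5)) / 2


def fibonacci_words(n_max: int):
    """Apply the substitution S -> L, L -> LS repeatedly, starting from 'L'."""
    words = ["L"]
    for _ in range(n_max):
        words.append("".join("LS" if ch == "L" else "L" for ch in words[-1]))
    return words


if __name__ == "__main__":
    for n, word in enumerate(fibonacci_words(12)):
        n_l, n_s = word.count("L"), word.count("S")
        ratio = n_l / n_s if n_s else float("nan")
        shown = word if len(word) <= 13 else word[:13] + "..."
        print(f"{n:2d}  {shown:16s}  S={n_s:3d}  L={n_l:3d}  L/S={ratio:.6f}")
    print("tau =", TAU)
```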
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Embedding the FS in 2D space allows its description as a 1D cut of a periodic 2D 
hypercrystal (cf. Steurer and Haibach 1999) (Fig. 2.23). The lattice is spanned by 
the basis vectors d1, d2. The slope of d1 is equal to τ. The vertices of the FS are
generated where the physical space V∥ cuts the atomic surfaces (line segments).
The size of the atomic surface results from the projection of a unit cell upon the 
perp-space. The frequency of distances L and S is proportional to the height of the 
shaded areas in the upper left part of Fig. 2.23(a). Cutting the hypercrystal with 
the physical space with a rational slope (for instance, the inclined gray line
through lattice points 00, 11, 22, ...) gives the n/m-approximant (in our example
1/1). Moving the physical space (horizontal gray line) through the end points of
the atomic surface (marked by an arrow) leads to vertex jumps (phason flips) of
the type LS ↔ SL. Projecting the hypercrystal along the light-gray shaded lines
gives the periodic average structure (PAS) of the FS. 
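
The cut through the 2D hypercrystal can be imitated numerically by the equivalent strip-projection construction. The following Python sketch is only an illustration under stated assumptions: a square lattice with unit edge is used, the physical space is taken along the direction (τ, 1), the acceptance window is the projection of one unit cell on the perp-space, and the function names are hypothetical. The orientation and scaling conventions of Fig. 2.23 may differ from this choice.

```python
import math

TAU = (1 + math.sqrt(5)) / 2
NORM = math.sqrt(1 + TAU ** 2)


def fibonacci_chain_by_cut(n_cells: int):
    """Par-space coordinates of square-lattice points accepted by the window."""
    window = (1 + TAU) / NORM            # unit cell projected on the perp-space
    points = []
    for i in range(n_cells + 1):
        for j in range(n_cells + 1):
            perp = (-i + TAU * j) / NORM  # perp-space component of (i, j)
            if 0.0 <= perp < window:
                points.append((TAU * i + j) / NORM)  # par-space component
    return sorted(points)


def spacings_to_letters(points):
    """Label each spacing between neighboring vertices as L (long) or S (short)."""
    long_spacing = TAU / NORM
    return "".join(
        "L" if math.isclose(b - a, long_spacing, rel_tol=1e-9) else "S"
        for a, b in zip(points, points[1:])
    )


if __name__ == "__main__":
    word = spacings_to_letters(fibonacci_chain_by_cut(50))
    print(word)
    print("L/S frequency:", word.count("L") / word.count("S"))
```

The resulting chain contains only two spacings with length ratio τ, never two S or three L in a row. It is a member of the same local isomorphism class as the FS but need not start with the same letters as the words obtained by substitution, since this depends on the position of the cut along the perp-space.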

In 1D reciprocal space, the FS shows Bragg reflections at positions given by a
Z-module of rank 2 [Eq. (6)], with a_1^* = τ a_2^*.

H = \sum_{i=1}^{2} h_i a_i^*, \quad h_i \in \mathbb{Z}    (6)


By embedding the FS in 2D space the periodicity of the reciprocal lattice is re- 
stored (Fig. 2.23b). Since direct and reciprocal spaces are mathematically related 
by Fourier transformation, the cut in direct space, generating the aperiodic crystal 
structure, corresponds to a projection in reciprocal space yielding the 1D diffrac- 
tion pattern. Reflections which are close to each other in the 1D diffraction 
pattern originate from regions of the reciprocal lattice which are far apart 
with strongly differing perp-space components of their diffraction vectors H =
(H∥, H⊥). Their intensities are very different because the intensity decreases
rapidly with increasing |H⊥|. This is the reason why an experimentally observed diffraction
pattern of a QC shows clearly separated Bragg reflections although they have basi- 
cally a dense distribution. 

The Penrose tiling (PT; Penrose 1974) is a good example of a quasilattice in two 
dimensions (Fig. 2.24). It can be constructed from two unit tiles: a fat rhomb 
(acute angle α = 2π/5) and a skinny rhomb (acute angle α = π/5) with equal
edge lengths a_r. The ratios of their areas as well as of their frequencies in the
infinite tiling are τ:1. The construction of the PT has to obey matching rules of the
tiles that can be derived from its scaling properties (cf. Steurer and Haibach 
1999). A special decoration of the unit tiles with line segments yields a PT with 
quasiperiodically spaced lines in five orientations, the so called Ammann lines. 
These lines can be seen as the equivalent of the lattice lines in 2D or lattice 
planes (net planes) in 3D lattices. A quasiperiodic structure resulting from an ap- 
propriate decoration of the unit tiles with atoms or atomic clusters can reasonably 
well describe some characteristics of real decagonal QCs (DQCs), which are geo- 
metrically stackings of quasiperiodic layers. 

Fig. 2.24 Penrose tiling with Ammann lines. Top: the two unit tiles with
line segments giving infinite straight lines in the ideal PT. Bottom:
substitution of the unit tiles by tiles which are smaller by a factor τ.

Fig. 2.25 The two unit tiles of the 3D Penrose tiling: (a) a prolate and
(b) an oblate rhombohedron with equal edge lengths a_r. Decoration
sites by atoms or clusters are marked by spheres.

A 3D quasilattice, the 3D Penrose tiling or Ammann tiling, is based on two
rhombohedral unit tiles (Fig. 2.25) (cf. Steurer and Haibach 1999). The acute
angles of the rhombs amount to α_r = arctan(2) = 63.44°. Their volumes and
frequencies are in a ratio of τ:1.
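
The size ratios of the unit tiles follow directly from their angles, as the following Python sketch illustrates. It rests on assumptions not spelled out in the text: unit edge length, the standard volume formula for a rhombohedron with edge a and angle α between its edges, V = a³(1 − cos α)√(1 + 2 cos α), and an edge angle of π − arctan(2) for the oblate tile. The function names are hypothetical.

```python
import math

TAU = (1 + math.sqrt(5)) / 2


def rhomb_area(alpha: float, edge: float = 1.0) -> float:
    """Area of a rhomb with the given edge length and interior angle alpha."""
    return edge ** 2 * math.sin(alpha)


def rhombohedron_volume(alpha: float, edge: float = 1.0) -> float:
    """Volume of a rhombohedron whose three edges at a vertex enclose alpha."""
    c = math.cos(alpha)
    return edge ** 3 * (1 - c) * math.sqrt(1 + 2 * c)


# 2D Penrose tiling: fat rhomb (2*pi/5) versus skinny rhomb (pi/5).
area_ratio = rhomb_area(2 * math.pi / 5) / rhomb_area(math.pi / 5)

# 3D Penrose tiling: prolate versus oblate rhombohedron (assumed edge angles).
alpha_prolate = math.atan(2)
alpha_oblate = math.pi - alpha_prolate
volume_ratio = rhombohedron_volume(alpha_prolate) / rhombohedron_volume(alpha_oblate)

if __name__ == "__main__":
    print("area ratio   fat/skinny    :", area_ratio)
    print("volume ratio prolate/oblate:", volume_ratio)
    print("tau                        :", TAU)
```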


2.5.3
Formation and Stability 


QCs are binary or ternary intermetallic phases. They are hard and brittle, simi- 
larly to almost all known complex intermetallic phases. Due to their special 
cluster-based structure, the electronic density of states of QCs shows a pseudo- 
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gap at the Fermi edge. Although their constituents are electronically very good 
metallic conductors, their electrical and thermal conductivity decreases with
temperature. It falls even below the Mott conductivity threshold in some cases. For a
detailed introduction to the physics and (still only potential) applications of
QCs, see Dubois (2005). 

More than 100 stable or metastable representatives are known so far. Most QCs 
have been discovered by a search strategy based on the concept of electron com- 
pounds (Hume-Rothery phases) (Tsai 2003). Most of them contain aluminum as 
main component, some titanium, magnesium, zinc, or cadmium. Some are line 
phases such as icosahedral Cd-Yb (i-Cd-Yb), others show broad compositional 
stability ranges such as decagonal Al-Co-Ni (d-Al-Co-Ni). With the exception 
of i-Ti-Zr-Ni, they are stable up to the melting temperature. Only a few QCs 
melt congruently. 





2.5.4 
Structures of Decagonal Quasicrystals (DQCs) 


DQCs, with diffraction symmetry 10/mmm or 10/m, can be geometrically seen 
as periodic packings of quasiperiodic atomic layers. More physical, however, is 
the description as quasiperiodic packing of decagonal columnar clusters that are 
periodic along the column axis. The needle-like morphology of DQCs indicates 
faster growth along the ten-fold axis, i.e., the periodic direction (Fig. 2.22). 

A typical example of the structure of a DQC is depicted in Fig. 2.26(a) (Cervel- 
lino et al. 2002). The projected electron density of decagonal Al-Co-Ni is overlaid 
with different tilings and coverings to show the cluster structure. A pentagonal 








Fig. 2.26 45 Å × 45 Å section of the projected electron density of 
decagonal Al-Co-Ni with (a) high resolution and (b) a resolution 
comparable with that of the HAADF image (Abe and Tsai 2001) copied 
on the position of Gummelt decagon α. In (b) black contrasts correspond 
to zero density, white to maximum density (from Steurer 2004). 
PT with edge length a_r = 4.625 Å is shown by thin lines. A group of five decagonal 
Gummelt clusters (Gummelt 1996) with 20.6 Å diameters is drawn in, labeled 
α to ε. The centers of the Gummelt decagons are located on a τ²-inflated version 
of the pentagon tiling. In Fig. 2.26(b), the projected electron density of Fig. 
2.26(a) is shown at a resolution comparable with that of the HAADF-STEM 
(High-Angle Annular Dark Field Scanning Transmission Electron Microscopy) 
image (Abe and Tsai 2001) copied on the position of Gummelt decagon α. There 
is good qualitative agreement between the distribution of transition-metal-atom-related 
contrasts in the calculated and observed images. 

Intermetallic phases with structures closely related to those of quasicrystals are 
called approximants. “Closely related” means that they consist of the same (more 
or less distorted) structural building units (clusters). In the special case where the 
structure can be described by a rational section of the hypercrystal structure that 
also generates the quasicrystal structure, it is called a rational n/m-approximant. 
An example of an approximant of d-Al-Co-Ni is shown in Fig. 2.27. One layer 
shows a decorated pentagon tiling, the other a hexagon tiling dual to it (black 
lines). All the vertices of these tilings are decorated by Co atoms. The hexagons 
have the same edge lengths as those of Fig. 2.26. 
Fig. 2.27 (011) atomic layers of o-Al13Co4 and structure projected along 
[001] (Grin et al. 1994). Grin, J., Burkhardt, U., Ellner, M., Peters, K. 
(1994), J. Alloy. Compd. 206, 243. 
Fig. 2.28 Compositions of stable decagonal phases. Only the upper half 
diagram (A_xB_yC_z, x + y + z = 100, 50% < x < 100%, 0% < y < 50%, 
0% < z < 50%) is shown. 


More than 20 stable decagonal phases with n-layer periodicity along the tenfold 
axis have been found so far in the following systems (Steurer 2004). 

• Two-layer periodicity (sometimes with two-fold superstructure): 
Al-Cu-Me (Me = Co, Rh, Ir), Al-Ni-Me (Me = Co, Fe, Rh, 
Ru), Zn-Mg-RE (RE = Y, Dy, Ho, Er, Tm, Lu) 

• Six-layer periodicity: Al-Mn-Pd, Al-Mn-Fe-Ge, Ga-Co-Cu, 
Ga-Cu-Fe-Si, Ga-V-Ni-Si 

• Eight-layer periodicity: Al-Ni-Ru, Al-Pd-Me (Me = Fe, Ru, Os). 











The approximate stability regions of the stable DQCs are shown in Fig. 2.28 
(based on Grushko and Velikanova 2004). 


2.5.5 
Structures of Icosahedral Quasicrystals 


There are different ways of classifying icosahedral quasicrystals (IQCs). Histori- 
cally the first approach was to classify them as Frank-Kasper-type QCs (FK type; 
sp-QCs) and Al-transition-metal-type QCs (Al-TM type; Mackay type; spd-QCs) 
(Henley and Elser 1986). The first case is characterized by a tetrahedral building 
principle (tetrahedral close packing) and the occurrence of Bergman clusters 
(Pauling triacontahedra). In the second case TM atoms are commonly coordinated 
icosahedrally, and Al atoms often form octahedra. The larger clusters which 
occur are of the Mackay type. 
Fig. 2.29 Shells of the double-Mackay cluster building cP138-α- 
Al-Mn-Si (projected along [−1 0 5.7]) (Sugiyama et al. 1998). The 
shells (a)-(e) are centered at the origin of the unit cell and shells (a)- 
(c) in the body center. Shells (a)-(c) together form the 54-atom Mackay 
cluster. 


With an increasing number of IQC structures known, the 6D hypercrystal 
structure type (position, size, shape, and chemical partition of atomic surfaces) 
could be used for classification. The 6D Bravais-lattice type can be either hypercubic 
primitive (iP) or hypercubic face-centered (iF). Note that the latter is frequently 
just a 2⁶-fold superstructure of the former, i.e., all 6D lattice parameters are 
doubled. In the cases where both Bravais types are found for one and the same 
composition, the iP phase is just the chemically disordered variant of the iF 
phase. 

Sticking to the 3D IQC structure, the fundamental cluster types building the 
structures of quasicrystals and their rational approximants can be used for a clas- 
sification. There are three main types, the Mackay cluster (MC), the Bergman 
cluster (BC), and the Tsai cluster (TC) (Figs. 2.29-2.31): 

• Mackay cluster (Fig. 2.29): The innermost cluster, an Al/Si 
icosahedron, is surrounded by an Al icosidodecahedron. Its 
pentagons are capped by Mn atoms at the corners of an 
icosahedron. These three shells form the Mackay cluster. 
Adding two further shells, a complex Al shell and an 
icosahedral Al/Si shell, gives the double-Mackay cluster 
(Sugiyama et al. 1998). If the icosahedral symmetry is broken 
by the innermost shell, then one gets a pseudo-Mackay 
cluster (Boudard et al. 1992). 

• Bergman cluster (Fig. 2.30): The Bergman cluster in cI160-R- 
Al5CuLi3 is built from an Al/Cu icosahedron surrounded by 
a Li pentagonal dodecahedron. The pentagons are capped again 
by Al/Cu atoms yielding an icosahedron with a similar 
diameter to the dodecahedron. These two shells together 
form a triacontahedron. The fourth shell is a truncated 
icosahedron of Al/Cu atoms, and the fifth shell is a Li 
triacontahedron with Li atoms capping the hexagonal faces of 
the truncated icosahedron. The diameters of the large and 
the small triacontahedra are in a ratio equal to τ (Audier et al. 
1988). 

Fig. 2.30 Shells of the 104-atom Samson cluster building cI160-R- 
Al5CuLi3 projected along [−1 −1 0.7] (Audier et al. 1988). All the shells 
are centered at the origin. Shells (a)-(c) together form the 44-atom 
Bergman cluster. 

Fig. 2.31 Shells of the 66-atom Tsai cluster building cI168-Cd6Yb 
projected along [−1 −1 0.7] (Palenzona 1971). All the shells are 
centered at the origin. 

• Tsai cluster (Fig. 2.31): The Tsai cluster in cI168-Cd6Yb 
consists of an orientationally disordered Cd tetrahedron 
inside a Cd pentagonal dodecahedron with an average Cd-Cd 
distance of 2.98 Å. The pentagons are capped by Yb forming 
an icosahedron with edge length 5.83 Å. The fourth shell, an 
icosidodecahedron, is formed by edge-centering Cd atoms 
with Cd-Cd distances of 3.97 Å (Takakura et al. 2001). 


One should bear in mind, however, that a clear decision is not always possible on 
whether a QC is built mainly from one or the other cluster type. A careful 
analysis of the structure of i-Al-Mn-Pd demonstrated, for instance, that it can 
be covered up to 77.1% by MCs and up to 72.8% by BCs, respectively (Loreto 
et al. 2003). A similar statistic was found for i-Al-Cu-Fe (Gratias et al. 2001). 
The cluster centers of MCs and BCs are empty or partially filled in the case of 
the Al-based QCs, and filled in the case of the Ti-based ones (Kim et al. 1998). 
Empty Al12-cluster shells (e.g., in MCs) have a covalent bonding nature while the 
centered Al13 ones (e.g., in Mg-Al-Zn quasicrystals) are metallic (Kimura et al. 
1997). Typical of Al12 clusters are very short nearest-neighbor distances below 
2.55 Å (in fcc Al, ≈ 2.86 Å), which are indicative of strong bonding. In free Als 
clusters, the atomic distances can even reach 2.435 Å (Jia et al. 2002). 

There are typical ratios, a_r/d, of the quasicrystal lattice constants (Elser 1985) to 
the average interatomic distances for QCs built from MCs (1.65-1.75), TCs 
(≈ 1.75), and BCs (≈ 2.0), respectively (Chen et al. 1987; Guo et al. 2002). Typical 
values for the optimum electron concentrations amount to 1.7-1.9 for the MC- 
type, 2.0-2.1 for the TC-type, and 2.0-2.2 for the BC-type QCs (Trambly de 
Lassadière et al. 2005). 

i-QCs descriptions can be based on cluster-decorated 3D Penrose tilings with 
edge lengths of the rhombohedra 13a,. The quasilattice constant and the lattice 
parameter of the hypercubic lattice are related by a, = a/2/2. The prototype 
structures and representatives based on 6D structure analyses (Yamamoto and Ta- 
kakura 2004) and the three cluster types, respectively, are: 

• i-Al-Pd-Mn type: a = 9.14 Å, a, = t’a/2, Fm3̄5̄ 
(2⁶-fold superstructure; Al-TM type; 3D PT with edge length 
a, decorated by pseudo-icosahedral MCs) 


Al-Pd-Me (Me = Mn, Re), Al-Cu-Me (Me = Fe, Ru, Os), 
Ti-Zr-Ni. 


• i-Zn-Mg-Ho type: a = 10.28 Å, a, = t’a/2, Fm3̄5̄ 
(2⁶-fold superstructure; Frank-Kasper type; 3D PT with edge 
length a, decorated by pseudo-BCs) 


Mg-Zn-RE (RE = Y, Nd, Gd, Ho, Dy, La, Pr, Tb, Ce), 
Zn-Mg-Hf, Zn-Mg-Zr 








and Al-Cu-Li, Mg-Ga-Zn, Mg-Al-M (M = Rh, Pd, Pt) with 
symmetry Pm3̄5̄ (i.e., these are disordered variants of the 
F-type). 

• i-Cd-Yb type: a = 5.689 Å, a, = t’a/2, Pm3̄5̄ 
(3D PT with edge length a, decorated by TCs with a 
disordered first shell) 





Cd-Me (Me = Ca, Yb), Cd-Mg-Ca, Cd-Mg-RE 
(RE = Y, Nd, Eu, Gd, Tb, Dy, Ho, Er, Tm, Yb, Lu), 
Zn-Me-Sc (Me = Ag, Au, Co, Cu, Fe, Mg, Mn, Ni, Pd, Pt), 
Cu-Ga-Mg-Sc, Zn-Mg-Ti, Ag-In-(Mg)-Me 
(Me = Ca, Yb) 
Fig. 2.32 Compositions of stable icosahedral phases. Note that only the 
upper half (A_xB_yC_z, x + y + z = 100, 50% < x < 100%, 0% < y < 50%, 
0% < z < 50%) is shown in (b). 


The approximate stability regions of stable IQCs are shown in Fig. 2.32 (based on 
Grushko and Velikanova 2004). 


2.6 
Outlook 


One of the most important recent trends in the knowledge of intermetallic com- 
pounds is the disappearance of the border between the chemistry and physics of 
this class of materials. Being a part of inorganic chemistry, intermetallic com- 
pounds reveal a very rich spectrum of physical properties which qualify them for 
prospective applications. In this respect, the structure—property relationships ap- 
pear to be an issue of ongoing interest. Despite a vast number of investigations, a 
causal link between atomic organization and physical behavior is still not finally 
established. Involving the analysis of chemical bonding into these considerations 
appears as an emerging trend in the research on intermetallic compounds. 

As an example, materials such as intermetallic clathrates or filled skutterudites 
are interpreted as covalently bonded framework anions with embedded cations. 
Consequently, they combine low charge-carrier density with phonon-glass be- 
havior, so they meet the criteria for thermoelectric applications. Moreover, the 
analysis of chemical bonding offers the possibility of recognizing low-dimensional 
structural units which may cause pronounced anisotropy in, e.g., 
magnetic or mechanical behavior. 

Quasicrystals are special representatives of complex intermetallic phases with 
inherent low-dimensional atomic arrangements. Their intensive study will in- 
creasingly stimulate the search for compounds with more pronounced cluster 
structure, i.e., significantly different chemical bonding and/or composition of 
cluster and embedding matrix. This would be the metallic counterpart to supra- 
molecular structures or fullerenes. One of the goals may be the development of 
a modular approach of forming crystalline, quasicrystalline, fractal, or amorphous 
structures which represent a bulk homogeneous counterpart to materials with 
nanocrystalline precipitates. Novel physical properties will materialize in cases 
where the length scales of the giant unit cells or even of the cluster diameters 
are larger than those relevant to the physical effect. Finally, a more precise under- 
standing of atomic interactions is expected to open new routes for the preparation of 
intermetallic compounds. 


References 


Abe, E., Tsai, A. P. (2001), JEOL News 36E, 
18. 

Abegg, R. (1904), Z. Anorg. Chem. 39, 330. 

Anon. (1931-1943), Strukturbericht, 
Supplement issues of Z. Kristallogr. 

Anon. (1940-1990), Structure Reports, 
published for the Int. Union of 
Crystallography, Cambridge. 

Asahi, R., Sato, H., Takeuchi, T., Mizutani, 
U. (2005a), Phys. Rev. B 71, 165 103. 

Asahi, R., Sato, H., Takeuchi, T., Mizutani, 
U. (2005b), Phys. Rev. B 72, 125 102. 

Asaumi, K., Kondo, Y. (1981), Solid State 
Commun. 40, 715. 

Audier, M., Pannetier, J., Leblanc, M., Janot, 
C., Lang, J. M., Dubost, B. (1988), Physica 
B 153, 136. 

Bader, R. F. W. (1999), Atoms in Molecules: A 
Quantum Theory, OUP, Oxford. 

Bancel, P. A. (1989), Phys. Rev. Lett. 63, 2741. 

Becke, A. D., Edgecombe, K. E. (1990), 
J. Chem. Phys. 92, 5397. 

Boudard, M., Deboissieu, M., Janot, C., 
Heger, G., Beeli, C., Nissen, H. U., 
Vincent, H., Ibberson, R., Audier, M., 
Dubois, J. M. (1992), J. Phys.-Condens. Matt. 
4, 10 149. 

Budnyk, S., Weitzer, F., Kubata, Ch., 
Prots, Yu., Akselrud, L. G., Schnelle, 
W., Hiebl, K., Nesper, R., Wagner, F. R., 
Grin, Yu. (2006), J. Solid State Chem. 179, 
2329. 


Burdett, J. K. (1995), Chemical Bonding in 
Solids, OUP, New York. 

Cervellino, A., Haibach, T., Steurer, W. (2002), 
Acta Crystallogr. B 58, 8. 

Chen, H. S., Phillips, J. C., Villars, P., Kortan, 
A. R., Inoue, A. (1987), Phys. Rev. B 35, 
9326. 

Dehlinger, U., Schulze, G. E. R. (1940), Z. 
Kristallogr. 102, 377. 

Demchyna, R., Leoni, S., Rosner, H., 
Schwarz, U. (2006), Z. Kristallogr. 221, 420. 

Drude, P. (1904), Ann. Phys. 14, 677. 

Dubois, J.-M. (2005), Useful Quasicrystals, 
World Scientific, Singapore. 

Duthie, J. C., Pettifor, D. G. (1977), Phys. Rev. 
Lett. 38, 564. 

Elser, V. (1985), Phys. Rev. B 32, 4892. 

Emsley, J. (1991), The Elements, 2nd edition, 
Clarendon Press, Oxford. 

Freeth, W., Raynor, G. V. (1953/54), J. Inst. 
Met. 82, 569. 

Friedel, J. (1969), in: The Physics of Metals 
(Ed.: J. M. Ziman), Cambridge University 
Press, New York, chapter 8. 

Gatti, C. (2005), Z. Kristallogr. 220, 399. 

Goldschmidt, V. M. (1926), Skr. Norske 
Vidensk.-Akad. Oslo, I. Mat.-Naturvidensk. 
Kl. no. 2. 

Goldschmidt, V. M. (1927), Ber. Dtsch. Chem. 
Ges. 60, 1263. 

Gratias, D., Puyraimond, F., Quiquandon, M., 
Katz, A. (2001), Phys. Rev. B 63, 024 202. 


Grimm, H. G., Sommerfeld, A. (1926), Z. 
Physik 36, 36. 

Grin, J., Burkhardt, U., Ellner, M., Peters, K. 
(1994), J. Alloy. Compd. 206, 243. 

Grin, Yu. (1992), Modern Perspectives in 
Inorganic Chemistry (Ed.: E. Parthé), 
Kluwer, Dordrecht. 

Grin, Yu. (2000), Wiss. Z. TU Dresden 49(1), 16. 

Grin, Yu., Schnelle, W., Cardoso Gil, R., 
Sichevich, O., Müllmann, B. D., Mosel, G., 
Kotzyba, G., Pöttgen, R. (2003), J. Solid 
State Chem. 176, 567. 

Grin, Yu., Wagner, F. R., Armbrüster, M., 
Kohout, M., Leithe-Jasper, A., Schwarz, U., 
Wedig, U., von Schnering, H. G. (2006), 
J. Solid State Chem. 179, 1707. 

Grushko, B., Velikanova, T. Y. (2004), J. Alloy. 
Compd. 367, 58. 

Gummelt, P. (1996), Geometr. Ded. 62, 1. 

Guo, J. Q., Abe, E., Tsai, A. P. (2002), Phil. 
Mag. Lett. 82, 27. 

Henley, C. L., Elser, V. (1986), Phil. Mag. B 
53, 159. 

Hermann, C. (1949), Acta Crystallogr. 2, 139. 

Hiller, H. (1985), Acta Crystallogr. A 41, 541. 

Hoppe, R. (1970), Angew. Chem. 82, 7; Angew. 
Chem. Int. Ed. 9, 25. 

Hoppe, R. (1979), Z. Kristallogr. 150, 23. 

Hume-Rothery, W. (1925), J. Inst. Met. 35, 
209. 

Hume-Rothery, W. (1930), Phil. Mag. 9, 65. 

Hume-Rothery, W., Raynor, G. V. (1938), Phil. 
Mag. 26, 143. 

IUCr (1992), Report of the Executive 
Committee for 1991, Acta Crystallogr. A 48, 
922. 

Jagodzinski, H. (1954), Acta Crystallogr. 7, 17. 

Janssen, T. (1986), Acta Crystallogr. A 42, 261. 

Janssen, T., Radulescu, O. (2004), Ferroelectrics 
305, 179. 

Jia, J., Wang, J. Z., Liu, X., Xue, Q. K., Li, 
Z. Q., Kawazoe, Y., Zhang, S. B. (2002), 
Appl. Phys. Lett. 80, 3186. 

Jones, H. (1937), Proc. Phys. Soc. A 49, 250. 

Karpov, A., Nuss, J., Wedig, U., Jansen, M. 
(2003), Angew. Chem. 115, 4966; Angew. 
Chem. Int. Ed. 42, 4818. 

Ketelaar, J. A. A. (1958), Chemical 
Constitution: An Introduction to the Theory of 
the Chemical Bond, 2nd edition, Elsevier, 
Amsterdam. 

Kienast, G., Verma, J., Klemm, W. (1961), Z. 
Anorg. Allg. Chem. 310, 143. 

Kim, W. J., Gibbons, P. C., Kelton, K. F., 
Yelon, W. B. (1998), Phys. Rev. B 58, 2578. 



Kimura, K., Takeda, M., Fujimori, M., 
Tamura, R., Matsuda, H., Schmechel, R., 
Werheit, H. (1997), J. Solid State Chem. 133, 
302. 

Kohout, M. (2004), Int. J. Quantum Chem. 97, 
651. 

Kohout, M., Savin, A. (1996), Int. J. Quant. 
Chem. 60, 875. 

Kohout, M., Wagner, F. R., Grin, Yu. (2002), 
Theor. Chem. Acc. 108, 150. 

Kossel, W. (1916), Ann. Phys. 49, 229. 

Krypyakevich, P. I. (1977), Structure Types of 
Intermetallic Compounds, Nauka, Moscow 
(in Russian). 

Laves, F., Witte, H. (1935), Metallwirtschaft 14, 
645. 

Lewis, G. N. (1916), J. Am. Chem. Soc. 38, 
762. 

Loreto, L., Farinato, R., Catallo, S., Janot, C., 
Gerbasi, G., DeAngelis, G. (2003), Physica 
B 328, 193. 

Loubeyre, P., Jean-Louis, M., LeToullec, R. 
(1993), Phys. Rev. Lett. 70, 178. 

Luck, J. M., Godreche, C., Janner, A., Janssen, 
T. (1993), J. Phys. A 26, 1951. 

Mermin, N. D. (1992), Rev. Mod. Phys. 64, 3. 

Mooser, E., Pearson, W. B. (1959), Acta 
Crystallogr. 12, 1015. 

Mott, N. F., Jones, H. (1936), The Theory of 
the Properties of Metals and Alloys, 
Clarendon Press, Oxford. 

Nesper, R. (1990), Prog. Solid State Chem. 20, 
1. 


Nesper, R. (1991), Angew. Chem. 103, 805; 
Angew. Chem. Int. Ed. Engl. 30, 789. 

Nesper, R. (2003), in: Silicon Chemistry (Ed.: 
P. Jutzi, U. Schubert), Wiley-VCH, 
Weinheim, pp. 171-180. 

Nowotny, H., Mohrnheim, A. (1939), Z. 
Kristallogr. 100, 540. 

O'Keeffe, M., Hyde, B. G. (1996), in: Crystal 
Structures, Mineralogical Society of 
America, Washington DC. 

Ormeci, A., Rosner, H., Wagner, F. R., 
Kohout, M., Grin, Yu. (2006), J. Phys. 
Chem. A 110, 1100. 

Palenzona, A. (1971), J. Less-Common Met. 25, 
367. 

Parthé, E., Chabot, B. A., Censual, K. (1985), 
Chimia 39, 164. 

Parthé, E., Chabot, B., Penzo, M., Censual, 
K., Gladyshevskii, R. (1993/94), Typix 
Standardized Data and Crystal Chemical 
Characterization of Inorganic Structure Types, 
Springer, Heidelberg. 



Pauling, L. (1960), The Nature of the Chemical 
Bond, 3rd edition, Cornell University Press, 
Ithaca. 

Paxton, A. T., Methfessel, M., Pettifor, D. G. 
(1997), Proc. R. Soc. Lond. A 453, 1493. 

Pearson, W. B. (1964), Acta Crystallogr. 17, 1. 

Penrose, R. (1974), Bull. Inst. Math. Appl. 10, 
266. 

Pettifor, D. G. (1990), J. Chem. Soc. Faraday 
Trans. 86, 1209. 

Phillips, J. C. (1968), Phys. Rev. Lett. 20, 550. 

Phillips, J. C. (1970), Rev. Mod. Phys. 42, 
317. 

Reichlin, R., Ross, M., Martin, S., Goettel, 
K. A. (1986), Phys. Rev. Lett. 56, 2858. 

Rosner, H., Koudela, D., Schwarz, U., 
Handstein, A., Hanfland, M., Opahle, I., 
Koepernik, K., Kuzmin, M. D., Müller, 
K.-H., Mydosh, J. A., Richter, M. (2006), 
Nature Physics 2, 469. 

Savin, A., Flad, H. J., Flad, J., Preuss, H., 
von Schnering, H. G. (1992), Angew. Chem. 
104, 185; Angew. Chem. Int. Ed. Engl. 31, 
185. 

Savin, A., Nesper, R., Wengert, S., Fässler, 
T. F. (1997), Angew. Chem. 109, 1892; 
Angew. Chem. Int. Ed. Engl. 36, 1808. 

Schäfer, H., Eisenmann, B., Müller, W. 
(1973), Angew. Chem. 85, 742; Angew. Chem. 
Int. Ed. Engl. 12, 694. 

Schmider, H. L., Becke, A. D. (2000), J. Mol. 
Struct. (TheoChem) 527, 51. 

Schubert, K. (1964), Kristallstrukturen 
zweikomponentiger Phasen, Springer, Berlin. 

Schwarz, U., Bräuninger, S., Syassen, K., 
Kniep, R. (1998), J. Solid State Chem. 137, 
104. 

Simon, A. (1983), Angew. Chem. 95, 94; 
Angew. Chem. Int. Ed. Engl. 22, 95. 

Skriver, H. L. (1985), Phys. Rev. B 31, 1909. 

Socolar, J. E. S. (1988), Bull. Am. Phys. Soc. 
33, 463. 

Socolar, J. E. S., Lubensky, T. C., Steinhardt, 
P. J. (1986), Phys. Rev. B 34, 3345. 

Söderlind, P., Ahuja, R., Eriksson, O., Wills, 
J. M., Johansson, B. (1994), Phys. Rev. B 50, 
5918. 

Sommer, A. H. (1943), Nature 152, 215. 

Spicer, W. E., Sommer, A. H., White, J. G. 
(1959), Phys. Rev. 115, 57. 

Stein, F., Palm, M., Sauthoff, G. (2004), 
Intermetallics 12, 713. 

Steurer, W. (2004), Z. Kristallogr. 219, 391. 

Steurer, W., Haibach, T. (1999), 
Crystallography of quasicrystals, in: Physical 
Properties of Quasicrystals (Ed.: Z. M. 
Stadnik), Springer, Heidelberg, pp. 51-89. 

St. John, J., Bloch, A. N. (1974), Phys. Rev. 
Lett. 33, 1095. 

Sugiyama, K., Kaji, N., Hiraga, K. (1998), Acta 
Crystallogr. C 54, 445. 

Takakura, H., Guo, J. Q., Tsai, A. P. (2001), 
Phil. Mag. Lett. 81, 411. 

Teatum, E., Gschneidner, K., Waber, J. (1960), 
Report LA-2345, US Dept. of Commerce, 
Washington DC. 

Trambly de Lassadière, G., Nguyen-Manh, D., 
Mayou, D. (2005), Progr. Mater. Sci. 50, 679. 

Tsai, A. P. (2003), Acc. Chem. Res. 36, 31. 

van Arkel, A. E. (1956), Molecules and Crystals 
in Inorganic Chemistry, Interscience, New 
York. 

van Vechten, J. A. (1969a), Phys. Rev. 182, 
891. 

van Vechten, J. A. (1969b), Phys. Rev. 187, 
1007. 

Villars, P. (1983), J. Less Common Met. 92, 
215. 

Villars, P. (1984a), J. Less Common Met. 99, 
33. 

Villars, P. (1984b), J. Less Common Met. 102, 
199. 

Villars, P., Calvert, L. D. (1996), Pearson’s 
Handbook of Crystallographic Data, 2nd 
edition, ASM International, Materials Park, 
Ohio. 

von Schnering, H. G., Cardoso-Gil, R., Hönle, 
W. (1995), Angew. Chem. 107, 81; Angew. 
Chem. Int. Ed. 34, 103. 

Wade, K. (1976), Adv. Inorg. Chem. Radiochem. 
18, 1. 

Wells, A. F. (1968), Acta Crystallogr. B 24, 50. 

Wolff, P. M. de (1974), Acta Crystallogr. A 30, 
777. 

Wondratschek, H. (1983), International Tables 
for Crystallography (Ed.: T. Hahn), Reidel, 
Dordrecht, chapter 8.3.2. 

Wosylus, A., Prots, Yu., Burkhardt, U., 
Schnelle, W., Schwarz, U., Grin, Yu. (2006), 
Solid State Sciences 8, 773. 

Yamamoto, A. (1996), Acta Crystallogr. A 52, 
509. 

Yamamoto, A., Takakura, H. (2004), 
Ferroelectrics 305, 223. 

Zintl, E. (1939), Angew. Chem. 52, 1. 

Zintl, E., Kaiser, H. (1933), Z. Anorg. Allg. 
Chem. 211, 113. 

Zunger, A. (1980), Phys. Rev. B 22, 5839. 


3 
Solidification and Grown-in Defects 


Thierry Duffar 


This chapter is dedicated to my colleague Marie-Danielle Dupouy, who died in Sep- 
tember 2004. She devoted her scientific activity to the study of the effect of liquid con- 
vection on metallurgical alloy structures. We were working together, with the help of 
numerous colleagues and students, on an academic project that ultimately fed the pres- 
ent text. 


3.1 
Introduction: the Solid-Liquid Interface 


Most solid alloys are prepared from the liquid phase, and the liquid-to-solid trans- 
formation, called solidification, is the topic of this chapter. 

Obviously, the structure, defects, and properties found in the final alloy are the 
result of what happened in the elaboration process and therefore first in the liq- 
uid phase, where transport phenomena, diffusion and convection, have a strong 
influence on the final structure. The main and special solidification features oc- 
cur at the transition between the solid and the liquid phases, the so-called solid- 
liquid interface. The physics of what happens later in the solid phase is covered 
by other chapters in this book: this one will deal essentially with the solid—liquid 
interface and its interaction with the liquid. 

This introduction will be completed with some fundamental notions relating to 
this interface. Section 3.2 will present the classical theories of structures in alloy 
solidification: stability of the solid-liquid interface, growth of dendrites and eu- 
tectics and rapid solidification; it will be seen that the diffusion in the liquid state 
plays a major role in these processes. Section 3.3 will be focused on the defects 
resulting from solidification processes, and the role of convection in the liquid 
will be stressed. 


3.1.1 
Structure of the Solid-Liquid Interface 


There are essentially two different structures of a solid—liquid interface. The 
“faceted” interface is represented by a flat, smooth surface, at the atomic scale. 
Fig. 3.1 AFM observation of a (101) facet on a growing monoclinic 
lysozyme crystal. An unsaturable spiral step, likely to be associated with 
a dislocation, is visible (Chernov et al. 2004). 


Therefore there is a sharp, total change in the degree of order when crossing the 
interface, from the fully crystalline solid side to the liquid (Fig. 3.1). 

In “rough” interfaces, the transition between the fully ordered solid and the 
disordered liquid is less abrupt, extending over several atomic distances (Fig. 
3.2). In this layer, atoms, molecules, or building units are experiencing an envi- 
ronment of which the degree of order is fluctuating with time. Results of molec- 
ular dynamics simulations have shown that the unit movements do not change 
markedly when crossing the interface (at the melting temperature). Structural 
and kinetic models of such diffuse interfaces have been proposed; see for exam- 
ple the pioneering work of J. W. Cahn (1960), and more sophisticated models of 





Fig. 3.2 Liquid-solid transition through a diffuse zone with a thickness 
of a few structural units (Hoyt et al. 2004). 
Table 3.1 Observed morphology of the crystal surface during growth of 
various materials in several processes. 

Reduced entropy α   Material              Feeding phase   Morphology 

~1                  metal                 molten liquid   rough 
~1                  crystalline polymer   molten liquid   rough 
2-3                 semiconductor         solution        rough to faceted 
~10                 metal                 vapor           faceted 




the interface can be found in the review article by Bennema (1993), but for the 
purpose of the present discussion simplified surface representations will be used. 

Theoretical models have been proposed in order to predict the interface rough- 
ness. The most classical, from K. Jackson (1958), takes into account the transfor- 
mation entropy ΔS (vaporization, solution, melting) normalized by the ideal gas 
constant R [Eq. (1)]. 

$$\alpha = \frac{\Delta S}{R} \qquad (1)$$

Faceting occurs if α > 2. 
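As a minimal sketch of how this criterion is applied, the Python fragment below evaluates the reduced entropy α = ΔS/R and compares it with the threshold of 2 quoted above. The function names and the two entropy values are hypothetical, chosen only so that α comes out near the "~1" and "~10" rows of Table 3.1; they are not measured data.

```python
R_GAS = 8.314462618  # ideal gas constant, J mol^-1 K^-1


def jackson_alpha(delta_s):
    """Reduced transformation entropy alpha = delta_S / R (delta_S in J mol^-1 K^-1)."""
    return delta_s / R_GAS


def predicted_morphology(alpha, threshold=2.0):
    """Return 'faceted' if alpha exceeds the threshold, otherwise 'rough'."""
    return "faceted" if alpha > threshold else "rough"


# Illustrative (assumed) transformation entropies, not taken from the source.
examples = {
    "melting, alpha near 1": 9.0,
    "vapor growth, alpha near 10": 85.0,
}

for label, delta_s in examples.items():
    alpha = jackson_alpha(delta_s)
    print(f"{label}: alpha = {alpha:.2f} -> {predicted_morphology(alpha)}")
```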


3.1.2 
Kinetics of the Solid-Liquid Interface 


Solidification generally occurs under thermal or chemical gradients and cannot 
be considered as an equilibrium process. However, a local thermodynamic equi- 
librium on the interface can be considered in the case where, on arrival, a build- 
ing unit can easily find a satisfactory position in its surroundings. By denoting 
the surface diffusion coefficient as D_s and the typical size of a structural unit as 
a, the surface velocity v_s of the unit can be compared with the interface velocity v_i 
by Eq. (2). 

$$\frac{v_s}{v_i} = \frac{D_s}{a\,v_i} \qquad (2)$$

With typical values of D_s and a (5 × 10⁻¹⁰ m² s⁻¹ and 5 × 10⁻¹⁰ m) it can be 
seen that for interface velocities higher than 1 m s⁻¹ the local thermodynamic 
equilibrium does not apply. This case will be treated in Section 3.2.3 (Rapid 
Solidification). 
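The order-of-magnitude argument can be reproduced in a few lines. The sketch below assumes that the typical values are read as D_s = 5 × 10⁻¹⁰ m² s⁻¹ and a = 5 × 10⁻¹⁰ m, which gives a surface velocity D_s/a of 1 m s⁻¹; the function names are hypothetical.

```python
def surface_velocity(d_s, a):
    """Characteristic surface velocity of a structural unit, v_s = D_s / a (m/s)."""
    return d_s / a


def local_equilibrium_holds(v_i, d_s=5e-10, a=5e-10):
    """True if the unit moves on the surface faster than the interface advances."""
    return surface_velocity(d_s, a) > v_i


# Assumed typical values: D_s in m^2/s, a in m.
print(surface_velocity(5e-10, 5e-10))   # characteristic velocity in m/s
print(local_equilibrium_holds(1e-4))    # slow, conventional growth rate
print(local_equilibrium_holds(10.0))    # rapid solidification regime
```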

For lower velocities, thermodynamic data, especially the phase diagram in the 
case of alloys, can be used safely. For example, the temperature T_i of an alloy 
interface can be computed from the liquidus slope m (which is generally negative; 
see Fig. 3.3), by Eq. (3). 

$$T_i = T_m + m\,c_i \qquad (3)$$

Fig. 3.3 Typical phase diagram used in this chapter (temperature T versus composition). 


However, thermodynamic equilibrium is established only in the case of a resting 
interface. In order to make it move, it is necessary to introduce a slight deviation 
from equilibrium. In the case of solidification this is kinetic undercooling, mea- 
sured as the difference between the thermodynamic melting temperature T_m and 
the actual temperature T_i of the moving interface [Eq. (4)]. 

$$\Delta T = T_m - T_i \qquad (4)$$


In the case of the faceted interface, growth can occur only by nucleation of a 
cluster on the perfectly flat surface. When this cluster is created, the whole sur- 
face is quickly covered, so that the factor limiting the velocity is the cluster 
creation step. This gives an interface velocity varying exponentially with under- 
cooling, and then growth only occurs at high values of undercooling, when the 
cluster size is small enough to have some chance of appearing spontaneously 
thanks to the density fluctuations of the liquid phase. 

Undercooling values of several tens of kelvin result; experiments show, however, that the 
undercooling for faceted interfaces is generally of the order of a few degrees only. 
In practice, facets contain defects such as steps and kinks or unsaturable defects 
such as screw dislocations (see Fig. 3.1). Following the pioneering work of 
Burton, Cabrera, and Frank (Burton et al. 1951), numerous authors have derived 
relationships, such as Eq. (5), between the velocity and the undercooling for an 
imperfect faceted interface. 

$$v_i = K_f\,\Delta T^{\,n}, \qquad 1 < n < 4 \qquad (5)$$

K_f is the kinetic coefficient, depending on the number of defects on the surface 
and also on its crystallographic orientation. 

In the case of the rough interface, the interface velocity is calculated from the 
balance of the structural units coming from the liquid to the interface and leaving 
the interface for the liquid phase. To a first order, with some simplifying assump- 
tions ([111] surface composed of cubic structural units), Eq. (6) can be shown to 
hold (Kurz and Fisher 1998). 


$$v_i = K_r\,\Delta T \qquad (6)$$

K_r is of the order of 1, showing that for classical metallurgical or crystal growth 
rates (10⁻⁶ to 10⁻⁴ m s⁻¹) the kinetic undercooling is very low for rough interfaces 
and can be neglected. 
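To make the size of this undercooling concrete, the linear law can be inverted for the growth rates quoted above. This is only a sketch: it assumes a kinetic coefficient of exactly 1 expressed in m s⁻¹ K⁻¹ (the text gives only its order of magnitude, without units), and the function name is hypothetical.

```python
def rough_interface_undercooling(v_i, k_r=1.0):
    """Kinetic undercooling (K) from the linear law v_i = K_r * delta_T.

    v_i is the interface velocity in m/s; k_r is assumed to be in m s^-1 K^-1.
    """
    return v_i / k_r


# Growth rates spanning the range quoted for conventional processes.
for v_i in (1e-6, 1e-5, 1e-4):
    dt = rough_interface_undercooling(v_i)
    print(f"v_i = {v_i:.0e} m/s -> kinetic undercooling = {dt:.0e} K")
```

With these assumptions the undercooling stays far below a millikelvin, which is why it is usually dropped for rough interfaces.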


3.1.3 
Chemistry of the Solid—Liquid Interface: the Segregation Problem 


Figure 3.3 shows a typical, simplified phase diagram of a binary alloy. For a given 
temperature, T), the composition of the solid is related to the composition of the 
liquid by the segregation coefficient k [Eq. (7)]. 


$$k = \frac{c_s}{c_l} \qquad (7)$$


In most cases, the solid is less concentrated than the liquid (k < 1) and this situ- 
ation will be considered throughout this chapter; the reverse situation (k > 1) 
leads to symmetric results. In case of faceted growth, k is likely to depend on the 
crystallographic orientation of the interface. 

It follows that the advancing interface rejects solute in the liquid and a balance 
of the incorporated and rejected solute gives the flux toward the liquid [Eq. (8)]. 





$$-D_l \left(\frac{\partial c}{\partial z}\right)_i = v_i\,c_l\,(1 - k) \qquad (8)$$


This flux generates a boundary layer in the liquid, close to the interface. If diffu- 
sive conditions prevail in the liquid, the boundary layer thickness is of the order 
of ô [Eq. (9)]. 


$$\delta = \frac{D_l}{v_i} \qquad (9)$$


In this case, after an initial transient (whose length is of the order of
$D_l/(k v_i)$) corresponding to the building of the solute boundary layer,
Tiller et al. (1953) have shown that a steady state is obtained, with a growing
solid of initial composition $c_0$ in equilibrium with a liquid of composition
$c_0/k$. This result is valid as long as the diffusive solute boundary layer is
not affected by convection. According to Eq. (9), this is more likely for high
interface velocities; yet even with a growth rate of $10^{-3}$ m s$^{-1}$ and a
typical diffusion coefficient of $10^{-8}$ m$^2$ s$^{-1}$, the boundary layer is
10 µm thick and likely to be affected by convection, which vanishes only very
close to the interface where the solid is growing.
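
As a quick check of the order of magnitude quoted above, the diffusive boundary
layer thickness can be evaluated as the ratio of the liquid diffusion
coefficient to the interface velocity. The short Python sketch below is only
illustrative: the function name is hypothetical and the inputs are the values
quoted in the text.

```python
def boundary_layer_thickness(d_liquid: float, v_interface: float) -> float:
    """Order of magnitude of the diffusive boundary layer, delta = D_l / v_i (m)."""
    if v_interface <= 0.0:
        raise ValueError("interface velocity must be positive")
    return d_liquid / v_interface


# Values quoted in the text: D_l = 1e-8 m^2/s, v_i = 1e-3 m/s
delta = boundary_layer_thickness(d_liquid=1e-8, v_interface=1e-3)
print(f"boundary layer thickness: {delta * 1e6:.1f} micrometres")
```

Slower growth gives a proportionally thicker layer, which is why convection
matters even more at ordinary crystal growth rates.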

The very general case, then, is that the liquid is mixed by some natural or 
forced convection and the boundary layer thickness depends on the intensity of 
convection. If the mixing is strong enough, the solute boundary layer can be 
neglected and the liquid can be considered as homogeneous. Scheil and Gulliver 
derived Eq. (10) for the relationship between solid composition $c_s$ and solidified
fraction $f_s$ (Scheil, 1942).


$$c_s = k\,c_0\,(1 - f_s)^{k-1} \qquad (10)$$


Figure 3.4 gives the shape of this segregation behavior for different values of k. 
This equation is not valid right up to the end of the ingot, where the concentra- 
tion tends toward infinity. In practice different phenomena may occur when the 
concentration increases, such as precipitation of eutectic material, variation of k 
with c, or destabilization of the interface (see Section 3.2.1). 
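
The Scheil-Gulliver law can be tabulated in a few lines. The sketch below
assumes the usual form $c_s = k c_0 (1 - f_s)^{k-1}$; the function name and the
sample values (k = 0.5, unit initial composition) are hypothetical and chosen
only to show the divergence near the end of the ingot.

```python
def scheil_solid_composition(f_s: float, c0: float, k: float) -> float:
    """Solid composition at solidified fraction f_s for a fully mixed liquid."""
    if not 0.0 <= f_s < 1.0:
        raise ValueError("f_s must lie in [0, 1): the law diverges at f_s = 1")
    return k * c0 * (1.0 - f_s) ** (k - 1.0)


# Illustrative profile for an assumed k = 0.5 and c0 = 1 (arbitrary units)
for f_s in (0.0, 0.5, 0.9, 0.99):
    print(f_s, scheil_solid_composition(f_s, c0=1.0, k=0.5))
```

The first solid forms at composition $k c_0$, and the composition rises without
bound as $f_s$ approaches 1, which is the limitation discussed above.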

For moderate convection, Burton, Prim and Slichter (Burton et al. 1953)
have shown that Eq. (10), and consequently Fig. 3.4, can be used provided that k
is replaced by an “effective” segregation coefficient, $k_{eff}$, which depends
on the boundary layer thickness. Its value varies between 1 (the diffusive case
seen above) and k (fully mixed liquid, Scheil–Gulliver law); a general
discussion of these matters, with the calculation of $k_{eff}$ in some
configurations, is given by Garandet et al. (1994).
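
The effective coefficient is commonly written in the Burton-Prim-Slichter form
$k_{eff} = k / [k + (1-k)\exp(-v_i\delta/D_l)]$. This expression is not printed
in the text above; it is quoted here as the standard result, and the Python
sketch below (hypothetical function name, illustrative numbers) only checks
that it reproduces the two limits just mentioned.

```python
import math


def effective_segregation_coefficient(k: float, v_interface: float,
                                      delta: float, d_liquid: float) -> float:
    """Burton-Prim-Slichter effective segregation coefficient (dimensionless)."""
    peclet = v_interface * delta / d_liquid  # dimensionless layer thickness
    return k / (k + (1.0 - k) * math.exp(-peclet))


k = 0.1
# Vanishing boundary layer (fully mixed liquid): k_eff tends to k
print(effective_segregation_coefficient(k, 1e-5, 0.0, 1e-8))
# Very thick boundary layer (purely diffusive liquid): k_eff tends to 1
print(effective_segregation_coefficient(k, 1e-5, 1.0, 1e-8))
```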

In the above explanations, cases have been considered where the diffusion of 
the solute in the solid is negligible, which generally is the case (substitutional 
alloys). However, this is not valid for solutes of small atoms in lattices of large 
atoms, as for example in the case of C in Fe (interstitial alloys). If solid diffusion 
is rapid, on the scale of the solidification time, the solid can be considered homo- 
geneous at any time, especially at the end of solidification, and in equilibrium 
with the liquid interface composition. In the case of ideal mixing of the liquid, 
this leads to the well known lever rule, Eq. (11). 





$$c_s f_s + c_l f_l = c_s f_s + \frac{c_s}{k}(1 - f_s) = c_0 \qquad (11)$$
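
Solving the lever rule for the solid composition gives
$c_s = k c_0 / [1 - (1-k) f_s]$. The sketch below compares this with the
Scheil-Gulliver law for the same assumed k; both function names and the values
are hypothetical and serve only to show how strongly the two limits differ near
the end of solidification.

```python
def lever_solid_composition(f_s: float, c0: float, k: float) -> float:
    """Homogeneous solid in equilibrium with a fully mixed liquid (lever rule)."""
    if not 0.0 <= f_s <= 1.0:
        raise ValueError("f_s must lie in [0, 1]")
    return k * c0 / (1.0 - (1.0 - k) * f_s)


def scheil_solid_composition(f_s: float, c0: float, k: float) -> float:
    """No diffusion in the solid, fully mixed liquid (Scheil-Gulliver law)."""
    if not 0.0 <= f_s < 1.0:
        raise ValueError("f_s must lie in [0, 1)")
    return k * c0 * (1.0 - f_s) ** (k - 1.0)


for f_s in (0.0, 0.5, 0.9):
    print(f_s,
          lever_solid_composition(f_s, 1.0, 0.5),
          scheil_solid_composition(f_s, 1.0, 0.5))
```

With the lever rule the solid ends exactly at the initial composition, whereas
the Scheil-Gulliver composition keeps increasing.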


The chemical segregation problem is unavoidable. Strong mixing of the melt in- 
creases the local homogeneity of the solid but increases the macrosegregation all 
along the ingot. The only way to reduce the segregation is to feed the crucible 
continuously with fresh solvent in order to keep the liquid composition constant, 
but this leads to process complexity and this solution is seldom used. 
Fig. 3.4 Concentration profile along a solidified ingot for various values
of the segregation coefficient k (Garandet et al. 1994).


3.1.4 
Temperature of the Solid—Liquid Interface 


Nucleation theory (for details, see Chapter 7) can be applied to solidification
and, from Gibbs’s classical treatment, it appears that the critical nucleation
radius in a liquid showing a certain undercooling $\Delta T$ can be expressed as
Eq. (12), where $\Gamma$ is the Gibbs coefficient (close to $1 \times 10^{-7}$ K m
for metals).


$$r^* = \frac{2\gamma_{sl}}{\Delta S\,\Delta T} = \frac{2\Gamma}{\Delta T} \qquad (12)$$


It follows that any curved solid-liquid interface (even at rest) in equilibrium with 
its liquid shows a capillary undercooling depending on the local curvature
$\kappa$, according to Eq. (13).


$$\Delta T_{cap} = \Gamma\kappa \qquad (13)$$


This effect is significant only for a radius of curvature of the interface less than 
10 µm.
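
The size of the capillary effect can be estimated numerically. The sketch below
assumes a spherical cap, for which the curvature is taken as 2/r (consistent
with the critical radius expression above), and uses the Gibbs coefficient of
about $1 \times 10^{-7}$ K m quoted for metals; the function name is
hypothetical.

```python
GIBBS_COEFFICIENT = 1e-7  # K m, order of magnitude quoted for metals


def capillary_undercooling(radius: float, gamma: float = GIBBS_COEFFICIENT) -> float:
    """Capillary undercooling (K) of a spherical cap of given radius (m).

    Assumes curvature = 2 / radius, i.e. Delta T = Gamma * 2 / r.
    """
    if radius <= 0.0:
        raise ValueError("radius must be positive")
    return gamma * 2.0 / radius


for radius in (1e-4, 1e-5, 1e-6, 1e-7):
    print(f"r = {radius:.0e} m -> {capillary_undercooling(radius):.3g} K")
```

Under these assumptions the undercooling is only a few hundredths of a kelvin
at a radius of 10 µm and grows in inverse proportion to the radius.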

From the above considerations, it follows that the temperature of the solid— 
liquid interface depends on the chemical composition, on its velocity, and on its 
curvature. For example, the interface temperature of an alloy growing with a 
rough interface should be expressed as Eq. (14), from Eqs. (3), (6), and (13), 
where $T_m$ is the thermodynamic melting point of the pure substance.


$$T_i = T_m + m_l c_l^i - \frac{v_i}{K_r} - \Gamma\kappa \qquad (14)$$





This equation of state of the solid–liquid interface defines its position and shape.
The next section presents some classical studies of the solid alloy structures
that result from the interaction of Eq. (14) with the thermal and chemical
fields surrounding the interface.


3.2 
Solidification Structures 


The solid–liquid interface morphology changes with the process parameters: alloy
composition, thermal gradient, and solidification rate. Figure 3.5 shows how it
changes when the solidification rate increases. Similar diagrams can be plotted
as a function of the alloy composition or thermal gradient.

The most important characteristics of the diagram in Fig. 3.5 are derived below:
the interface destabilization, which corresponds to the transition between planar
(a) and cellular (b) interfaces; the periodicity of the cellular structure (b);
the characteristics of the dendrite field (c); and the transition toward rapid
solidification (d). A further case is included, in which a eutectic structure is
obtained, with two different solid phases growing simultaneously from a single
liquid.

In practical applications, three regimes are used in order to grow materials. 

• Planar interface growth is used to get high-quality single
  crystals for electronics, optics, detectors, and so on.




Fig. 3.5 Evolution of the solid-liquid interface morphology with 
increasing velocity. (a) Planar interface generally observed in single- 
crystal growth; (b) cellular structures; (c) dendrites observed in 
classical metallurgical processes; (d) flat interface obtained in rapid 
solidification. 


• Dendritic growth is the most common regime because it is used
  in virtually all metallurgical processes: steel, cast iron, Al-
  and Cu-based alloys, and others. The reason is that the
  dendritic regime leads to fine microstructures, accompanied
  by eutectic areas, and this gives excellent
  mechanical properties to these alloys.

• Rapid solidification is used in order to obtain very small
  microstructures and amorphous materials. It is also a way to
  get metastable phases that cannot be solidified at lower
  growth rates.


3.2.1 
The Interface Stability and Cell Periodicity 


Morphological destabilization of the solid–liquid interface is a mechanism that
increases the interface area, so that solute or heat is exchanged more
efficiently with the surrounding liquid. It generally leads to important
modifications of the structural quality of an alloy and has received continuous
attention, considering the practical importance of solidification in materials
processing (Coriell and McFadden 1993).

Destabilization occurs when a perturbation of the interface is likely to expand 
toward the liquid instead of decreasing and vanishing. To the first order, this can 
be checked by comparing the actual thermal field in the sample and the local 
melting temperature, which depends essentially, for a flat interface, on the local 
chemical composition [Eq. (3)]. Due to the chemical boundary layer at the inter- 
face, the chemical composition decreases, and the melting temperature increases, 
with the distance to the interface. The variation of melting temperature at the in- 
terface, toward the liquid, is given by Eq. (15), from Eqs. (3) and (8). 
$$\left(\frac{dT_m}{dz}\right)_i = \frac{dT_m}{dc}\left(\frac{dc}{dz}\right)_i = -m_l\,\frac{v_i\,c_l^i\,(1-k)}{D_l} \qquad (15)$$


If the thermal gradient in the liquid, at the interface, is lower than this value, a 
liquid layer close to the interface is at a temperature lower than its melting tem- 
perature and any perturbation of the interface will grow toward the liquid. On the 
contrary, in a high enough liquid thermal gradient, there is no undercooled liquid 
layer. It follows that the interface stability criterion can be written as Eq. (16).


$$\nabla T_l > -m_l\,\frac{v_i\,c_l^i\,(1-k)}{D_l} \qquad (16)$$


As explained in the Introduction, the composition at the interface, $c_l^i$, is
likely to change all through the solidification process (see for example Eq. (10)
in the case of complete mixing). For a solute rejected into the liquid, the
concentration increases continuously and destabilization occurs toward the end
of the ingot. It is only in the case of pure diffusion in the liquid that the
composition at the interface reaches a steady-state value, $c_0/k$.
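
The stability criterion lends itself to a simple numerical check. The sketch
below assumes the criterion in the form "liquid thermal gradient larger than
$-m_l v_i c (1-k)/D_l$", with c the liquid composition at the interface; the
function names and the material values (liquidus slope of -3 K per wt%, k = 0.1,
diffusion coefficient of 3e-9 m^2/s) are hypothetical illustrations, not data
from the text.

```python
def critical_thermal_gradient(m_l: float, v_interface: float, c_interface: float,
                              k: float, d_liquid: float) -> float:
    """Minimum liquid thermal gradient (K/m) that keeps a planar front stable."""
    return -m_l * v_interface * c_interface * (1.0 - k) / d_liquid


def is_planar_front_stable(grad_t_liquid: float, m_l: float, v_interface: float,
                           c_interface: float, k: float, d_liquid: float) -> bool:
    """True when no constitutionally undercooled liquid layer exists."""
    return grad_t_liquid > critical_thermal_gradient(
        m_l, v_interface, c_interface, k, d_liquid)


# Hypothetical alloy: m_l = -3 K/wt%, c = 1 wt%, k = 0.1, D_l = 3e-9 m^2/s
params = dict(m_l=-3.0, v_interface=1e-5, c_interface=1.0, k=0.1, d_liquid=3e-9)
print(critical_thermal_gradient(**params))        # about 9e3 K/m
print(is_planar_front_stable(1.0e4, **params))
print(is_planar_front_stable(5.0e3, **params))
```

Because the critical gradient is proportional to both the velocity and the
interface composition, the front destabilizes either when growth is accelerated
or when solute accumulates toward the end of the ingot.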

These destabilization mechanisms and criteria were initially proposed by Tiller
et al. (1953) and are in acceptable agreement with the experimental observation of
destabilization at slow growth rates. However, they do not take into account the
energy cost linked to the increase in the interface area; taking this into account
makes it possible to find the typical size of the resulting structure. This
analysis was performed by Mullins and Sekerka (1964), who studied the effect of
a sinusoidal perturbation of the interface, of amplitude $\varepsilon$ and period
$\lambda$, on the thermal and chemical fields around it, taking into account the
effect of the interface curvature on its temperature [Eq. (13)]. This gives the
rate of variation of the perturbation amplitude as in Eq. (17), where
$\overline{\nabla T}$ is a mean (thermal-conductivity-weighted) thermal gradient
taken from the liquid and solid regions, and the $\xi$ are positive functions of
the growth rate.


$$\frac{1}{\varepsilon}\frac{d\varepsilon}{dt} \propto m\,\nabla c\,\xi_c - \overline{\nabla T}\,\xi_T - \Gamma\,\frac{4\pi^2}{\lambda^2} \qquad (17)$$





Looking at the right-hand side of Eq. (17), it appears that the first term, related to 
the solute field, is positive and therefore is the source of the destabilization, 
whereas the other two terms, related to the thermal field and to the capillarity, 
are negative and thus stabilize the interface. 

From the physical point of view, the interface deformation is an increase in its 
area in order to improve the rejection of solute toward the liquid. However it is 
then more difficult to extract the latent heat toward the solid: this explains the 
stabilizing effect of the thermal field. Furthermore, creating surface has an ener- 
getic cost, reflected in the capillary term, which is also stabilizing. 
From this expression an improved stability criterion, Eqs. (18) and (19), can be
deduced (Sekerka 1965).


$$\frac{\Delta H_m}{2\lambda_l} + \frac{\nabla T_l}{v_i} > -m_l\,\frac{(\lambda_s + \lambda_l)}{2\lambda_l}\,\frac{c_l^i\,(1-k)}{D_l}\,S(A) \qquad (18)$$

with:

$$A = \frac{\Gamma\,k\,v_i}{-m_l\,D_l\,(1-k)\,c_0} \qquad (19)$$


This expression differs from Eq. (16) by taking into account a mean thermal con- 
ductivity and the latent heat of transformation but, more importantly, by the func- 
tion S(A), which takes into account the stabilizing effect of capillarity. 


This function is plotted in Fig. 3.6. When A = 1, S(A) = 0 and the interface is
stable. This occurs for high values of the interface velocity and will be discussed
in Section 3.2.3. At low growth rate, the latent heat term becomes negligible,
A → 0, S(A) → 1, and the stability criterion takes a form similar to Eq. (16).


Fig. 3.6 Variation of Sekerka’s stability function with the dimensionless
parameter A (Sekerka, 1965).

Finally, the periodicity of the fluctuation can be obtained by setting Eq. (17)
equal to zero, which, for small growth rates, can be written as Eq. (20) (Kurz and
Fisher 1998).


$$0 = m\,\nabla c - \nabla T - \Gamma\,\frac{4\pi^2}{\lambda^2} \qquad (20)$$





Equation (21) follows. 


$$\lambda = 2\pi\sqrt{\frac{\Gamma}{m\,\nabla c - \nabla T}} \qquad (21)$$


It can be noted that $\lambda$ takes a real value if Eq. (22) is true; this is the
condition for instability already given in Eq. (16).


$$m\,\nabla c > \nabla T \qquad (22)$$


For typical growth parameters used in metallurgy, the periodicity is of the order
of 1 µm.
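
The periodicity can be evaluated directly from the marginal stability condition,
$\lambda = 2\pi\sqrt{\Gamma/(m\nabla c - \nabla T)}$. In the sketch below the
function name is hypothetical, the product $m\nabla c$ is passed as a single
positive driving term (following the sign convention of the instability
condition above), and the numbers are illustrative only.

```python
import math


def perturbation_wavelength(gamma: float, solutal_term: float,
                            grad_t: float) -> float:
    """Marginally stable wavelength (m) for a destabilized planar interface.

    gamma        : Gibbs coefficient (K m)
    solutal_term : destabilizing term m * grad(c) (K/m), taken positive
    grad_t       : stabilizing thermal gradient (K/m)
    """
    driving = solutal_term - grad_t
    if driving <= 0.0:
        raise ValueError("interface is stable: no real wavelength exists")
    return 2.0 * math.pi * math.sqrt(gamma / driving)


# Illustrative values: Gamma = 1e-7 K m, net driving term 1e5 K/m
print(perturbation_wavelength(1e-7, 1.1e5, 1.0e4))  # a few micrometres
```

The function raises an error when the thermal gradient exceeds the solutal
term, which is simply the stability condition restated.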


3.2.2 
Dendrites 


Destabilization of the interface leads to the formation of cell and dendrite struc- 
tures. From the practical point of view, cells create defects in semiconductor crys- 
tals and crystal growers carefully avoid working under destabilizing conditions. 
Dendrites have been studied intensively in materials processing because they af- 
fect the properties of the alloy. This influence can be either negative (e.g., the 
disappearance of the transparency of silicate glasses in the case of dendrite gener- 
ation within them), or positive (e.g., the increase in the elastic limit of eutectoid 
steels when the microstructure decreases in size). 

The interface destabilization studied in Section 3.2.1 corresponds to the transi- 
tion between Fig. 3.7(a) and (b). If the destabilization is increased (in this case by 
increasing the growth rate), the interface shows cells, then fully developed den- 
drites. The tips of the perturbations reject solute very efficiently and the regions 
between them trap solute, then grow more slowly. The resulting structure has a 
periodicity of its chemical composition, very often associated with precipitates or 
eutectics between the dendrites or cells. 

It should be borne in mind that the structure obtained during solidification,
close to the melting point, is likely to be significantly modified during the natural
cooling of the solid, or during subsequent mechanical or thermal treatments.
This modification depends on the alloy of interest: in steels, the original
structure can be totally lost after forging, whereas the solidification
structures in aluminum-based alloys do not change significantly during subsequent
processes. These solid-state transformations are covered in Chapter 7.


Fig. 3.7 From planar interface to dendrites: evolution of the solid–liquid
interface structure when the growth rate is increased. The alloy is a
transparent succinonitrile/acetone (4% mixture), solidified between two
glass plates under a microscope. The liquid is on the right in all the
pictures (Trivedi and Somboonsuk 1984).


3.2.2.1 Different Types of Dendrites 

The dendrites obtained by destabilization of a macroscopic solid–liquid interface
are called “columnar dendrites”. They appear only in the case of an alloy
processed under conditions of interface destabilization and their growth is
constrained (oriented) by the thermal gradient, so that the latent heat is
evacuated by the solid. A common error is to associate structural grains with
the dendrites. Generally speaking, the solid–liquid interface of a single grain
is composed of several columnar dendrites: for example, single-crystal turbine
blades are grown in the columnar dendritic mode.


Fig. 3.8 Equiaxed dendrites in an Al-Ni alloy; each dendrite constitutes
an individual grain (typical size 200 µm). The eutectic structure
between the dendrites cannot be seen at this magnification.


Fig. 3.9 Typical columnar dendrites in an Al-7%Si alloy: SEM
observation of the interface after quickly removing the liquid. These
dendrites have a typical size of 200 µm. Their faces are all oriented in
the same direction, because they belong to the same single crystal
grain (Access-Aachen).

Another type is called an “equiaxed dendrite”. It is obtained when crystal seeds
nucleate inside the liquid phase. This happens when the liquid is undercooled. 
Each seed grows on its own, at the beginning as a sphere, which may be main- 
tained throughout the solidification (globular structures); more generally, how- 
ever, dendrite arms appear on the crystal by destabilization. Growth is free (in all 
directions) and controlled by both solute and thermal diffusion into the liquid in 
the case of alloys. Equiaxed dendrites occur for pure elements as well, in which 
case the growth is controlled by thermal diffusion alone. Due to the release of 
latent heat, the solidified dendrite becomes warmer than the surrounding liquid 
and the latent heat is evacuated toward the liquid. Each equiaxed dendrite gives 
birth to an individual grain in the final structure. 





Fig. 3.10 Shape of the columnar interface of an Al$_2$Cu intermetallic
compound: SEM observation after quickly removing the liquid (Dupouy,
1986).


Fig. 3.11 Typical equiaxed carbon flakes in gray cast iron: optical
metallography, typical size 500 µm.


The shape of the dendrite depends on the material. In the case of solid
solutions based on simple crystalline metals with high symmetry (bcc, fcc)
numerous growth directions are available and the structure shows an axis and
secondary and even tertiary arms typical of dendrites (Fig. 3.9). In general,
the axis and the arms have simple crystallographic orientations, [001] in the
case of cubic crystals, and the growth rate anisotropy is low.

In the case of intermetallic compounds with crystallographic structures of
lower symmetry, only a few growth directions are available; the dendrites are
faceted and the growth rate anisotropy is high (Fig. 3.10).

Nonmetallic compounds have high melting entropies and they grow as thin
plates, flakes, or needles (Fig. 3.11).


3.2.2.2 Kinetics of Columnar Dendrites 
The fundamental hypothesis of all models for the study of dendrite dynamics 
considers that the growth is controlled by what happens at the tip of the dendrite. 
This rather surprising assumption (the dendrite tip radius is typically 1 um and 
the dendrite length may be several hundreds of micrometers) is based on obser- 
vations performed by Huang and Glicksman (1981) and Esaka (1986) and has 
been verified for the columnar dendrite. 

For a given velocity, the problem is to find the most appropriate curvature of
the dendrite tip to evacuate the amount of solute rejected. It is supposed that the
solute field around the dendrite tip is purely diffusive. As shown in Fig. 3.12,
supersaturation and undercooling are related by Eq. (23) and depend on the
chemical composition at the tip.


$$\Omega = \frac{c_l^i - c_0}{c_l^i\,(1-k)} = \frac{-\Delta T}{m_l\,c_l^i\,(1-k)} \qquad (23)$$


Fig. 3.12 Chemical and thermal fields in front of the tip of the dendrite
(Kurz and Fisher 1998).




It is therefore necessary to compute the solute field around the tip in order to 
solve the problem. 

A parabolic shape of the dendrite is generally assumed and fits well with the 
observation of dendrites in transparent media in the diffusive approximation 
(see Fig. 3.13b). 

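The diffusion equation in this case was solved by Ivantsov (1947); the solution
expresses the supersaturation $\Omega$ through Ivantsov’s function [Eq. (24)].


$$\Omega = \mathrm{Iv}(Pe) = Pe\,e^{Pe}\int_{Pe}^{\infty}\frac{e^{-z}}{z}\,dz \qquad (24)$$


The leading parameter is the Péclet number, Pe [Eq. (25)].


$$Pe = \frac{v_i\,r}{2D_l} \qquad (25)$$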

Fig. 3.13 Shape of succinonitrile dendrites: (a) under a liquid flow 
which is coming from the left and stabilizes the exposed side; (b) under 
microgravity conditions where no convection occurs (Huang and 
Glicksman 1981). Dendrite tip radius is a few µm.


If Pe is low (low growth rate), the supersaturation tends toward Pe, which is also 
the solution obtained by considering that the dendrite is a cylinder ending in a 
half-sphere, instead of a parabola. 
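
Ivantsov’s function can be evaluated without special libraries. The sketch
below assumes the standard form Iv(Pe) = Pe exp(Pe) E1(Pe), where E1 is the
exponential integral, and computes E1 from its power series, which is adequate
only for moderate Péclet numbers (here restricted to Pe ≤ 10). The function
names and the range restriction are choices made for this illustration.

```python
import math

EULER_GAMMA = 0.5772156649015329


def exponential_integral_e1(x: float, terms: int = 200) -> float:
    """E1(x) = integral from x to infinity of exp(-z)/z dz, via its power series."""
    if not 0.0 < x <= 10.0:
        raise ValueError("series implementation is limited to 0 < x <= 10")
    total = -EULER_GAMMA - math.log(x)
    term = 1.0  # holds (-x)^n / n! after n multiplications
    for n in range(1, terms + 1):
        term *= -x / n
        total -= term / n
    return total


def ivantsov(peclet: float) -> float:
    """Supersaturation of a parabolic dendrite tip at a given Péclet number."""
    return peclet * math.exp(peclet) * exponential_integral_e1(peclet)


for pe in (0.01, 0.1, 1.0, 5.0):
    print(pe, ivantsov(pe))
```

The supersaturation increases monotonically with Pe and stays below 1, so a
given supersaturation fixes only the product of tip radius and velocity.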

This solution does not take into account the effect of capillarity, which has been 
added by Huang and Glicksman (1981). For example, in the low-Pe approxima- 
tion, Eq. (24) is modified to Eq. (26). 


$$\Omega = Pe + \frac{2\Gamma}{r\,m_l\,c_l^i\,(k-1)} \qquad (26)$$





The relationship imposed by Eq. (26) between tip radius and velocity at given 
undercooling is shown in Fig. 3.14, in which the Ivantsov solution is the straight 
line on the right-hand side and departure from this law under the effect of capil- 
larity is seen to occur for small radii of curvature. 

For a given supersaturation (undercooling in the thermal case) the number of
possible pairs $(v_i, r)$ is practically infinite and another criterion is
necessary to solve the problem. It was first considered that the tip radius is
given by the extremum of the curve in Fig. 3.14, because it corresponds to the
maximum velocity and minimum undercooling of the dendrite. However, this leads
to tip radii which are too small compared with the experimental values.

Langer and Müller-Krumbhaar (1977) have postulated that the tip radius is of 
the order of the periodicity of the interface destabilization. Their argument is 
that if the dendrite tip is smaller than this value the dendrite is likely to disap- 
pear, and if it is higher the dendrite tip will be destabilized and two dendrites 
will appear. From Eq. (21) we get Eq. (27), where the unknown variable is the
solute gradient at the tip, which depends on the concentration at the interface.


$$r = 2\pi\sqrt{\frac{\Gamma}{m\,\nabla c - \nabla T}} \qquad (27)$$


Fig. 3.14 Relationship between the velocity and the tip radius for a
given undercooling/supersaturation (Kurz and Trivedi, 1990).


Combination of Eqs. (8), (23), (26), and (27) gives a relationship between the
growth rate and the tip radius which, in the case of a low growth rate and a
negligible thermal gradient, can be written as Eq. (28) (Kurz and Fisher 1998).


$$r^2 v_i = 4\pi^2\,\frac{D_l\,\Gamma}{c_0\,m_l\,(k-1)} \qquad (28)$$


This criterion is in good agreement with the dendrite tip radii measured by
Glicksman.
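
The tip selection rule can be turned into a small calculator. The sketch below
assumes the low-velocity relation in the form
$r^2 v_i = 4\pi^2 D_l \Gamma / [c_0 m_l (k-1)]$; the function name and the
material values (liquidus slope of -3 K per wt%, k = 0.1, diffusion coefficient
of 3e-9 m^2/s, Gibbs coefficient of 1e-7 K m) are hypothetical illustrations.

```python
import math


def dendrite_tip_radius(v_interface: float, d_liquid: float, gamma: float,
                        c0: float, m_l: float, k: float) -> float:
    """Tip radius (m) at a given growth rate, low-velocity marginal stability."""
    denominator = c0 * m_l * (k - 1.0)  # positive when m_l < 0 and k < 1
    if denominator <= 0.0 or v_interface <= 0.0:
        raise ValueError("expects v > 0 and c0 * m_l * (k - 1) > 0")
    return 2.0 * math.pi * math.sqrt(d_liquid * gamma / (denominator * v_interface))


alloy = dict(d_liquid=3e-9, gamma=1e-7, c0=1.0, m_l=-3.0, k=0.1)
for v in (1e-5, 1e-4, 1e-3):
    print(f"v = {v:.0e} m/s -> r = {dendrite_tip_radius(v, **alloy) * 1e6:.2f} um")
```

With these assumed values the radius is a few micrometres at 1e-4 m/s and
scales as the inverse square root of the velocity.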


3.2.2.3 Kinetics of Equiaxed Dendrites 

For dendritic structures of pure metal growing in an undercooled melt (thermal
equiaxed dendrites), the treatment of the columnar dendrite is applied, but the
solute diffusion equation is replaced by the heat transfer equation, and Eq. (24)
is replaced by Eq. (29).


$$\Delta T = \frac{\Delta H_m}{C_p}\,\mathrm{Iv}(Pe) \qquad (29)$$


The growth is not constrained, the velocity depends on the undercooling of the
melt, and, in the simplified case of low Pe, the kinetic equations are Eqs. (30a)
and (31a) (Kurz and Fisher, 1998), where $D_{th}$ denotes the thermal diffusivity
of the liquid.


$$v_i = \frac{D_{th}\,C_p}{2\pi^2\,\Gamma\,\Delta H_m}\,\Delta T^2 \qquad (30a)$$

$$r = 4\pi^2\,\frac{\Gamma}{\Delta T} \qquad (31a)$$


In the case of an alloy, it is necessary to take into account simultaneously the 
thermal and the solute fields around the dendrite (Kurz and Fischer, 1998). This 
gives Eqs. (30b) and (31b). 


D; 


= ZZ 2 
ee (30b) 
T 
r= 4 (31b) 


The equiaxed dendrite arms are very sensitive to sedimentation and convection, 
as shown by the comparison of experiments performed on Al-Ni alloys under 
microgravity (no sedimentation and convection), those under normal gravita- 
tional conditions (Dupouy and Camel, 2001), and those on transparent materials 
(Gerardin et al. 2001). Figure 3.15 shows the phase field simulation of the growth
of an equiaxed dendrite in a liquid flow (phase field simulation techniques are
discussed in Chapter 10). The dendrite arm facing the incoming liquid is
accelerated, so that the equiaxed crystal becomes elongated in the direction of
flow.


Fig. 3.15 Two-dimensional phase field simulation of free dendritic
growth in a fluid flow. The flow enters at the top and leaves through the
bottom. Colours indicate temperatures and the light lines are the
streamlines (Boettinger et al. 2002).


3.2.2.4 Characteristic Dimensions of the Dendrite 
Knowing now the geometry of the dendrite tip, it is possible to compute the
characteristic dimensions of the dendrite field: height, a, and periodicity,
$\lambda_1$, under the hypothesis of an elliptical shape and a regular hexagonal
arrangement of the dendrite field. With a and b as the half-major and half-minor
axes, respectively, the geometrical properties of the ellipse give Eq. (32).


$$r = \frac{b^2}{a} \qquad (32)$$

Neglecting the tip undercooling, Eq. (33) follows from Fig. 3.16, with
$\Delta T_0 = m_l c_0 (k-1)/k$ the liquidus–solidus interval at composition $c_0$.


$$\lambda_1 = 1.7\,b = 1.7\sqrt{r\,a} = 1.7\sqrt{\frac{r\,\Delta T_0}{\nabla T}} \qquad (33)$$


Substituting r from Eq. (28) gives $\lambda_1$ according to Eq. (34).


$$\lambda_1 = \frac{4.3}{\sqrt{\nabla T}}\,\sqrt[4]{\frac{D_l\,\Gamma\,c_0\,m_l\,(k-1)}{k^2\,v_i}} \qquad (34)$$














Fig. 3.16 (a) Columnar Al dendrites (0.5 mm diameter) in an 
Al-26wt%Cu alloy grown under microgravity in order to prevent 
convective perturbation (Dupouy et al. 1992); (b) their schematic 
representation as a hexagonal packing of ellipses; c) elevation along the 
line A-B; d) corresponding temperatures in the phase diagram. 




Table 3.2 Experimental growth rate and thermal gradient exponents for the
primary spacing of dendritic arrays, for various alloys:
$\lambda_1 \propto v_i^m\,\nabla T^n$.


Dendrite  Solute [wt%]                                  m               n
--------  --------------------------------------------  --------------  -------------
Al        2.2 to 10.1 Cu                                −0.43           −0.44
          5.7 Cu                                        −0.36           –
          0.15 Mg, 0.33 Si, 0.63 Mg, 1.39 Si (at %)     0.28            0.55
          0.1 to 8.4 Si                                 −0.28           −0.55
          1 to 27 Cu                                    −0.5            −0.5
          1 to 5 Sn                                     −0.5            −0.5
          0.1 to 4.8 Ni                                 −0.5            −0.5
          5 Ag                                          −0.5            −0.5
Pb        2 to 7 Sb                                     −0.42
          5 to 10 Sb                                    −0.75           −0.45
          10 to 50 Sn                                   −0.45           −0.33
          8 Au                                          −0.44
          10 to 40 Sn                                   0.39 to −0.43   0.3 to −0.41
Fe        0.4 C, 1 Cr, 0.2 Mo                           0.2             0.4
          8 Ni                                          −0.19           –
          0.6 to 1.5 C; 1.1 to 1.4 Mn                   −0.25           −0.56
          0.035 C, 0.3 Si                               −0.26           –





This parameter gives the main periodicity of the alloy structure and also of the
defects that are generally trapped between the dendrites: precipitates, minority
phases, microsegregations, and so on. Table 3.2 gives experimental values of the
exponents of the growth rate and the thermal gradient, which are the principal
process control parameters. It follows that Eq. (34) describes the tendency well
but somewhat underestimates the effect of the growth rate.
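
The exponents predicted by the model can be checked numerically against those
of Table 3.2. The sketch below assumes the hexagonal-ellipse estimate
$\lambda_1 = 1.7\sqrt{r\,\Delta T_0/\nabla T}$ combined with the low-velocity
tip radius $r = 2\pi\sqrt{D_l\Gamma/[c_0 m_l (k-1) v_i]}$, and treats the
solidification interval $\Delta T_0$ as an input. All names and numerical values
are hypothetical illustrations.

```python
import math


def tip_radius(v, d_liquid, gamma, c0, m_l, k):
    """Low-velocity marginal stability tip radius (m)."""
    return 2.0 * math.pi * math.sqrt(d_liquid * gamma / (c0 * m_l * (k - 1.0) * v))


def primary_spacing(v, grad_t, delta_t0, d_liquid, gamma, c0, m_l, k):
    """Primary dendrite spacing (m) for a hexagonal array of elliptical dendrites."""
    r = tip_radius(v, d_liquid, gamma, c0, m_l, k)
    return 1.7 * math.sqrt(r * delta_t0 / grad_t)


alloy = dict(delta_t0=27.0, d_liquid=3e-9, gamma=1e-7, c0=1.0, m_l=-3.0, k=0.1)
base = primary_spacing(1e-4, 1e4, **alloy)

# Apparent exponents m and n in lambda_1 ~ v^m * grad_T^n
m_exp = math.log(primary_spacing(1e-3, 1e4, **alloy) / base) / math.log(10.0)
n_exp = math.log(primary_spacing(1e-4, 1e5, **alloy) / base) / math.log(10.0)
print(f"lambda_1 = {base * 1e6:.0f} um, m = {m_exp:.2f}, n = {n_exp:.2f}")
```

Under these assumptions the model exponents are -0.25 for the growth rate and
-0.5 for the thermal gradient, to be compared with the measured values in the
table.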

Another important parameter is the secondary spacing, $\lambda_2$, which is the
spacing between the secondary arms along the dendrite side. As follows from
Figs 3.8 and 3.16(a), because of the evolution of the structure during cooling,
these arms are degenerated and are often the only measurable structures. The
original size of these arms, close to the dendrite tip, is the destabilization
periodicity but, as can be seen from Fig. 3.13, it increases rapidly because of
diffusion in the liquid between the arms, the solute going from the smaller arms
toward the larger ones under the effect of capillarity (Ostwald ripening). This
occurs as long as the sides of the dendrites are in contact with liquid. This
phenomenon has been studied by Kattamis and Flemings (1965) and Feurer and
Wunderlin (1977); they found an expression [Eq. (35)] for the spacing of
secondary arms.
In Eq. (35) the subscript b stands for the bottom of the dendrite, often at the
eutectic composition and temperature (as shown in Fig. 3.16d). This expression
has been obtained with the help of crude hypotheses and simplifications, yet it
gives a good description of the phenomenon and compares well with experimental
observation (see Fig. 3.17).


3.2.2.5 Microsegregation 

The cellular or dendritic morphology is the source of chemical heterogeneities in 
the material, as solute is trapped between the cells or dendrites and the composi- 
tion varies periodically through the microstructure. 

Equation (10) can be used to account for this periodic variation of solute, by
considering the solidification of the dendrite thickness in the y direction,
perpendicular to its z axis [Eq. (36)]. The tip of the dendrite solidifies at a
composition close to the melt composition (Fig. 3.12) and solidification
proceeds along a distance $\lambda_1/2$ till it reaches the side of the facing
dendrite:


$$c_s(y) = k\,c_0\left(1 - \frac{2y}{\lambda_1}\right)^{k-1} \qquad (36)$$

When this concentration reaches the solubility limit, other solid phases
precipitate, often as eutectic structures.
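
A rough estimate of how much interdendritic liquid is left when the solubility
limit is reached follows from applying the Scheil-Gulliver law across half a
primary spacing, with the solid fraction taken as $2y/\lambda_1$. This mapping,
the function names and the numbers in the sketch below are assumptions made for
illustration.

```python
def interdendritic_composition(y: float, spacing: float, c0: float, k: float) -> float:
    """Solid composition at distance y from the dendrite axis (0 <= y < spacing/2)."""
    f_s = 2.0 * y / spacing
    if not 0.0 <= f_s < 1.0:
        raise ValueError("y must lie in [0, spacing / 2)")
    return k * c0 * (1.0 - f_s) ** (k - 1.0)


def liquid_fraction_at_limit(c0: float, k: float, c_limit: float) -> float:
    """Fraction still liquid when the solid reaches the solubility limit (k < 1)."""
    if c_limit <= k * c0:
        return 1.0  # the very first solid is already at the limit
    f_s = 1.0 - (c_limit / (k * c0)) ** (1.0 / (k - 1.0))
    return 1.0 - f_s


# Hypothetical alloy: k = 0.5, c0 = 1, solubility limit of the solid = 1
print(interdendritic_composition(0.0, 100e-6, 1.0, 0.5))   # composition on the axis
print(liquid_fraction_at_limit(1.0, 0.5, 1.0))             # remaining liquid fraction
```

The remaining liquid fraction is an upper estimate of the amount of second
phase, since diffusion in the solid is neglected.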


3.2.3 
Rapid Solidification 


The study of rapid solidification processes began in the 1960s, as it is a way to 
obtain materials in nonequilibrium and metastable states: microcrystalline or 
amorphous phases, such as metallic glasses. They are obtained by a rapid cooling 
of the sample, at 10°-10° K s~! (101! K s™! in certain laboratories). In this sec- 
tion the three major aspects of rapid solidification are explained: the stabilization 
of the solid—liquid interface, how it is possible to obtain metastable phases, and 
in which case an amorphous solid can be obtained. There are no direct relation- 
ships between these phenomena: amorphous and stable or metastable crystalline 
phases can be obtained below, as well as above, the absolute stability velocity. 


3.2.3.1 Absolute Stability and Diffusionless Solidification 

From Eqs. (18) and (19), it can be seen that, during directional solidification,
the interface becomes unconditionally stable when the parameter A equals 1,
which immediately gives the expression for the absolute stability velocity,
Eq. (37).


$$v_a = \frac{-m_l\,D_l\,(1-k)\,c_0}{\Gamma\,k} \qquad (37)$$


It is independent of the thermal gradient, which becomes negligible compared
with the solute gradient in front of the interface. With classical material
parameters, the absolute stability velocity is in the range 0.1–1 m s$^{-1}$.
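
The quoted range can be reproduced with plausible numbers. The sketch below
reads the absolute stability velocity as $|m_l| D_l (1-k) c_0 / (\Gamma k)$,
using the magnitude of the liquidus slope; this reading, the function name and
the material values are assumptions made for illustration.

```python
def absolute_stability_velocity(m_l: float, d_liquid: float, k: float,
                                c0: float, gamma: float) -> float:
    """Velocity (m/s) above which the planar front is unconditionally stable."""
    return abs(m_l) * d_liquid * (1.0 - k) * c0 / (gamma * k)


# Hypothetical alloy: m_l = -3 K/wt%, D_l = 3e-9 m^2/s, k = 0.1, c0 = 1 wt%
v_a = absolute_stability_velocity(m_l=-3.0, d_liquid=3e-9, k=0.1, c0=1.0, gamma=1e-7)
print(f"absolute stability velocity: {v_a:.2f} m/s")
```

With these assumed values the result falls inside the 0.1 to 1 m/s range, and
it scales linearly with the alloy composition.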

At very high growth rates, the solute boundary layer in the liquid decreases so 
much [Eq. (9)] that it becomes of the order of the interface thickness. Coming 
back to the argument developed in the Introduction [Eq. (2)] it follows that the 
interface velocity becomes so high that the atoms or molecules do not have 
enough time to rearrange at the interface and the liquid is solidified without any 
segregation; the solute segregation coefficient tends toward 1. Of course, the 
chemical potential of the solid and the liquid are no longer equal, as they are not 
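in equilibrium. Aziz (1982) has proposed a relationship [Eq. (38)] for the
velocity-dependent segregation coefficient, where $\delta_i$ denotes the
interface thickness:


$$k_v = \frac{k + \delta_i v_i / D_l}{1 + \delta_i v_i / D_l} \qquad (38)$$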


As $k_v$ approaches 1 with increasing velocities, this favors the absolute
stability of the front in the case of directional solidification. In the case of
growth from the undercooled melt, dendrites become purely thermal once
diffusionless solidification conditions are reached at the tip ($k_v = 1$).
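
The transition toward solute trapping can be illustrated with the continuous
growth form of Aziz’s relation, $k_v = (k + v_i/v_D)/(1 + v_i/v_D)$, where
$v_D$ is a characteristic diffusive speed (diffusion coefficient divided by an
interface thickness). The function name and the value chosen for $v_D$ below
are hypothetical.

```python
def velocity_dependent_k(k_eq: float, v_interface: float, v_diffusive: float) -> float:
    """Velocity-dependent segregation coefficient (continuous growth model)."""
    if v_interface < 0.0 or v_diffusive <= 0.0:
        raise ValueError("velocities must be non-negative (v_diffusive > 0)")
    ratio = v_interface / v_diffusive
    return (k_eq + ratio) / (1.0 + ratio)


V_D = 10.0  # m/s, assumed: D of about 1e-9 m^2/s over a thickness of about 1e-10 m
for v in (1e-3, 1e-1, 1.0, 10.0, 1e3):
    print(f"v = {v:g} m/s -> k_v = {velocity_dependent_k(0.1, v, V_D):.3f}")
```

The coefficient stays at its equilibrium value for ordinary growth rates and
tends to 1 only when the interface velocity exceeds the diffusive speed.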
Fig. 3.18 Free energy of the solid and the liquid phases at equilibrium.
The dotted line is a tangent to both energy curves corresponding to the
solidus and liquidus lines (“double tangent construction”).


In any case, the interface velocity cannot be higher than the velocity of sound in
the liquid, which is of the order of $10^3$ m s$^{-1}$ for metals.


3.2.3.2 Nonequilibrium Phase Diagrams 

An interesting fact in rapid solidification is that the undercooling of the interface 
increases with the growth rate (see Eqs. (5) and (6) for the kinetic effect, or Eqs. 
(23)-(25) for a columnar dendrite) and liquids can be solidified far from their 
equilibrium melting temperature, in regions of the phase diagram where meta- 
stable phases can be obtained. 

At equilibrium, for a given temperature T, the compositions of the solid and 
liquid phases are defined through the free energy diagram (Fig. 3.18) (see Chap- 
ter 7 for calculation of phase diagrams). 

When rapid solidification occurs, the chemical compositions of the solid and of
the liquid can be equal, as explained above, but they cannot be higher than $c_0$,
otherwise the energy of the solid would be higher than the energy of the liquid.
Therefore, for the given temperature T, it is possible to get a solid with
compositions ranging from $c_e$ to $c_0$. The corresponding T–$c_0$ curve, known
as the $T_0$ line, can be plotted in the phase diagram of the alloy, as shown in
Fig. 3.19. It is located between the liquidus and solidus lines.

Provided that the undercooling is large enough, it can be seen from Fig. 3.19 
that the composition domain of the equilibrium phases can be increased. But 
the most important application is that other phases, which cannot be solidified 
at equilibrium, can be obtained. 


3.2.3.3 Structure of the Rapidly Solidified Phase 
Crystals show well-ordered stacks of atoms. In contrast, amorphous solids,
such as glasses, show atomic arrangements that are closer to the structure of
liquid phases, with no topological long-range order. Which of the two structures
is obtained depends on the probability of nucleation of crystal seeds in the
cooling liquid.


Fig. 3.19 Comparison between the equilibrium and the kinetic phase
diagram of a given alloy. The eutectic temperature is substantially
lowered, the eutectic composition width decreases, and a new solid
phase 0 appears.

The nucleation rate of seed crystals depends on two factors. In order to
nucleate, a seed should be large enough, so that enough solid is created to
counterbalance the energy cost of creating the seed surface. Once the seed has
nucleated, it should be fed by diffusion through the liquid in order to grow.
Both mechanisms are energetically activated and the nucleation rate, i.e., the
number of growing seeds nucleated per second, is given by Eq. (39), with ΔG_n
depending on the undercooling of the liquid according to Eq. (40) (see Chapter 7
for a detailed discussion of the nucleation mechanisms).

\[ I = I_0 \, e^{-\Delta G_n / k_B T} \, e^{-\Delta G_D / k_B T} \]    (39)

\[ \Delta G_n = \frac{16 \pi \gamma_{sl}^3}{3 \, \Delta S_m^2 \, \Delta T^2} \]    (40)

Fig. 3.20 Evolution of nucleation rate below the melting temperature.

Figure 3.20 shows both contributions and the corresponding nucleation rate. 
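
The competition between the two exponential terms of Eq. (39) can be illustrated numerically. The following is a minimal sketch, assuming the classical forms I = I0 exp(-ΔG_n/k_B T) exp(-ΔG_D/k_B T) and ΔG_n = 16πγ³/(3 ΔS_m² ΔT²). The function names and all numerical values (melting temperature, interfacial energy, entropy of fusion per unit volume, diffusion barrier) are illustrative assumptions and are not data from this chapter.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant (J/K)


def nucleation_barrier(gamma_sl, delta_s_m, undercooling):
    """Barrier of Eq. (40) in J; gamma_sl in J/m^2, delta_s_m in J/(K m^3)."""
    return 16.0 * math.pi * gamma_sl**3 / (3.0 * delta_s_m**2 * undercooling**2)


def nucleation_rate(temp, t_melt, gamma_sl, delta_s_m, delta_g_d, i0=1.0):
    """Rate of Eq. (39) in units of i0; zero at or above the melting point."""
    undercooling = t_melt - temp
    if undercooling <= 0.0:
        return 0.0
    barrier = nucleation_barrier(gamma_sl, delta_s_m, undercooling)
    return i0 * math.exp(-(barrier + delta_g_d) / (K_B * temp))


def peak_temperature(t_melt, gamma_sl, delta_s_m, delta_g_d, step=1.0):
    """Scan temperatures below t_melt and return the one with the highest rate."""
    temps = [step * i for i in range(1, int(t_melt / step))]
    return max(
        temps,
        key=lambda t: nucleation_rate(t, t_melt, gamma_sl, delta_s_m, delta_g_d),
    )


# Hypothetical parameter set, for illustration only.
PARAMS = dict(t_melt=1000.0, gamma_sl=0.1, delta_s_m=1.0e6, delta_g_d=1.0e-19)

if __name__ == "__main__":
    t_peak = peak_temperature(**PARAMS)
    print(f"nucleation rate peaks near {t_peak:.0f} K")
```

Close to the melting temperature the barrier term dominates and the rate vanishes; at low temperature the diffusion term dominates. The maximum in between corresponds to the "nose" of the curve.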

In general, this curve is presented in a plot of temperature versus time; see Fig. 
3.21 (known as a TTT diagram, for Time-Temperature-Transformation), where 
the curve gives the time after which a given nucleus density or solid fraction is 
obtained. 

Three domains can be distinguished in this diagram. Above the melting
temperature, the sample is liquid. Below the melting temperature, the crystalline
phase is obtained in the domain to the right of the “nose”-shaped curve. In the
case of a small undercooling, the time to obtain the first crystal seed may be very
long (path 1). When the undercooling increases, the probability of seed
nucleation increases as well and the critical nucleus size decreases. However,
after passing a certain critical undercooling, it becomes difficult to get crystals
because the nucleation and diffusion processes are both thermally activated.

Fig. 3.21 TTT diagram, showing how the cooling rate selects either the
crystalline or the amorphous structure of an alloy. (Horizontal axis: log(t);
the region above the melting temperature is labeled "liquid".)

It follows from this diagram that, whatever the material or alloy, there always
exist cooling conditions permitting either the crystalline (path 2) or the
amorphous (path 3) phase to be obtained. However, the critical cooling rate to get
crystals may be extremely low for materials such as silica glasses. In contrast,
extremely high cooling rates, of the order of 10⁶ K s⁻¹ and obtainable only on
thin materials (ribbons, wires and suchlike), are usually necessary to get
amorphous metallic binary alloys. In practice, the difficulty of obtaining a
crystal seed increases with the complexity of the crystallographic structure,
which in turn increases with the number of different atoms in the structure. It
follows that it is easier to get metallic glasses of quaternary or quinary alloys:
centimetric bulk metallic glasses of Zr-Ti-Ni-Cu-Be are currently produced with
cooling rates of the order of 10 K s⁻¹ (Johnson 1999).


3.2.4 
Eutectic Structures 


Eutectic structures occur when two or more solid phases grow simultaneously
from the liquid. This generally leads to fine structures, of the order of 1 µm.
Eutectics are composite materials and often show very good mechanical properties.


3.2.4.1 Size of the Eutectic Structure 

The derivation of the period of a eutectic structure was introduced by Jackson
and Hunt (1966). The calculation presented here is a simplified version for a
symmetric phase diagram and equivalent physical parameters of both phases,
together with linearized fluxes.





Fig. 3.22 Typical fibrous (Ag2Al phase in a Cu-Zn-Al-Ag alloy) and
lamellar (Al-23%Cu alloy) eutectic structures. Their characteristic
dimension is 1 µm (Access-Aachen).

Fig. 3.23 Lamellar eutectic growth and corresponding phase diagram.





Fig. 3.24 Diffusion of solute rejected by one phase in order to feed the adjacent one. 


Each phase rejects the solute corresponding to the growth of the other phase, 
which then will diffuse in order to feed the adjacent lamella (Fig. 3.24). 

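In a first approximation, the flux of B atoms from the α to the β phase can be
written as Eq. (41), where the concentration gradient in the liquid is taken over
the half-period λ/2.

\[ J_B = -D_l \, \frac{c_\beta^l - c_\alpha^l}{\lambda / 2} \]    (41)

From Eq. (8), the flux of solute B rejected by the α phase is given by Eq. (42).

\[ J_B = v_i \, c_\alpha^l \, (1 - k) \]    (42)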


Under conditions of stationary growth, both fluxes are equal and the composition
is close to the eutectic composition c_E, which gives Eq. (43).

\[ c_\alpha^l - c_\beta^l = \frac{\lambda \, v_i \, c_E \, (1 - k)}{2 D_l} \]    (43)

Fig. 3.25 Geometry of a junction of two lamellae with
the equilibrium of the surface energies leading to the
curvature of the solid-liquid interface at the lamella
top.


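Equation (44) is the corresponding expression in terms of chemical undercooling
(see Fig. 3.23), with m_α and m_β the liquidus slopes of the two phases.

\[ \Delta T_c = \frac{\lambda \, v_i \, c_E \, (1 - k)}{2 D_l \left( \frac{1}{m_\beta} - \frac{1}{m_\alpha} \right)} \]    (44)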


The geometry of the junction between two lamellae gives another expression [Eq.
(45), derived from Eq. (13)] for the undercooling (Fig. 3.25). It is supposed that
the top of the lamella is a portion of a circle and that the geometry is symmetric.

\[ \Delta T_{cap} = \frac{2 \Gamma}{\lambda} \]    (45)


The temperature of the eutectic interface T_i is plotted in Fig. 3.26 with the two
undercooling terms ΔT_c and ΔT_cap [Eq. (46)].

\[ T_i = T_E - \Delta T_c - \Delta T_{cap} \]    (46)

Fig. 3.26 Temperature of the eutectic interface versus eutectic spacing.

Fig. 3.27 Variation of the eutectic spacing with the growth rate for the
Al-Al2Cu eutectic (Jones 1984).


The system is likely to adopt the position where the energy, and thus the
undercooling, is minimal.

The eutectic spacing is then obtained by differentiating Eq. (46) with respect to λ
and finally gives Eq. (47).

\[ \lambda^2 v_i = \frac{4 D_l \, \Gamma \left( \frac{1}{m_\beta} - \frac{1}{m_\alpha} \right)}{c_E \, (1 - k)} \]    (47)


This simplified relationship captures the basic physical concepts leading to the
eutectic spacing: the periodicity of the structure is a compromise between the
interdiffusion of the species, which works better when the distance between
lamellae is decreased, and the surface energy, which tends to increase the
thickness of the lamellae. It should be noted that in a first approximation the
thermal gradient and the alloy composition have no effect on the eutectic
structure. Equation (47) is in good agreement with results for an Al-Al2Cu
eutectic (compare Fig. 3.27).
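
The compromise between diffusion and capillarity can be checked numerically. The sketch below assumes the simplified forms ΔT_c = λ v c_E (1 - k) / [2 D (1/m_β - 1/m_α)] and ΔT_cap = 2Γ/λ, and compares the spacing that minimizes their sum with the closed-form result λ² v = 4 D Γ (1/m_β - 1/m_α) / [c_E (1 - k)]. All function names and parameter values are hypothetical and chosen only to give a spacing of a few micrometers; compositions are in at.% and slopes in K per at.%.

```python
import math


def slope_term(m_alpha, m_beta):
    """(1/m_beta - 1/m_alpha); positive when m_alpha < 0 < m_beta."""
    return 1.0 / m_beta - 1.0 / m_alpha


def chemical_undercooling(lam, v, d_l, c_e, k, m_alpha, m_beta):
    """Solute (diffusion) contribution; grows linearly with the spacing lam."""
    return lam * v * c_e * (1.0 - k) / (2.0 * d_l * slope_term(m_alpha, m_beta))


def capillary_undercooling(lam, gibbs_thomson):
    """Curvature contribution; decreases as 1/lam."""
    return 2.0 * gibbs_thomson / lam


def total_undercooling(lam, v, d_l, gibbs_thomson, c_e, k, m_alpha, m_beta):
    """Sum of both contributions, i.e. T_E - T_i."""
    return chemical_undercooling(
        lam, v, d_l, c_e, k, m_alpha, m_beta
    ) + capillary_undercooling(lam, gibbs_thomson)


def eutectic_spacing(v, d_l, gibbs_thomson, c_e, k, m_alpha, m_beta):
    """Spacing (m) that minimizes the total undercooling at growth rate v (m/s)."""
    lam_sq = (
        4.0 * d_l * gibbs_thomson * slope_term(m_alpha, m_beta)
        / (v * c_e * (1.0 - k))
    )
    return math.sqrt(lam_sq)


# Hypothetical alloy: D in m^2/s, Gibbs-Thomson coefficient in K m.
ALLOY = dict(d_l=3.0e-9, gibbs_thomson=2.0e-7, c_e=20.0, k=0.2,
             m_alpha=-5.0, m_beta=5.0)

if __name__ == "__main__":
    for v in (1.0e-6, 1.0e-5, 1.0e-4):
        lam = eutectic_spacing(v, **ALLOY)
        print(f"v = {v:.0e} m/s -> spacing = {lam * 1e6:.2f} um")
```

Because λ² v is constant in this approximation, multiplying the growth rate by four halves the spacing.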


3.3 
Defects in Single and Polycrystals 


Section 3.2 has reviewed the main theories developed in order to explain the 
structures obtained after solidification. However, the reality is generally more 
complicated and materials and structures are not as ideal and perfect as could be 
expected from the previous explanations. 

This section is therefore devoted to analysis of the defects which are generated
during the solidification of alloys. This applies essentially to two classes of alloys
and growth regimes: single crystals, which are expected to have a highly perfect
atom arrangement, and dendritic structures, because perturbations of the
dendritic field are likely to create defects in the material which may cause the
mechanical properties of the alloy to deteriorate and restrict considerably its
technical application. Section 3.3.2 is devoted to the development of the grain
structure of an alloy.


3.3.1 
Defects in Single Crystals 


A convenient way to classify defects is to use their characteristic dimension. 


3.3.1.1 Point Defects 
A full treatment of point defects, such as vacancies, interstitials, antisites and so 
on, is provided in Chapter 5. 

Solidification by itself has no direct effect on point defects in pure elements. As
the liquid phase is unconstrained and electrically neutral, the basic principle is
that the interface acts as an ideal source and sink of point defects, which are
incorporated in the solid at their equilibrium value, which depends essentially on
the temperature. Considering Si at its melting temperature, vacancies and
interstitials show a level of 10¹⁵ cm⁻³ in the solid, while holes and electrons are
at the 10¹⁹ cm⁻³ level. However, only a rough interface is an ideal surface;
faceted interfaces may decrease the level of defects because the energy of one
atom adsorbed on or desorbed from the interface is high in this case.

The situation is more complicated in compound phases. For GaAs, vacancies
and interstitials are at the level of 10¹⁹ cm⁻³ (0.05%) for a perfectly
stoichiometric and neutral material, but it is difficult to obtain stoichiometry.
Figure 3.28 shows an enlarged view of the phase diagram of GaAs. It appears that
the congruent point is shifted to the As-rich side, which means that only
Ga0.497As0.503 can be grown without segregation effects. However, during
cooling, this will cause As to precipitate. The situation is complicated by the fact
that As has a high vapor pressure above liquid GaAs (2 atm at the melting point).
Perfectly stoichiometric GaAs can be grown from the liquid phase, provided that
its composition is controlled continuously during growth, which is technically
possible (but not easy) by controlling the As pressure in the gas in equilibrium
with the liquid.

In turn, antisite defects (for definition and more details, see Chapter 5) are
controlled by the stoichiometry of the compound. An important defect in GaAs,
known as EL2, is As_Ga. The reaction governing this defect is represented by
(Lagowski et al. 1982):

\[ \mathrm{As_{As}^{0}} + V_{\mathrm{Ga}}^{-} \rightleftharpoons \mathrm{As_{Ga}^{2+}} + V_{\mathrm{As}}^{+} + 4e^{-} \]

It is found that the concentration of EL2 is dependent on stoichiometry, but also
on the electrical state of the semiconductor [Eq. (48)].

\[ [\mathrm{As_{Ga}}] = K(T) \, \frac{[V_{\mathrm{Ga}}]}{[V_{\mathrm{As}}] \, [e^{-}]^{4}} \]    (48)

Fig. 3.28 GaAs phase diagram with an enlarged view of the solidus
close to the stoichiometric point superposed on it (Wenzl et al. 1991).


1D: Dislocations. Dislocations are covered in detail in Chapter 6, and only a few
comments on their interaction with solidification will be given here.

The only stress applied on the growing solid-liquid interface is the hydrostatic
pressure. For materials which are very soft at the melting temperature, for
example HgI2, this may have an effect on the dislocations created in the crystal.
However, this pressure is negligible in more conventional materials and a no-stress
mechanical boundary condition is typically used on the interface. It follows that
the solidification process by itself does not generate dislocations.

Stresses occurring after solidification, during the cooling process, or because of
possible adhesion to crucible walls easily generate dislocations in the crystal,
especially close to the melting point where plasticity is enhanced (Gondet et al.
2003). Since this process is not directly linked to solidification, it will not
be covered here, but the reader should be aware that it is the main cause of
dislocation generation in industrial single crystals (see Völkl 1994).




Fig. 3.29 X-ray topography of the seed of a zone-melting 
Si single crystal. Dislocations appear as white lines and 
their number decreases as growth proceeds, from top to 
bottom (Hurle and Rudolph 2004). 





Solidification may be used to decrease the number of dislocations in crystals. If 
dislocation lines are not perfectly perpendicular to the growing interface, they will 
move, as growth proceeds, toward the edge of the crystal, where they will disap- 
pear. This property is used for “cleaning” seeds in the pulling of Si single crystals 
(Fig. 3.29) (Dash 1959). 

Last but not least, the reader should keep in mind, as mentioned in the Intro- 
duction, that the existence of dislocations at the growing interface has a strong 
effect on the undercooling, especially in the case of faceted growth. 


2D: Twins and grain boundaries. Although point defects and dislocations can, to a
certain extent, be accepted in single-crystal production, this is not the case for
twins and grain boundaries, which separate the sample into two or more parts
with different crystallographic orientations, making the material unsuitable for
use as a single crystal.

It is well known that twins and grains are easily produced during mechanical
deformation of materials (see Chapter 6). The thermal stresses occurring during
the cooling phase of crystal growth processes are not large enough to produce
twins, but they produce dislocations that may align in sub-grain boundaries and,
ultimately, give grains (Boiton et al. 1999). Since this is not directly linked to
solidification, it will not be discussed further here (for a discussion of thermal
stresses in crystal growth, see Völkl 1994).


3.3.1.2 Twins 

Twins are specifically linked to single crystals, because they are due to a defect in
the stacking of atomic planes, leading to a rotation of the crystal structure
(Fig. 3.30). The energy associated with the twin plane is due to the stacking fault
and is relatively low compared with that of a grain boundary, so that twins can
grow easily under certain conditions.

The twinning of a crystal during Czochralski pulling will be discussed, but the
case can be extended to other crystal growth processes where a crucible wall is in
contact with both liquid and solid. A first condition for twinning to occur is
that the solid-liquid interface is faceted at the crystal-liquid-gas triple line (Fig.
3.31). It can be shown that a twinned seed cannot nucleate in the middle of the
facet, because its energy, increased by the twin boundary energy, is higher than
the energy of a regular seed.

Hurle (1995) has shown that twinning will occur on the facet at the triple line,
if a twinned seed can nucleate. Such a seed is a truncated disk and its energy is
lower than that of a regular, circular seed. The question is under which conditions
a truncated twinned seed has a lower energy than a truncated regular
(well-oriented) seed. The key point is that the lateral surface of the seed (facing
the gas) should have a given orientation, generally <111>. This happens if the
angle α in Fig. 3.31 is 70.5°, which is likely to occur in the conical part of the
crystal, when its diameter is increased. Temperature oscillations, leading to
diameter fluctuations, can also bring the angle to this particular value.

For the growth of the twinned seed to be possible, it should also be
small enough; this gives a value for its undercooling according to Eq. (49), where
subscripts TB, C and T respectively stand for the Twin Boundary, for a Circular
seed, and for the Truncated seed.

\[ \Delta T_i = \frac{\gamma_{TB}}{a \, \Delta S_m} \, \frac{\Delta G_C}{\Delta G_C - \Delta G_T} \]    (49)

Fig. 3.30 Typical twin boundary, due to a stacking error of the A, B, and C atomic planes.

Fig. 3.31 Normal growth in the Czochralski process: the facet at the
triple line grows step by step. A twin may occur if a seed of this
orientation nucleates at the triple line.


It follows that the facet on which the twinned seed is nucleating should be
highly undercooled. As dislocations facilitate the growth of faceted crystals and
thus decrease their undercooling, twins are more likely to appear in crystals with
a very low dislocation density, which is in good agreement with experimental
observation.


3.3.1.3 Grains 

Single-crystal growth often fails because of the occurrence of grains. Due to the
energy associated with the grain boundary, the nucleation of a randomly
misoriented seed directly on the growing interface is not possible and the
mechanism must be different from that of twin nucleation. The basic principle is
still the decrease in energy of the seed of a grain with respect to a seed correctly
oriented on the interface. This may occur if foreign surfaces, on which a
misoriented seed can grow, are present in the system. Such surfaces are typically
precipitates or crucible walls. In addition, the grain seed should nucleate ahead
of the interface, so a certain undercooling is necessary.

As shown in Fig. 3.32, two configurations may lead to grain nucleation in the 
layer of undercooled melt ahead of the interface: nucleation on a crucible wall, 
and in holes of a rough crucible. Only the case of a seed nucleating on the 
crucible wall (Duffar et al. 1999) will be discussed here, but the treatment of the 
other configuration is equivalent. In processes without crucibles, grains nucleate 
on foreign particles or on precipitates in the melt. 

From nucleation theory, there is a critical radius for nucleation of a 2D
nucleus on a faceted interface [see Eq. (12)] and the associated energy is given
by Eq. (50).

Fig. 3.32 Nucleation of solid seeds: (1) on the interface; (2) in a hole in
a rough crucible; (3) spurious nucleation on the crucible wall.

On the crucible wall, the critical seed has a similar radius, but the energy is
decreased by a factor depending on the contact angle θ of the seed on the wall
[Eq. (51)].

\[ \Delta G_W = \frac{16 \pi \gamma^3}{3 \, \Delta S_m^2 \, \Delta T^2} \, \frac{(2 + \cos\theta)(1 - \cos\theta)^2}{4} \]    (51)


Figure 3.33 shows that if the contact angle is low enough, spurious nucleation 
may occur, even for low values of undercooling. 
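
The effect of the contact angle can be made concrete with a short calculation. This is a minimal sketch, assuming Eq. (51) in the form ΔG_W = [16πγ³/(3 ΔS_m² ΔT²)] · (2 + cos θ)(1 - cos θ)²/4; the function names are hypothetical, and the interfacial energy and entropy of fusion used in the demonstration are illustrative values, not data for GaSb.

```python
import math


def wetting_factor(theta_deg):
    """Geometric factor (2 + cos t)(1 - cos t)^2 / 4 for a contact angle in degrees."""
    c = math.cos(math.radians(theta_deg))
    return (2.0 + c) * (1.0 - c) ** 2 / 4.0


def wall_nucleation_barrier(gamma, delta_s_m, undercooling, theta_deg):
    """Barrier (J) for a seed on a wall: unwetted barrier times the wetting factor."""
    unwetted = 16.0 * math.pi * gamma**3 / (3.0 * delta_s_m**2 * undercooling**2)
    return unwetted * wetting_factor(theta_deg)


if __name__ == "__main__":
    # Illustrative values only: gamma in J/m^2, delta_s_m in J/(K m^3), dT in K.
    for theta in (40.0, 90.0, 100.0):
        barrier = wall_nucleation_barrier(0.1, 1.0e6, 1.0, theta)
        print(f"theta = {theta:5.1f} deg, factor = {wetting_factor(theta):.3f}, "
              f"barrier = {barrier:.3e} J")
```

The factor falls from 1 at θ = 180° (no wetting) to 0.5 at 90° and tends to zero for small angles, which is why a wall that is well wetted by the solid promotes spurious nucleation.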


Fig. 3.33 Nucleation energy of a 2D nucleus on the interface and on
walls with various contact angles, plotted as a function of the
undercooling. (Inset labels: liquid, solid, substrate; curve label: 2-D nucleation.)

Fig. 3.34 Metallographs of marked GaSb solid-liquid interfaces, in
contact with the crucible wall: (a) silica (θ = 100°); (b) vitreous carbon
(θ = 90°); (c) BN with carbon coating (θ = 90°); (d) BN (θ = 40°, the
crucible wall is on the right).


Figure 3.34 shows the angle made by a solid-liquid interface on a crucible wall.
The measured angles, introduced into Eq. (51), are in agreement with the
observation that spurious nucleation in GaSb crystal growth never occurs in silica
crucibles, but always happens in BN crucibles.


3.3.2
Grain Structure of an Alloy


As already mentioned in Section 3.2.2 with respect to dendrites, the grain
structure of an alloy is of primary importance to its mechanical properties. Grain
boundaries are defect zones with a high energy (0.5 J m⁻², close to the energy of
a solid-liquid interface), attracting stresses and impurities, and helping the
precipitation of foreign phases. On the one hand, this results in a cleaning effect
of the intragranular material but, on the other hand, grain boundaries may be
locations of corrosion, mechanical weakness, or crack initiation. Although this is
extremely alloy-dependent, it is generally recognized that the quality of an alloy is
improved when grains are smaller.

Grains can be columnar, when they are elongated in the direction of
growth, and in this case each grain is constituted by several columnar dendrites
(see Section 3.2.2). Grains with an isotropic shape are called equiaxed and they
are generally produced from individual equiaxed dendrites. As shown in Fig.
3.35, it is possible to change the growth conditions in order to get one or the
other type of grain structure.

Typical defects are associated with the grain structure. 

• Chemical segregation may occur between dendrites and
  grains, and on the full ingot scale. Precipitation of oxides,
  sulfides, or eutectic phases, or nucleation of dissolved gases
  (H2) often follows.

• Shrinkage may also occur between dendrites and grains, and
  on the ingot scale. It is a major defect that deserves much
  attention. Its distribution across the solid depends strongly
  on the solidification conditions.





Fig. 3.35 Transition from columnar (left) to equiaxed (right) grain
growth through an increase in the growth rate from 2 to 15 µm s⁻¹
(refined Al-3.5 wt% Ni, ∇T = 20 K cm⁻¹) (Dupouy and Camel 2000).
Diameter 8 mm.

Fig. 3.36 Typical grain structure in a cast alloy.


In practice, all metallurgical processes consist in pouring the liquid into a mold.
The extraction of heat toward the walls leads to a typical grain structure in the
cast ingot, which is shown in Fig. 3.36. There are three zones: the outer equiaxed
zone, the columnar region, and an equiaxed area in the center.

The outer equiaxed zone is composed of small grains that grew by heterogeneous
nucleation in the undercooled liquid at the mold walls (step 1; compare
Fig. 3.37). As there is a great heat extraction by the mold, their growth rate is
high and destabilization occurs, leading to columnar dendritic growth (step 2).
Dendrites follow the crystallographic orientation of the initial grain and those
oriented opposite to the thermal flux “overgrow” the others, so that grain selection
occurs (step 3).

Fig. 3.37 Three steps of the transition from the outer equiaxed zone to
the columnar structure. (Panel labels: Step 1, seeds on the crucible wall;
Step 2, dendritic growth; Step 3, only correctly oriented grains survive.)

Fig. 3.38 Grain structure of an alloy as predicted by Gandin’s stochastic
model (Gandin and Rappaz, 1994). The growth proceeds from the left
to the right and the effect of melt convection can be seen. The colors
are representations of the various crystallographic orientations of the
grains.

So far, these nucleation, growth, and grain selection mechanisms have been 
modeled only through stochastic numerical simulation, with realistic and predic- 
tive results (Fig. 3.38). The method and some results are given in Rappaz and 
Gandin (1993) and Gandin and Rappaz (1994). 

The central equiaxed region appears when the thermal gradient in the liquid 
vanishes and seeds transported from the front (detached dendrite arms) are able 
to grow in the liquid. Depending on the application of the alloy, it is possible to 
avoid the equiaxed structure (photovoltaic silicon, turbine blades) or on the con- 
trary to favor it, by mixing the liquid or using refining particles. 

In Section 3.3.2.1 the equiaxed grain growth regime is studied, in an attempt 
to find out the typical size of the grain structure. The parameters involved in the 
columnar to equiaxed transition are explained in Section 3.3.2.2. 


3.3.2.1 Equiaxed Growth in Presence of Refining Particles 
Grain refining is used to get a homogeneous and fine equiaxed structure. It
consists in enhancing heterogeneous grain nucleation with the help of
foreign solid particles introduced in the melt. The principle is based on the fact
that heterogeneous nucleation on foreign particles is easier than homogeneous
nucleation, especially if the contact angle of the seed on the particle is low [Eq.
(51)].

The first condition is to have nucleating particles (refiners) in the melt. Finding
adequate particles is not easy because they should not dissolve in the melt and
they should have nucleation properties only for a sufficiently undercooled melt.
In practice, two refinement processes are used (Jackson et al. 1966).

• Introduction of the appropriate particles in an undercooled
  melt. A very typical example is the introduction of Ti (1-10%)
  and B (0.1-3%) in Al alloys.

• Strong mixing of the melt, for example by electromagnetic
  forces. The common assumption is that the melt flow is
  strong enough to detach secondary arms of columnar
  dendrites (this detachment occurs by chemical dissolution
  rather than mechanical effect) and these solid particles are
  seeds for the equiaxed growth if they remain in or pass into
  an undercooled liquid.


The second condition means that the melt should be, and should remain,
undercooled as long as possible. However, because of the equiaxed dendrite
growth, latent heat is released in the system and the liquid temperature increases.
It is only in the case of efficient solute rejection by the grains that the liquid
remains undercooled (chemical undercooling, Eq. (3)) and that the growth of the
already existing grains is restricted. This ability is measured by the growth
restriction parameter, q [Eq. (52)].

\[ q = m_l \, (k - 1) \, c_0 = k \, \Delta T_0 \]    (52)
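
The two forms of the growth restriction parameter can be checked against each other. The sketch below assumes straight liquidus and solidus lines, so that the freezing range of an alloy of composition c0 is ΔT0 = m_l c0 (k - 1)/k, and verifies that q = m_l (k - 1) c0 equals k ΔT0. The function names and the numerical values are hypothetical and serve only as an illustration.

```python
def growth_restriction(m_l, k, c0):
    """Growth restriction parameter q = m_l (k - 1) c0, in K."""
    return m_l * (k - 1.0) * c0


def freezing_range(m_l, k, c0):
    """Liquidus minus solidus temperature at c0 for straight phase-diagram lines.

    Liquidus: T_m + m_l * c0.  Solidus: T_m + m_l * c0 / k.
    """
    return m_l * c0 * (k - 1.0) / k


if __name__ == "__main__":
    # Illustrative solute: liquidus slope in K per wt%, c0 in wt%.
    m_l, k, c0 = -3.0, 0.1, 2.0
    q = growth_restriction(m_l, k, c0)
    dt0 = freezing_range(m_l, k, c0)
    print(f"q = {q:.2f} K, k * dT0 = {k * dt0:.2f} K")
```

A solute with a steep liquidus slope and a partition coefficient far from unity gives a large q, that is, a strong restriction of grain growth for a given concentration.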


The number of grains in the alloy is a direct consequence of the competition be- 
tween the heterogeneous nucleation rate, which increases with the undercooling, 
and the growth rate, which decreases as the undercooling increases. 

In the model proposed by Maxwell and Hellawell (1975), three different regimes
are considered.

In a first step, the hot liquid cools down, passes the melting temperature, and
then is undercooled. The energy barrier for nucleation of a seed on a foreign
particle is given by Eq. (51); it decreases when the undercooling increases, so
that nucleation starts at a given undercooling. The nucleation rate is given by Eq.
(39) and is proportional to the number of foreign particles N_p. Considering the
low undercooling, the diffusion term is taken to be constant and Eq. (53) results.

\[ \frac{dN}{dt} = N_p \, I_0 \, e^{-\Delta G_n / k_B T} \]    (53)


In the second step, particles are growing. It is supposed that the undercooled
liquid is isothermal and that the solute fields around the particles are not
interacting, so that the liquid between two grains is at the initial composition.
These assumptions are based on the fact that the thermal diffusivity is two to
three orders of magnitude higher than the chemical diffusivity. The characteristic
length for chemical diffusion is then smaller than the typical grain size (200 µm)
but the characteristic length for heat transport corresponds to hundreds of grains.

The crystals are spherical and their growth rate is limited by the solute rejection
and thus varies inversely with q. The time evolution of the radius of a particle
growing and rejecting solute has been obtained by Aaron et al. (1970) as Eqs. (54).

\[ \Delta T_s = m_l \, (c^l - c_0) \]    (54)


The undercooling of the melt is the sum of the solute undercooling [Eq. (54)] and
of the capillary undercooling [Eq. (13)], or as stated in Eq. (55).

\[ \Delta T_s = \Delta T - \Delta T_{cap} \]    (55)


During the growth, the system is cooled at a rate P and the latent heat is released
in proportion to the increase of the solid fraction f_s, so the variation of liquid
undercooling is given by Eq. (56).

\[ \frac{d \Delta T}{dt} = P - \frac{\Delta H_m}{C_p} \, \frac{d f_s}{dt} \]    (56)


Equations (53-56) form a system into which the number of particles, their size, 
and the undercooling enter as unknown variables. This system can be solved nu- 
merically if an initial value is given for the radius ro. 
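
The heat balance that closes this system can be illustrated on its own. The sketch below assumes Eq. (56) in the form d(ΔT)/dt = P - (ΔH_m/C_p) df_s/dt and computes the solidification rate at which the undercooling stops increasing, which marks the onset of recalescence. The function names are hypothetical; the volumetric latent heat and heat capacity are rough values for aluminum used as an illustration, not values taken from this chapter.

```python
def undercooling_rate(cooling_rate, latent_heat, heat_capacity, dfs_dt):
    """d(dT)/dt in K/s: imposed cooling minus heating by latent heat release."""
    return cooling_rate - (latent_heat / heat_capacity) * dfs_dt


def recalescence_solid_rate(cooling_rate, latent_heat, heat_capacity):
    """Solid-fraction rate (1/s) at which the undercooling stops increasing."""
    return cooling_rate * heat_capacity / latent_heat


if __name__ == "__main__":
    P = 0.5        # cooling rate, K/s
    DH_M = 9.5e8   # latent heat per unit volume, J/m^3 (approximate, Al)
    C_P = 2.6e6    # heat capacity per unit volume, J/(K m^3) (approximate, Al)
    rate = recalescence_solid_rate(P, DH_M, C_P)
    print(f"undercooling peaks when df_s/dt = {rate:.2e} 1/s")
    print(f"check: d(dT)/dt = {undercooling_rate(P, DH_M, C_P, rate):.1e} K/s")
```

Since ΔH_m/C_p is several hundred kelvin, a solid fraction growing by only about 0.1% per second is enough to offset a cooling rate of 0.5 K s⁻¹ with these values.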

The third step starts when the temperature of the melt increases enough to 
stop the grain nucleation. No more grains are formed, but the other grains con- 
tinue growing and it has been shown that it is during this step that they are de- 
stabilized and show dendrite arms. 

This model is able to reproduce the melt undercooling, the undercooling break- 
down, and the recalescence, and, most importantly, gives the tendencies concern- 
ing the final number of grains. Some interesting results are shown in Fig. 3.39. 
The contact angle of the seed on the foreign particle has a huge influence on the 
number of grains. It is also shown that the cooling rate is likely to help to control 
the number of grains. 

Finally, the model is able to explain the experimental fact that the grain size 
decreases with the number of refining particles, but only to a certain extent. If 
too many particles are present, the number of growing grains increases quickly 
at the beginning and the latent heat released prevents the nucleation of other 
grains. It is then possible to determine the number of foreign particles leading 
to the smallest grains. However, the model remains only qualitative. 

Greer has proposed an improvement of the previous model, taking into account
the statistical distribution of particle size and a free growth of the dendrites
(Greer et al. 2000). This analysis has been performed in the case of Al alloy
casting with Al-Ti5-1B as refiner.

Fig. 3.39 Equiaxed grain density versus foreign particle density.
Parameter: contact angle for P = 0.5 K s⁻¹ (left), and cooling rate for
θ = 7° (right) (Al-Ti, r0 = 1 µm) (Maxwell and Hellawell 1975).

Suppose that Al seeds nucleate on the large [0001] faces of TiB2 particles: if the
wetting angle is low enough, this is likely to happen at low undercoolings. The Al
seed grows laterally and when the face is covered, it keeps on growing, but with
an increase in the curvature of the solid-liquid interface (Fig. 3.40). The radius of
curvature of the nucleus cannot be lower than the critical radius for nucleation,
r*, which depends on the undercooling [Eq. (12)]. Therefore free growth occurs
only when the critical radius becomes lower than the particle radius. It follows
that, for a particle population ranging from 0.1 to 10 µm, this kind of free growth
occurs for undercoolings from 6 to 0.06 K, which indeed corresponds to the
undercoolings observed in the processing of refined Al alloys.
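
The order of magnitude of these undercoolings can be reproduced with a one-line estimate. The sketch below assumes the classical critical radius r* = 2γ/(ΔS_v ΔT), which may differ by a numerical factor from Eq. (12), and reads the quoted particle sizes as diameters, so that free growth starts when ΔT = 4γ/(ΔS_v d). The function name and the default material values (solid-liquid interfacial energy and entropy of fusion per unit volume, typical figures for aluminum) are assumptions not given in the text.

```python
def free_growth_undercooling(diameter, gamma_sl=0.158, delta_s_v=1.112e6):
    """Undercooling (K) at which the critical radius equals half the particle diameter.

    diameter in m, gamma_sl in J/m^2, delta_s_v in J/(K m^3).
    """
    return 4.0 * gamma_sl / (delta_s_v * diameter)


if __name__ == "__main__":
    for d_um in (0.1, 1.0, 10.0):
        dt = free_growth_undercooling(d_um * 1.0e-6)
        print(f"d = {d_um:5.1f} um -> free-growth undercooling = {dt:.3f} K")
```

Because the required undercooling scales as 1/d, the largest particles become active first as the melt cools, and smaller ones follow only if the undercooling keeps increasing.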

It follows also that during the liquid cooling seeds appear first on large particles 
then on smaller and smaller particles when the undercooling increases. 

Greer’s model then adopts Maxwell and Hellawell’s treatment, where the growth of
existing seeds causes a decrease in the undercooling and the subsequent
recalescence. This means that all the particles under a certain diameter do not
lead to seed nucleation because the undercooling never becomes large enough. In
practice only 5% of the particles are useful for seeding, but, as they are the biggest
ones, these represent a significant part of the refiner mass.

Fig. 3.40 Growth of Al seed on the Al-Ti5-1B refiner
particle.

Fig. 3.41 Variation of grain size with refiner quantity and with cooling
rate, as predicted by Greer’s model and compared to experimental
values (Greer et al. 2000). (Horizontal axes: addition level of refiner (ppt);
cooling rate (K s⁻¹).)

The prediction of the grain size versus quantity of refiner and cooling rate
agrees quantitatively with the experimental observations (Fig. 3.41) and shows
that above a quantity of about 1 ppt (part per thousand) (by weight) of refiner, or
a cooling rate of 1 K s⁻¹, a further increase is not useful for decreasing the grain
size. It is worth noting that the model does not use any adjustable parameter.

It should be noted that the model is based on the hypothesis that the nuclea- 
tion on the particle is easy (i.e., the contact angle is low) and therefore cannot be 
applied if the nucleation barrier is too large. It has been remarked that certain 
solutes, such as Zr, have a strong effect on the nucleation properties and thus 
act as killers of the refining effect. 


3.3.2.2 Columnar to Equiaxed Transition 

As shown in the introduction to Section 3.3.2, columnar and equiaxed growth
compete at some stage of the ingot solidification. It is therefore important
to know when the transition between the two structures will occur and which
parameters influence this transition.

The classical model was proposed by Hunt (1984). A columnar dendritic
interface is growing; its growth rate is imposed by the heat extraction through
the mold (constrained growth). The dendrite tip undercooling adjusts itself to
the given velocity (see Section 3.2.2.2). A region of undercooled liquid is thus
created ahead of the columnar front, whose extent and undercooling depend on the
solidification rate and temperature gradient. It is supposed that solid seeds
nucleate (in the presence of refining particles) in this undercooled region and
grow as equiaxed dendrites. The structure will remain columnar if the volume of
equiaxed grains is negligible compared with the advancement of the columnar
front. Conversely, if the equiaxed grains grow fast enough to occupy the major
part of the volume before being passed by the columnar front, the resulting
structure will be equiaxed.

Fig. 3.42 Schematic plot of the competition between columnar and
equiaxed growths. The columnar front is undercooled by $\Delta T_c$, and seeds
nucleate in the liquid below the melting temperature $T_m$ and then grow
as the columnar front approaches.

The nucleation rate is expressed, from Eq. (53), taking into account the remaining
number of free nucleating particles, as Eq. (57).


$$\frac{dN}{dt} = (N_0 - N)\, I_0\, e^{-\Delta G_r / k_B T} \qquad (57)$$


An undercooling is necessary for nucleation [$\Delta G_r$ varies as $1/\Delta T_N^2$; compare Eq.
(3.51)]. At the columnar dendrite tip, the undercooling is maximal and equal to
$\Delta T_c$, and it follows that there are two situations:
• if $\Delta T_N > \Delta T_c$, the equiaxed dendrites cannot nucleate and no
equiaxed grain will appear
• if $\Delta T_c > \Delta T_N$, seeds will nucleate ahead of the
columnar tips and, as Eq. (30b) states, the growth rate of
these equiaxed dendrites is proportional to the square of the
local undercooling [Eq. (58)].


$$\frac{dr}{dt} = \frac{D_l}{4 c_0 m_l (k - 1)\Gamma}\, \Delta T^2 \qquad (58)$$


This local undercooling varies between $\Delta T_N$ and $\Delta T_c$ at a rate depending on the
velocity of the columnar front and on the thermal gradient in the liquid [Eq. (59)].


$$\frac{d\Delta T}{dt} = v_i \nabla T \qquad (59)$$

When the equiaxed dendrite comes into contact with the columnar front, its
radius is given by Eq. (60) and the volume of equiaxed grains formed per unit of
time by Eq. (61).


$$r = \frac{D_l}{4 c_0 m_l (k - 1)\Gamma} \int_0^t \Delta T^2\, dt = \frac{D_l}{4 c_0 m_l (k - 1)\Gamma} \int_{\Delta T_N}^{\Delta T_c} \frac{\Delta T^2}{v_i \nabla T}\, d\Delta T \qquad (60)$$

$$V_{eq} = N_0\, \frac{4}{3} \pi r^3 \qquad (61)$$


Hunt considered that the structure will be equiaxed if this volume is more than
66% of the total volume solidified per unit time (at the velocity $v_i$) and columnar
if it is less than 0.66%. It finally follows that the structure is columnar if Eq. (62a)
is satisfied, and equiaxed if Eq. (62b) is satisfied.


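$$\nabla T > 0.617\, (100 N_0)^{1/3} \left(1 - \frac{\Delta T_N^3}{\Delta T_c^3}\right) \Delta T_c \qquad (62a)$$

$$\nabla T < 0.617\, N_0^{1/3} \left(1 - \frac{\Delta T_N^3}{\Delta T_c^3}\right) \Delta T_c \qquad (62b)$$


Between these two values, the structure is a mixture of columnar and equiaxed
grains. $\Delta T_c$ is linked to the growth rate $v_i$ through Eqs. (27) and (28).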
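
The numerical factor 0.617 can be recovered in a few steps. The sketch below assumes that the columnar tip obeys the same quadratic growth law as the equiaxed dendrites, $v_i = D_l \Delta T_c^2 / [4 c_0 m_l (k-1)\Gamma]$; this closure is an assumption made for the illustration and is not spelled out in the text.

```latex
% Integrating the growth law between \Delta T_N and \Delta T_c gives the radius
% of an equiaxed grain when the columnar front reaches it:
r = \frac{D_l}{4 c_0 m_l (k-1)\Gamma}\,
    \frac{\Delta T_c^3 - \Delta T_N^3}{3\, v_i \nabla T}

% With the assumed tip law v_i = \frac{D_l}{4 c_0 m_l (k-1)\Gamma}\,\Delta T_c^2 :
r = \frac{\Delta T_c}{3 \nabla T}
    \left(1 - \frac{\Delta T_N^3}{\Delta T_c^3}\right)

% Equiaxed limit: N_0 \tfrac{4}{3}\pi r^3 = 0.66
r = \left(\frac{3 \times 0.66}{4 \pi N_0}\right)^{1/3} \approx 0.540\, N_0^{-1/3}

% Eliminating r between the last two expressions:
\nabla T = \frac{1}{3 \times 0.540}\, N_0^{1/3}
           \left(1 - \frac{\Delta T_N^3}{\Delta T_c^3}\right) \Delta T_c
         \approx 0.617\, N_0^{1/3}
           \left(1 - \frac{\Delta T_N^3}{\Delta T_c^3}\right) \Delta T_c
```

The columnar limit follows in the same way with 0.0066 in place of 0.66, which amounts to replacing $N_0$ by $100 N_0$.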

The effect of the leading parameters is plotted in Fig. 3.43. For a given thermal
gradient, equiaxed growth is possible only at high velocity, because at low
velocity the undercooling is too small to allow for seed nucleation. Increasing
the thermal gradient favors columnar growth, because the undercooled layer
becomes thinner as the thermal gradient increases. This is the reason why equiaxed
grains are frequently observed at the end of solidification, when the
liquid becomes almost isothermal.
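
The two bounds of Hunt's criterion lend themselves to a small classifier. The following Python sketch is illustrative only: the function names, the growth-law constant `A`, and every numerical value are assumptions chosen for the example, not data from Hunt (1984). It also assumes that the columnar tip undercooling follows a quadratic law, v_i = A ΔT_c² / c_0.

```python
import math


def tip_undercooling(v_i, c0, A):
    """Columnar tip undercooling dT_c (K), assuming v_i = A * dT_c**2 / c0."""
    return math.sqrt(v_i * c0 / A)


def cet_structure(grad_T, v_i, N0, dT_N, c0, A):
    """Classify the grain structure with the bounds of Eqs. (62a) and (62b).

    grad_T : thermal gradient in the liquid (K/m)
    v_i    : velocity of the columnar front (m/s)
    N0     : number density of nucleating particles (1/m^3)
    dT_N   : nucleation undercooling (K)
    c0, A  : nominal composition and growth-law constant (assumed closure)
    """
    dT_c = tip_undercooling(v_i, c0, A)
    if dT_c <= dT_N:
        # The undercooling ahead of the front never reaches dT_N:
        # no equiaxed grain can nucleate.
        return "columnar"
    common = (1.0 - dT_N**3 / dT_c**3) * dT_c
    g_equiaxed = 0.617 * N0 ** (1.0 / 3.0) * common            # Eq. (62b)
    g_columnar = 0.617 * (100.0 * N0) ** (1.0 / 3.0) * common  # Eq. (62a)
    if grad_T > g_columnar:
        return "columnar"
    if grad_T < g_equiaxed:
        return "equiaxed"
    return "mixed"


if __name__ == "__main__":
    # Hypothetical parameters, for illustration only.
    params = dict(N0=1.0e9, dT_N=0.75, c0=3.0, A=3.0e-6)
    for grad_T in (1.0e3, 1.0e4, 5.0e4):
        print(grad_T, cet_structure(grad_T, 1.0e-4, **params))
```

Scanning `grad_T` at fixed velocity reproduces the qualitative trend described above: a low gradient gives an equiaxed structure, a high gradient a columnar one, with a mixed region in between.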

The limiting velocity below which no equiaxed grain can nucleate decreases 
when the solute concentration increases and when the nucleation undercooling 
decreases (Fig. 3.44).

Fig. 3.43 (a) Equiaxed and columnar domains in a $v_i$/$\nabla T$ diagram, and
(b) effect of the number of refining particles on the equiaxed side of the
transition. Al-3%Cu (Hunt 1984).

Fig. 3.44 Equiaxed and columnar domains in diagrams showing growth
rate versus thermal gradient. (a) Effect of the alloy composition and (b)
of the nucleation undercooling on the equiaxed side of the transition.
Al-3%Cu (Hunt 1984).


The model neglects convective effects, which are known to be strong: in
reality the smallest equiaxed grains are transported by the convective flow
into regions where they may melt back, whereas larger grains tend to sediment.
Nevertheless, the model is useful because it helps in understanding the leading
parameters acting on the columnar-to-equiaxed transition. Experiments performed
under microgravity conditions, in order to avoid any convective perturbations,
have made it possible to discuss the validity of the model, which is limited by its
sensitivity to the initial parameters (undercooling and number of seeds) (Dupouy
and Camel 2001).


3.3.3 
Macro- and Mesosegregation 


In this section, perturbations of the columnar dendritic field are discussed. The 
region between the tip and the bottom of the dendrite, where solid and liquid ex- 
ist simultaneously, is often called the “mushy zone”. Neglecting the undercooling 
of the tip, its thickness can be calculated through Eq. (63) (see Fig. 3.16). 


$$a = \frac{\Delta T_0}{\nabla T} \qquad (63)$$

This mushy zone in certain solidification processes may reach several decimeters. 
The thermal and chemical characteristics of the mushy zone are directly related 
to the dendrite, as studied in Section 3.2.2. However it also has a strong interac- 
tion with the flow field, acting as a porous medium (Darcy flow), with the partic- 
ularity that the permeability depends on the solidification conditions, which in 
turn are influenced by the flow field (see Fig. 3.13). By changing the thermal 
and solutal fields around the dendrite, and inside the dendritic field, convective 
flows have a strong influence on the solidification conditions. The coupling of
the interdendritic flow with hydrodynamics in the bulk liquid leads to defects in
the material and chemical segregations on the ingot scale. The chemical segrega- 
tion is called “positive” where the local chemical composition is higher than the 
mean ingot concentration, and “negative” where it is lower. 

Independently of forced convection that may be imposed on the system 
(through mechanical, electromagnetic, or other processes), natural convection al- 
ways exists because of the thermal field perturbation by the solidification front, 
but also, and even more so, because of the chemical gradients that induce varia- 
tions in the liquid density. Both thermal and solutal fields are acting on the liquid 
density through their expansion coefficients [Eq. (64)]. 


$$\rho_l = \rho_0\, (1 + \beta_T \Delta T + \beta_c c_l) \qquad (64)$$


Since the chemical and thermal fields are related by the thermodynamic equilibrium
hypothesis along the dendrite side [Eq. (3)], Eq. (65) follows.


$$\rho_l = \rho_0 \left(1 + \left(\beta_T + \frac{\beta_c}{m_l}\right) \Delta T\right) \qquad (65)$$


The thermal expansion coefficient is of the order of $10^{-4}$ K$^{-1}$ and the chemical term
in Eq. (64) is of the order of $10^{-2}$ to $10^{-3}$ K$^{-1}$. Therefore the thermal convection
within the mush can generally be neglected. The resulting Archimedes force is
calculated through Eq. (66).


$$F_A = \beta_c \Delta T\, g / m_l \qquad (66)$$


It follows that the convective effects will depend on the orientation of the solute 
gradients relative to gravity. 

From a solute balance at the local scale, Flemings and Nereo (1967) derived the 
equation of solute distribution [Eq. (67)], taking into account the local convective 
flow. 





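$$\frac{\partial f_l}{\partial c_l} = -\frac{(1 - \delta_s)}{(1 - k)} \left(1 + \frac{v_l \cdot \nabla T}{\partial T / \partial t}\right) \frac{f_l}{c_l} \qquad (67)$$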


Here $\delta_s$ is the solidification shrinkage, and $v_l$ the liquid velocity, which is given by
Darcy’s law, with a permeability factor depending on the liquid fraction according
to Eq. (68).


$$v_l = -\frac{K}{\eta f_l} (\nabla p - \rho_l g), \qquad K = a f_l^2 \qquad (68)$$


If there is no convection and no shrinkage, Eq. (67) is the differential form of the
classical Scheil’s law, Eq. (10). If convection is insignificant or parallel to the thermal
gradient, integration of Eq. (67) gives a modified Scheil’s law, Eq. (69).

$$c_l = c_0\, (1 - f_s)^{(k - 1)/[(1 - \delta_s)(1 - v_l / v_i)]} \qquad (69)$$


Transition toward the convective regime occurs when the fluid velocity becomes 
of the same order as the growth rate. 
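
To give a feeling for how shrinkage and flow change the Scheil profile, here is a minimal Python sketch. It assumes that the modified law can be written for the interdendritic liquid as c_l = c_0 (1 − f_s)^((k−1)/ξ), with ξ = (1 − δ_s)(1 − v_l/v_i), which reduces to the classical Scheil law for ξ = 1. The function name, the parameter names, and the numerical values are hypothetical.

```python
def liquid_concentration(f_s, c0, k, delta_s=0.0, v_ratio=0.0):
    """Liquid concentration at solid fraction f_s (assumed modified Scheil law).

    f_s     : solid fraction, 0 <= f_s < 1
    c0      : nominal alloy concentration
    k       : segregation coefficient
    delta_s : solidification shrinkage
    v_ratio : v_l / v_i, liquid velocity along the growth direction
              divided by the velocity of the solidification front
    """
    if not 0.0 <= f_s < 1.0:
        raise ValueError("f_s must lie in [0, 1)")
    xi = (1.0 - delta_s) * (1.0 - v_ratio)
    if xi <= 0.0:
        # The liquid moves at least as fast as the front: the simple
        # power law of this sketch no longer applies.
        raise ValueError("v_l >= v_i: outside the range of this sketch")
    return c0 * (1.0 - f_s) ** ((k - 1.0) / xi)


if __name__ == "__main__":
    # Illustrative values only (k < 1, solute rejected into the liquid).
    for v_ratio in (-0.5, 0.0, 0.5):
        print(v_ratio, liquid_concentration(0.5, c0=3.0, k=0.17, v_ratio=v_ratio))
```

In this form the enrichment of the liquid grows rapidly as v_l approaches v_i, in line with the statement that the convective regime sets in when the fluid velocity becomes comparable to the growth rate.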

In order to show the effect of the orientation of the thermal gradient (or equiv- 
alently, of the growth direction) versus gravity, examples of solidification of Al-Cu 
alloys grown for various configurations are shown below. Considering the com- 
plexity of the interactions and coupled mechanisms involved, only qualitative ar- 
guments will be given. 

Six configurations exist, depending on the direction of solidification (horizon- 
tal, vertical upward, and vertical downward) and on the alloy concentration: 
hypo-eutectic (the rejected Al is lighter than the Cu solvent) or hyper-eutectic 
(the rejected Cu is heavier than the Al solvent). These can be grouped into three 
different convective situations: horizontal, vertical with solute stabilization of the 
liquid (Al-rich downward or Cu-rich upward), and vertical with solute destabiliza- 
tion of the liquid (Al-rich upward or Cu-rich downward). 
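
The grouping of the six configurations into three convective situations can be written as a short decision rule. The Python sketch below is only a restatement of the paragraph above; the function name and the string labels are hypothetical, with "light" standing for a rejected solute that makes the liquid lighter (the Al-rich case) and "heavy" for one that makes it heavier (the Cu-rich case).

```python
def convective_situation(growth_direction, rejected_solute):
    """Return the convective situation for a given configuration.

    growth_direction : "horizontal", "upward" or "downward"
    rejected_solute  : "light" or "heavy", relative to the bulk liquid
    """
    if growth_direction not in ("horizontal", "upward", "downward"):
        raise ValueError("unknown growth direction")
    if rejected_solute not in ("light", "heavy"):
        raise ValueError("unknown solute type")
    if growth_direction == "horizontal":
        return "horizontal"
    # For upward growth the solute-enriched liquid lies at the bottom of
    # the melt; for downward growth it lies at the top.
    enriched_liquid_at_bottom = growth_direction == "upward"
    # Stable stratification: heavy liquid below, or light liquid above.
    stable = enriched_liquid_at_bottom == (rejected_solute == "heavy")
    if stable:
        return "vertical, solute-stabilized"
    return "vertical, solute-destabilized"


if __name__ == "__main__":
    for direction in ("horizontal", "upward", "downward"):
        for solute in ("light", "heavy"):
            print(direction, solute, "->", convective_situation(direction, solute))
```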

The hypo-eutectic horizontal case is shown in Fig. 3.45. Dendrites are located at 
the bottom of the ingot cross-section because the flow, driven by the rising of the 
Al-enriched liquid, increases the chemical composition at the top, where the 
eutectic grows. Conversely, in the case of a hyper-eutectic alloy, the eutectic is 
located at the bottom and the dendrites at the top. 

In the vertically destabilizing case (Fig. 3.46) the liquid is unstable and driven 
away from the interface, which creates convective chimneys in the dendritic field. 
The solute is carried through the mush to those particular places, where the con- 
centration increases and a purely eutectic structure may eventually form. This 
causes freckles in the solid; such freckles are extremely detrimental to the me- 
chanical properties of the alloy. 


Fig. 3.45 Effect of convection on the structure of a horizontally
solidified Al-rich Al-Cu alloy: (a) metallographic picture in cross-section;
(b) sketch of the flow and its effect on the microstructure
(Dupouy and Camel 2000).

Fig. 3.46 Effect of convection on the structure of an Al-Cu alloy grown
under a vertically destabilizing configuration: downward solidification of a
hyper-eutectic alloy. One freckle appears here as a region with smaller
dendrites (Dupouy and Camel 2000).

Fig. 3.47 Effect of convection on the structure of an Al-Cu alloy grown
under a vertically stabilizing configuration: upward solidification of a
Cu-rich alloy (Dupouy and Camel 2000).


The vertically stabilizing case is shown in Fig. 3.47. No interdendritic flow
would be expected if the isoconcentration surfaces were strictly planar and horizontal.
However, defects inside the dendritic field induce radial solute gradients, which
drive convective loops in the mushy zone and cause radial segregation, possibly
leading to the formation of purely eutectic regions as in the example shown.
In contrast to the previous case, however, the average concentration in a
cross-section remains equal to the nominal concentration (absence of longitudinal
segregation).


3.4
Outlook 


The objectives of this chapter have been to introduce the reader to the fundamen- 
tal aspects of solidification in a simplified way, to show the basic physical phe- 
nomena involved in structure formation of alloys, and to explain the generation 
of defects during solidification. 

The most important parameter determining all the solidification-related
phenomena is the undercooling of the solid-liquid interface, which depends
essentially on the local chemical composition and curvature. The kinetic undercooling
is negligible in the case of metallic alloys, which present a rough interface, but
may reach several kelvin for materials exhibiting faceted growth, such as
intermetallics or semiconductors.

Therefore the structuring of the alloy is the result of the competition between:

• diffusion in the liquid, which governs the chemical field close
to the interface: solutal undercooling decreases as
microstructure size decreases

• capillary forces: capillary undercooling decreases as
microstructure size increases.


This competition is active continuously in all structuring processes: interface de- 
stabilization and formation of dendrites, cells, eutectics, and grains. 

Simultaneously, the solid-liquid interface interacts with the surroundings. The 
essential external parameter acting on solidification is the thermal field applied 
to the alloy through the heat fluxes extracted, or generated, in the growth facility. 
This heat extraction is responsible for the spatial temperature distribution, acting 
on most of the structuring processes through the thermal gradient, and for the 
temperature variation with time, which fixes the growth rate. 

Defects are related to foreign elements, which lead to chemical segregation, nu- 
cleation of particles or bubbles, or act as nucleating agents. A second effect 
concerns the interaction of convection with the solidification process through 
perturbation of the thermal and chemical fields. It should be kept in mind that 
the convective patterns are often coupled with the solidification process. This is 
a consequence of the solute rejection in the liquid, which makes the problem ex- 
tremely difficult to tackle. 

Owing to such complexity, the present development in solidification research 
tends essentially toward numerical simulation. This is the only way to get expla- 
nations or predictive conclusions in the industrial processes, which are geometri- 
cally complex and time-dependent. Currently, software is being developed in 
order to fully solve the problem in real configuration, computing the tempera- 
tures, velocities, solute field, and structures (dendrite or eutectic sizes, etc.). The 
models generally take into account the mechanical evolution of the material after 
solidification and the associated defect generation. 

Numerical simulation is also at the forefront of more fundamental research, es- 
pecially by the use of phase field simulation techniques, which give new insight 
into the structuring phenomena and their interaction with the surroundings.

Fig. 3.48 Phase field simulation of equiaxed growth. Three equivalent
seeds are introduced into an undercooled melt. At the end of the
growth, the equiaxed grains are closely connected to each other. The
blue color in the last picture represents liquid which is unsolidified due
to solute rejection and trapping in the dendrite arms or between the
grains.


This numerical technique is particularly well adapted to take into account the 
complex shape of the solid-liquid interface. An example of simulation of equi- 
axed growth is given in Fig. 3.48. A comprehensive review of this contribution 
can be found in Boettinger et al. (2002) and partially in the introduction to phase 
field methods given in Chapter 10.


Nomenclature



lattice parameter, structural unit size 
Burgers vector 

chemical composition 

fraction (of solid or liquid) 

gravity vector 

vacancy or hole 

segregation coefficient 

Boltzmann constant

liquidus or solidus slopes 

pressure 

radius (of nuclei, of curvature, of dendrite tip, ...)
fluid velocity 

solid-liquid interface velocity 

mole fraction 


x, y,z axes 




heat capacity 

diffusion coefficient 

Young’s modulus 

Grashof number 

Gibbs free energy 

enthalpy 

nucleation rate (s⁻¹)

flux and flux density 

kinetic coefficient or permeability coefficient in Darcy’s law 
a number (of nuclei, particles etc.) 
Peclet number 

ideal gas constant 

Reynolds number 

entropy 

temperature 

volume (V,, atomic volume) 


Jackson’s reduced transformation entropy 
thermal dilatation coefficient 

solutal dilatation coefficient 

surface energy (solid-liquid, liquid-gas, etc.)
boundary layer thickness

relative density jump at the interface, $(\rho_s - \rho_l)/\rho_l$
fluctuation, strain, perturbation 

curvature (1/r) 

characteristic solidification structure periodicity or length 
viscosity 

stress 




Γ Gibbs parameter (γ/ΔS, K m)

Λ thermal conductivity

ΔG, ΔH, ΔS formation/migration/etc. energy values (differences)
undercooling 
supersaturation 


contact angle 
reduced supersaturation 
gradient 




Subscripts or superscripts 


cap capillary 

s solid 

l liquid 

i at the solid-liquid interface


m melting 
0 reference 
ch chemical 
S surface

T thermal


4
Lattice Statics and Lattice Dynamics 


Véronique Pierron-Bohnes and Tarik Mehaddene 


4.1 
Introduction: The Binding and Atomic Interaction Energies 


The cohesion of solids is mainly due to the electrostatic attraction between the 
negatively charged electrons and the positively charged nuclei. The magnetic 
forces and the repulsions between electrons have a weak effect on cohesion but 
are to be taken into account to give a good description of the total energy of 
the crystal, and are responsible for the relative stability of the different phases. 
The cohesive energy is the energy that must be added to the crystal to separate 
its different components into neutral atoms at rest at an infinite distance (Kittel 
1976). 

Crystals of inert gases have a cohesion based on polarization effects leading to 
polar interactions between induced dipole moments (van der Waals forces). Their 
cohesive energy is some tens of meV per atom. Ionic crystals are composed of 
positive and negative ions with strong electrostatic interactions (the Madelung en- 
ergy) and a strong cohesion of tens of eV per atom. Covalent crystals also have 
strong bonds (some eV per atom). Two atoms share a pair of electrons, one elec- 
tron coming from each atom. Covalent bonds are very directional and the pair of 
electrons can be found with high probability near the bond line. This makes possible
the stability of very open structures, such as the diamond structure.1)

In metals, the atoms are not completely ionized but the outer-shell electrons 
are free to move across the sample, giving rise to a high electrical conductivity. 


1) An open structure is a non-compact structure. The compactness is defined as
the ratio between the volume occupied by the atoms, considered as spheres in contact,
and the total volume of the crystal cell. The two most compact structures are the
face-centered cubic (fcc) structure and the hexagonal close-packed structure (hcp, with
a c/a ratio of 1.633). Both have a compactness of 0.74, whereas the diamond and the
body-centered cubic (bcc) structures have a compactness of 0.34 and 0.68, respectively.
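
The compactness values quoted in this footnote follow from simple hard-sphere geometry. The short Python sketch below recomputes them; the function name and argument names are chosen for the illustration only.

```python
import math


def packing_fraction(atoms_per_cell, radius_over_a, cell_volume_over_a3=1.0):
    """Volume of touching spheres divided by the cell volume.

    radius_over_a       : sphere radius in units of the lattice parameter a
    cell_volume_over_a3 : cell volume in units of a**3 (1 for a cubic cell)
    """
    sphere_volume = 4.0 / 3.0 * math.pi * radius_over_a**3
    return atoms_per_cell * sphere_volume / cell_volume_over_a3


# Cubic cells: spheres touch along the face diagonal (fcc) or along the
# body diagonal (bcc, diamond).
fcc = packing_fraction(4, math.sqrt(2.0) / 4.0)
bcc = packing_fraction(2, math.sqrt(3.0) / 4.0)
diamond = packing_fraction(8, math.sqrt(3.0) / 8.0)

# hcp: two atoms per primitive cell of volume (sqrt(3)/2) a^2 c,
# spheres of radius a/2, ideal ratio c/a = sqrt(8/3).
c_over_a = math.sqrt(8.0 / 3.0)
hcp = packing_fraction(2, 0.5, math.sqrt(3.0) / 2.0 * c_over_a)

if __name__ == "__main__":
    for name, value in (("fcc", fcc), ("hcp", hcp), ("bcc", bcc), ("diamond", diamond)):
        print(f"{name}: {value:.2f}")
```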
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In addition to the electrostatic energy between all the charges present, a repul- 
sive interaction between electrons has to be taken into account because of Pauli’s 
exclusion principle: two electrons at the same position cannot have all their quan- 
tum numbers equal. When two atoms come close, their electronic distributions 
overlap and an electron initially belonging to one atom can occupy a free orbital 
of the other atom. Since free states have comparatively high energy levels this in- 
duces a strong repulsive force between atoms at a very close distance. 

Moreover, the electronic levels of the atoms, initially all at the same energy,
undergo a lifting of degeneracy for the outer electronic shell when the atoms come
close (as in a diatomic molecule, an atomic level at E separates into a bonding level
at E − δE and an anti-bonding level at E + δE). This induces a lowering of the
energy of the valence electrons, because the valence band is not full. In the transition
metals and their alloys, the binding energy is due to the outer s and p shells
but also contains a contribution of the inner, more localized d or f shells. The
cohesion energy varies considerably depending on the metal: it is typically about 1 eV
in alkali metals and as high as 8-9 eV in refractory metals such as Ta, W, Re, and
Os. These aspects will be discussed in more detail in Chapter 11.

The crystal structures of metals are generally compact: fcc or bcc. As the atoms
approach to a close distance, the energy decreases due to hybridization and to the
Madelung energy. This binding energy is thus responsible for an attractive force
whereas Pauli’s exclusion principle is responsible for a repulsive force. The
equilibrium distance corresponds to the exact compensation of these two forces. Metal
bonds have no definite directions relative to the central atom; as in the covalent 
crystals, there are no forces perpendicular to the lines connecting two neighbor- 
ing atoms. The relative stability of the structures is determined by the values of 
the total energy in the different possible structures. This induces a great variabil- 
ity of the metal structures in comparison to ionic, van der Waals, or covalent crys- 
tals: the phase diagrams of metallic alloys are thus often richer and more complex 
than the phase diagrams of systems forming, for example, ionic compounds. For 
this reason too, metals and alloys are less brittle, more elastic, and more plastic 
than covalent structures. In alloys, bonding is often mixed and there is some 
ionic or covalent contribution due to the electropositivity difference between the 
different metals, for example. 

The evaluation of the total energy of an intermetallic alloy is quite complex
because there is a multitude of electrons interacting with many nuclei. It is nec- 
essary to make some approximations when writing the Hamiltonian and when 
calculating the total energy from this Hamiltonian in the different accessible 
states. 

On the basis of the reasoning of Born and Oppenheimer (1927), many quan- 
tum mechanical calculations of the electronic structure of a system of atoms treat 
the nuclei as classical point masses. The energy of a system of atoms is adiabatically
parameterized by the positions of the nuclei (adiabatic approximation). Accurate
methods exist for calculating the energy of a few atoms in some configurations;
an overall evaluation of the energy hypersurface, however, is impossible
and, even if it were carried out, one would not gain physical insight.

Several approximations to the many-body problem have been developed (Carlsson
1990; Moriarty et al. 2002; Pettifor et al. 2002) designed to describe certain
aspects of interest of a given system. The different approaches used to approxi- 
mate the adiabatic energy hypersurface may be classified accordingly in three 
groups. 

• The total energy is decomposed into many-body potentials,
i.e., pair, three-body, four-body ..., N-body potentials. 
Carlsson (1990) denotes these many-body potentials as 
cluster potentials. However, there has been no unique 
prescription for constructing a many-body potential which 
can be used for any configuration of a given system. 
Moreover, the many-body potentials in general exhibit a 
rather slow convergence. 

• Effective potentials show a faster convergence. Carlsson
(1990) refers to this large class of potentials as cluster 
functionals. The energy is written as a sum of a pair 
interaction term and a rest term. The rest term is written 
formally as the sum of contributions of single atoms, the 
contribution of a single atom depending on the surrounding 
of the atom as a function of atom pairs, triplets etc. Again no 
clear prescription of how to construct cluster functionals 
exists. They are often constructed according to physical 
intuition and therefore their transferability is often limited. It 
has been demonstrated recently (Drautz et al. 2004) that by 
using an adequate definition of perfectly transferable many- 
body potentials, a systematic and objective comparison of the 
existing potentials could be performed. 

• The cluster expansion method writes the total energy as a
sum of contributions over all conceivable clusters in the 
system. This method has been used mainly in the past with a 
frozen lattice: the atoms are fixed on certain lattice positions 
and there are only chemical degrees of freedom. The cluster 
expansion method was introduced by Sanchez et al. (1984). 
Extensive work was also done by Zunger and coworkers (Lu 
et al. 1991; Zunger 1994) in an effort to handle the effect of 
local relaxations and long-range elastic interactions. For 
practical calculations the cluster expansion has to be 
terminated at a maximum cluster; Finel (1987, 1994) has 
proposed a method to produce a convergent series of 
approximations. More details can be found in Chapter 11 or 
in Mueller (2003). 


In any case, the simplest interaction Hamiltonian [Eq. (1)] is based on empiri- 
cal pairwise energies and has been shown to be valid in many metals (Ducastelle 
1970; Bieber and Gautier 1984).

$$H = \sum_{I,J,l,m} \epsilon_{lm}^{IJ}\, p_l^I\, p_m^J \qquad (1)$$

Here the summation runs on the atomic species I, J and the sites l, m. $p_l^I$ is the
occupation operator of site l by atom I. $\epsilon_{lm}^{IJ}$ is the pair interaction energy between
atoms I, J at sites l, m. It depends on the positions of the sites. In disordered
alloys, an isotropic approximation has to be made with an average lattice parameter.
Often, due to the size mismatch of the constituents of an alloy, the atoms do not 
take their ideal lattice positions, but relax into a lower energy state where the 
atomic positions are displaced from their ideal lattice sites. The relaxation energy 
of a given configuration is defined as the energy difference between the energy 
of the configuration of atoms on an ideal undistorted lattice and the energy of 
the configurations with the atomic positions locally relaxed in the whole crystal. 
For many systems, the calculation of phase diagrams requires local relaxations 
in the cluster expansion coefficients to be taken into account (Lu et al. 1991; 
Zunger 1994). The relaxations can be taken into account in molecular dynamics, 
for example, for pure metals or ordered alloys, by using periodic conditions. In 
disordered alloys, the size of the system is quite limited and the calculation is per- 
formed by averaging on several small samples with random filling of the lattice 
sites (Grange et al. 2000; Galanakis et al. 2000). 
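
As an illustration of a pairwise Hamiltonian on a frozen lattice, the Python sketch below sums pair energies over a list of neighbor pairs. It is a deliberately minimal model: the restriction to a fixed list of pairs (each counted once), the function name, the four-site ring, and the energy values are all assumptions made for the example, not parameters of any real alloy.

```python
def pair_energy(occupation, pairs, eps):
    """Total energy of a configuration for a pairwise Hamiltonian.

    occupation : list giving the species ("A" or "B") on each site
    pairs      : iterable of (l, m) site-index pairs, each pair listed once
    eps        : dict mapping a sorted species pair to its interaction energy
    """
    total = 0.0
    for l, m in pairs:
        key = tuple(sorted((occupation[l], occupation[m])))
        total += eps[key]
    return total


if __name__ == "__main__":
    # Hypothetical nearest-neighbor energies (eV); unlike pairs are favored,
    # so the alternating arrangement has the lower energy.
    eps = {("A", "A"): -1.0, ("A", "B"): -1.1, ("B", "B"): -0.8}
    ring = [(0, 1), (1, 2), (2, 3), (3, 0)]  # four sites on a ring
    print("ordered   ABAB:", pair_energy(["A", "B", "A", "B"], ring, eps))
    print("separated AABB:", pair_energy(["A", "A", "B", "B"], ring, eps))
```

Relaxations are ignored here: the pair energies do not depend on the actual atomic positions, which is exactly the limitation discussed in the paragraph above.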

This point will be developed in more detail in Chapter 10 and we will only give 
here the general form of the total energy as a function of the lattice parameter in 
Pt and CoPt. The calculation can be made using molecular dynamics in the sec- 
ond moment approximation (Tréglia et al. 1988; Goyhenex et al. 1999). In the 
tight binding formalism (Friedel 1969), this energy is written as the sum of two 
terms, an attractive band energy and a repulsive pair interaction. The band term 
is obtained by integrating the local electronic density of states up to the Fermi 
level and gives rise to the many-body character of the potentials (Legrand et al. 
1990). The realistic density of states is replaced by a schematic rectangular density
having the same second moment [Eq. (2)], where the summation is made
on sites m within a fixed radius of neighbors of site l (usually up to the second
neighbors), $r_0$ is the nearest-neighbor distance, $\xi_{IJ}$ are the hopping integrals for
atoms I and J, and $q_{IJ}$ are parameters.


$$E_l^b = -\left\{ \sum_m \xi_{IJ}^2 \exp\left[-2 q_{IJ} \left(\frac{r_{lm}}{r_0} - 1\right)\right] \right\}^{1/2} \qquad (2)$$


The repulsive term is described by a sum of Born-Mayer ion-ion repulsions
(Ducastelle 1970) with a schematically parameterized exponential form [Eq. (3)].
The summation is made on sites m within the same radius of neighbors of site l;
$A_{IJ}$ and $p_{IJ}$ are parameters.


$$E_l^r = \sum_m A_{IJ} \exp\left[-p_{IJ} \left(\frac{r_{lm}}{r_0} - 1\right)\right] \qquad (3)$$

Fig. 4.1 (a) Band $E^b$ and ionic repulsive $E^r$ energy contributions
between two atoms in the Pt system as a function of the bond distance.
Top right: zoom on the sum showing the minimum at the nearest-neighbor
distance 0.277 nm. (b) Calculations using molecular dynamics
in pure Pt: variation of the energy per atom with the relative change in
lattice parameter (○) and its adjustment to a third-degree polynomial
(thick line). Thin line: second-degree part of the polynomial.


The values of the parameters are the same as those used in a recent study on 
Pt/Co(0001) (Goyhenex et al. 1999). For each pure metal species (Pt, Co), the pa- 
rameters were determined by fitting the cohesive energy, the lattice parameter, 
and bulk modulus (Kittel 1976). For mixed pairs (Co, Pt), the parameters are adjusted
to fit the order-disorder transition temperature or the dissolution energy
of atom I in the pure metal J. 

Figure 4.1(a) shows the $E^b$ and $E^r$ contributions per bond in pure Pt and Fig.
4.1(b) the variation of the total energy with the lattice parameter in pure Pt. We
see from Fig. 4.1(b) that the total energy is mainly quadratic with a small asymmetry
that will be responsible for anharmonicity (see Section 4.4).
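
The origin of this asymmetry can be seen in a stripped-down version of the second-moment energy. The Python sketch below keeps only the 12 nearest neighbors of an fcc metal, fixes the repulsive prefactor by requiring the minimum to sit at the nearest-neighbor distance, and uses illustrative parameter values; none of these choices is taken from the fitted Pt parameters mentioned in the text.

```python
import math

Z = 12  # number of nearest neighbors in the fcc structure


def sma_energy(x, xi, q, p):
    """Energy per atom (eV) for a relative bond-length change x = r/r0 - 1.

    Band term:      -sqrt(Z * xi**2 * exp(-2 q x))
    Repulsive term:  Z * A * exp(-p x)
    A is not a free parameter here: it is chosen so that the energy is
    minimal at x = 0 (assumption of this sketch).
    """
    A = xi * q / (math.sqrt(Z) * p)
    band = -math.sqrt(Z * xi**2 * math.exp(-2.0 * q * x))
    repulsion = Z * A * math.exp(-p * x)
    return band + repulsion


if __name__ == "__main__":
    xi, q, p = 2.7, 4.0, 10.0  # illustrative values only
    for x in (-0.05, -0.01, 0.0, 0.01, 0.05):
        print(f"x = {x:+.2f}  E = {sma_energy(x, xi, q, p):.4f} eV")
```

Because the repulsive exponent p is larger than the band exponent q, compression costs more energy than an equal expansion, which is the asymmetry about the minimum that gives rise to anharmonic effects.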

In intermetallic compounds and alloys, in ordered phases, the structure may be
tetragonal due to either the anisotropy of the Bravais lattice itself or the anisotropic
distribution of atoms on the lattice sites. The L1$_0$ structure, for example
CoPt shown in the insert in Fig. 4.2(a), consists of a succession of pure Co and
pure Pt planes along the [001] direction. This direction is thus no longer equivalent
to the two other ⟨001⟩ directions. The lattice parameter a in the [100] and
[010] directions will be a little larger than c, the lattice parameter in the [001]
direction, because of the pure (001) planes of the largest atoms (here Pt).

Figure 4.2 shows the variation of the energy per atom as a function of a and c 
around its minimum. For each a or c value, the energy per atom value plotted 
corresponds to the local minimum of U(c) (or U(a)) at constant a or c, respectively.
On the same graph is plotted the variation of c or a, respectively, corresponding
to this minimum.

Fig. 4.2 Calculations using molecular dynamics in long-range ordered CoPt.
●: Variation of the energy per atom when varying c (a) or a = b (b) and its adjustment
to a third-degree polynomial. ○: Variation of one lattice parameter when varying the other
one. In (a), the slope −0.50(1) gives $\nu_a$ = 0.50(1). In (b), the slope $-2\nu_c/(1 - \nu_c)$
= −1.56(1) gives $\nu_c$ = 0.44(1). Insert in (a): CoPt L1$_0$ phase: pure planes of platinum
atoms (dark) and cobalt atoms (light) alternate along the [001] direction of the
initial fcc phase.


4.2
Elasticity of Crystalline Lattices



The elasticity of a metal is directly related to the chemical bonding between atoms 
and can be considered at both macroscopic and microscopic scales. In this sec- 
tion, we will describe both approaches and make the link between them. 


Fig. 4.3 (a) Variation of the stress (F/s, where F is the force and s the section of the wire)
necessary to make a given change of length (Δl) of a wire. A reversible elastic deformation
is observed between O and E. Above the elastic limit (E.L.) the deformation is
plastic and the reversibility is lost. Coming back to a zero stress, an elastic range is
again observed with the same slope (ABC) but not necessarily with the same range. The
slope is a characteristic of the material and corresponds to a deformation of the lattice
without any bond change. (b) Example of homogeneous deformation of a solid.




4.2.1 
Linear Elasticity 


When a stress (force per unit area) is applied to a solid, its external shape
changes. If the stress is small enough, this change is reversible
(Fig. 4.3). The deformation is then due only to a deformation of the lattice cells,
without any bond breaking. The energetic cost of such a deformation of all lattice
cells is, however, very high. Consequently, at higher stresses and deformations, plasticity is
observed. It corresponds to a deformation by slip of some lattice cells along
high-density atomic planes: this is the motion of a dislocation (see Chapter 6).
Very few chemical bonds are broken and re-formed in the elementary step.
The energetic cost of this elementary step is thus low, so the elastic limit
is also low (generally less than 1%) and depends strongly on the dislocation
density in the sample, which in turn depends on the mechanical and thermal
history of the sample.


4.2.2 
Elastic Constants 


If the lattice is homogeneously deformed (Fig. 4.3b), the position x′ of a point A
of the lattice after deformation can be calculated from its initial position x using
Eq. (4), where $\alpha$ and $\beta$ are x, y, or z indices and $u_{\alpha\beta}$ is the strain tensor.


\[ x'_\alpha = x_\alpha + \sum_\beta u_{\alpha\beta}\, x_\beta \tag{4} \]


Its components are assumed to be small compared with unity. The energy
can then be expressed as a Taylor expansion in $u_{\alpha\beta}$. Limiting this expansion to
second-order terms and taking into account the symmetry of the problem as
well as the relationships expressing that the energy is a minimum, it can
be shown (Born and Huang 1968) that the energy density depends only on the
symmetrical parameters $u_{\alpha\beta} + u_{\beta\alpha}$.

Voigt proposed to introduce: 


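\[ \begin{aligned}
s_\rho &= u_{\rho\rho} && \text{for } \rho = 1 \text{ to } 3 \ (x \text{ to } z) \\
s_\rho &= u_{\alpha\beta} + u_{\beta\alpha}, \ \text{with } \rho = 9 - \alpha - \beta && \text{for } \alpha \neq \beta
\end{aligned} \tag{5} \]


(for example: $s_4 = u_{23} + u_{32}$)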
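
The index rule of Eq. (5) is easy to mistype, so a small Python sketch may help. It is only an illustration under stated assumptions: the function names `voigt_index` and `voigt_strains` are hypothetical, and the strain tensor is assumed to be given as a 3 × 3 nested list with components $u_{\alpha\beta}$.

```python
def voigt_index(alpha: int, beta: int) -> int:
    """Map a pair of Cartesian indices (1..3) to the Voigt index of Eq. (5)."""
    if not (1 <= alpha <= 3 and 1 <= beta <= 3):
        raise ValueError("Cartesian indices must be 1, 2, or 3")
    # Diagonal components keep their index; off-diagonal ones use 9 - alpha - beta.
    return alpha if alpha == beta else 9 - alpha - beta


def voigt_strains(u):
    """Return [s_1, ..., s_6] from a 3x3 strain tensor u (nested lists, assumed layout)."""
    s = [0.0] * 6
    for a in range(1, 4):
        for b in range(a, 4):
            rho = voigt_index(a, b)
            if a == b:
                s[rho - 1] = u[a - 1][a - 1]
            else:
                # Only the symmetric combination enters the energy density.
                s[rho - 1] = u[a - 1][b - 1] + u[b - 1][a - 1]
    return s


# Hypothetical small strain tensor used only to exercise the functions.
example_u = [[0.010, 0.002, 0.0],
             [0.001, 0.000, 0.0],
             [0.000, 0.000, -0.005]]
example_s = voigt_strains(example_u)
```

For instance, `voigt_index(2, 3)` returns 4, which reproduces the example $s_4 = u_{23} + u_{32}$ given with Eq. (5).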


The components $s_\rho$ with Voigt’s indices are the elastic strains of classical
elasticity theory. They specify the size and shape of a macroscopic specimen as
well as of the lattice cell.

In alloys and intermetallic compounds, there may be several atoms within a lattice
cell. Equations (4) and (5) describe the deformation of the lattice cell. The
components $w^k_\alpha$ of the translation of atom k inside the cell obey the translation
symmetry but may follow a different deformation law than the cell (e.g., the piezoelectric
effect). The energy density $\mathcal{U}/V$ contains contributions from both types of
deformation:


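\[ \mathcal{U}/V = \frac{1}{2} \sum_{\alpha,k,\beta,k'} A^{kk'}_{\alpha\beta}\, w^k_\alpha w^{k'}_\beta + \frac{1}{2} \sum_{\alpha,k,\rho} B^k_{\alpha\rho}\, w^k_\alpha s_\rho + \frac{1}{2} \sum_{\rho,\sigma} \tilde{c}_{\rho\sigma}\, s_\rho s_\sigma \tag{6} \]


k and k′ scan the atoms in the lattice cell (the basis of the crystal); $A^{kk'}_{\alpha\beta}$, $B^k_{\alpha\rho}$, $\tilde{c}_{\rho\sigma}$
are constants.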

The energy density is not affected if an arbitrary vector is added to all the $w^k$.
This shows that the energy density depends only on the differences in translation
between an origin atom and the other atoms, or between the different sublattices.
These shifts are microscopic and do not affect the macroscopic dimensions of the
sample. They are described as internal strains, as opposed to the external strains
$s_\rho$. The internal and external strains are coupled through the mixed term in the
energy density. Thus, when a body is elastically deformed, some internal strains
can be induced. Their values minimize the total energy. Setting the derivatives
of the energy density with respect to the $w^k_\alpha$ to zero provides linear expressions of
the $w^k_\alpha$ as functions of the $s_\rho$. Finally, the energy density can be written as Eq. (7),
without loss of generality.


\[ \mathcal{U}/V = \frac{1}{2} \sum_{\rho,\sigma} c_{\rho\sigma}\, s_\rho s_\sigma \tag{7} \]


In the presence of internal strains, the macroscopic elastic constants $c_{\rho\sigma}$ defined
by Eq. (7) contain a contribution due to internal strain. In any case, it is not
possible to observe the external strains while keeping the induced internal strains
at zero.

This result is the classical expression of elasticity. The stresses are given by the 
first derivatives of the energy density with respect to the elastic strains [Eq. (8)]. 


\[ S_\rho = \sum_\sigma c_{\rho\sigma}\, s_\sigma \tag{8} \]


The first three components are the forces per unit area applied along direction
$\rho$, perpendicular to the surface $\mathcal{A}_\rho$ normal to the $\rho$ axis (Fig. 4.4b):
for example, $S_1 = T_{11}/\mathcal{A}_1$. The other three components are the shear stresses:
they correspond to forces applied parallel to two opposite surfaces (Fig. 4.4c):
for example, $S_6 = T_{12}/\mathcal{A}_2 = T_{21}/\mathcal{A}_1$.

Fig. 4.4 Examples of deformations of a solid: (a) compression, (b) elongation, (c) shear.

Equation (8) is called Hooke’s law: the elastic stresses are linear functions of
the elastic strains. The tensor of the elastic constants $c_{\rho\sigma}$ is symmetric: $c_{\rho\sigma} = c_{\sigma\rho}$.
There are altogether 21 independent coefficients in the general case. If there are
no internal strains (this is the ideal defect-free case because the presence of a center
of symmetry in the crystal implies the absence of internal strains), the elastic
constants satisfy the Cauchy relationships.


\[ c_{23} = c_{44}, \quad c_{13} = c_{55}, \quad c_{12} = c_{66}, \quad c_{14} = c_{56}, \quad c_{25} = c_{46}, \quad c_{36} = c_{45} \tag{9} \]

There are then only 15 independent coefficients in the general case without internal
strains (but the Cauchy relationships are generally not fulfilled in metals, and are
fulfilled in some ionic crystals only).


4.2.3 
Cases of Cubic and Tetragonal Lattices 


In a cubic lattice, Eqs. (10) apply. 


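\[ \begin{gathered}
c_{11} = c_{22} = c_{33}, \quad c_{44} = c_{55} = c_{66}, \quad c_{23} = c_{13} = c_{12}, \\
c_{14} = c_{15} = c_{16} = c_{24} = c_{25} = c_{26} = c_{34} = c_{35} = c_{36} = c_{45} = c_{46} = c_{56} = 0
\end{gathered} \tag{10} \]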


There are only three nonzero independent elastic constants. 

In the case of Pt, the plot of the energy density, $\mathcal{U}/V$, as a function of c keeping
a = b constant (not shown) gives Eq. (11) and yields $c_{11} = 3.468(15) \times 10^{11}$ N m$^{-2}$, in good
agreement with the value found in the CRC Handbook of Chemistry and Physics
(1997): $3.467 \times 10^{11}$ N m$^{-2}$.


\[ \mathcal{U}/V \ (10^{11}\ \mathrm{N\,m^{-2}}) = -0.624 + 1.7338\,(c/c_0 - 1)^2 \tag{11} \]
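
The step from Eq. (11) to the elastic constant can be made explicit. For a deformation along c alone, only the strain $s_3 = c/c_0 - 1$ is nonzero, so Eq. (7) reduces to a parabola with curvature $c_{33}/2$, and in a cubic crystal $c_{33} = c_{11}$. The Python sketch below does this arithmetic; the function name `elastic_constant_from_curvature` is a hypothetical helper introduced for this example only.

```python
# Quadratic coefficient of Eq. (11), in units of 1e11 N m^-2 (value from the text).
QUADRATIC_COEFF = 1.7338


def elastic_constant_from_curvature(coeff: float) -> float:
    """Eq. (7) with only s_3 nonzero reads U/V = const + (c_33 / 2) * s_3**2,
    so the elastic constant is twice the fitted quadratic coefficient."""
    return 2.0 * coeff


c11_pt = elastic_constant_from_curvature(QUADRATIC_COEFF)
print(f"c11(Pt) = {c11_pt:.4f} x 1e11 N m^-2")
```

The result, about 3.468 in units of $10^{11}$ N m$^{-2}$, is the value quoted in the text.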


In a tetragonal lattice, index 3 is no longer equivalent to indices 1 and 2, and there
are six nonzero independent elastic constants:


\[ c_{11} = c_{22} \neq c_{33}, \quad c_{44} = c_{55} \neq c_{66}, \quad c_{23} = c_{13} \neq c_{12} \tag{12} \]


4.2.4
Usual Elastic Moduli


The best-known macroscopic elastic modulus is the bulk modulus K, the
inverse of the compressibility $\kappa_T$, which measures the resistance of the crystal to a
volume change under pressure (Fig. 4.4a). This resistance is due to the repulsive
term in the interatomic energy: when the atoms are brought closer together, the exclusion
principle energy and the exchange energy increase rapidly.


\[ K = \frac{1}{\kappa_T} = -\left[ \frac{1}{V} \left( \frac{\partial V}{\partial P} \right)_T \right]^{-1} = V \left( \frac{\partial^2 \mathcal{U}}{\partial V^2} \right)_T \tag{13} \]


K is directly related to the curvature of the cohesion energy as a function of the
volume, or the interatomic distance r, or the lattice parameter a (Fig. 4.1b). If we
expand the cohesion energy to third order in $\delta = (r - r_0)/r_0 = (a - a_0)/a_0$
and write:


\[ \mathcal{U} = \mathcal{U}_0 + \mathcal{E} \delta^2 - \mathcal{G} \delta^3 \tag{14} \]


we get $K = 2\mathcal{E}/9V_0$, where $V_0$ is the atomic volume.
In the case of Pt in Fig. 4.1(b), the energy per atom,


\[ \mathcal{U}/N\ (\mathrm{eV}) = -5.86 + 122\,\delta^2 - 577\,\delta^3 \tag{15} \]


and $a_0 = 0.3917$ nm give $K = 2.887 \times 10^{11}$ N m$^{-2}$, in perfect agreement with the
value tabulated by Kittel (1976): $2.88 \times 10^{11}$ N m$^{-2}$.
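
The relation between K and the quadratic coefficient of Eq. (14), written $\mathcal{E}$ here, can be checked numerically. Since $V = V_0 (1 + \delta)^3$, one has $dV = 3 V_0\, d\delta$ at $\delta = 0$, so that $\partial^2 \mathcal{U}/\partial V^2 = 2\mathcal{E}/9V_0^2$ per atom and $K = V_0\, \partial^2 \mathcal{U}/\partial V^2 = 2\mathcal{E}/9V_0$. The Python sketch below evaluates this for the Pt coefficients of Eq. (15). It assumes an fcc lattice with four atoms per cubic cell, so that $V_0 = a_0^3/4$; the function name `bulk_modulus_fcc` is hypothetical.

```python
EV_TO_JOULE = 1.602176634e-19  # exact SI value of the electronvolt in joules


def bulk_modulus_fcc(e2_ev: float, a0_nm: float) -> float:
    """Return K = 2*E2 / (9*V0) in N m^-2.

    e2_ev : quadratic coefficient of U(delta) per atom, in eV (Eq. 15)
    a0_nm : cubic lattice parameter in nm; V0 = a0**3 / 4 is assumed (fcc)
    """
    a0_m = a0_nm * 1e-9
    atomic_volume = a0_m ** 3 / 4.0  # four atoms per conventional fcc cell
    return 2.0 * e2_ev * EV_TO_JOULE / (9.0 * atomic_volume)


k_pt = bulk_modulus_fcc(e2_ev=122.0, a0_nm=0.3917)
print(f"K(Pt) = {k_pt:.3e} N m^-2")
```

With the rounded coefficient 122 eV this gives a value close to $2.89 \times 10^{11}$ N m$^{-2}$, which agrees with the figure quoted in the text to within the rounding of the coefficient.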

The other elastic constants are also directly related to the interatomic forces,
but their calculation is not as easy. Young’s modulus E relates the relative change
in length of a wire to the tensile force applied to it (Fig. 4.4b); a priori it depends
on the direction:


\[ S_1 = \frac{T_{11}}{\mathcal{A}_1} = E_1 \frac{\Delta x_1}{x_1} = E_1 u_{11} = E_1 s_1 \tag{16} \]


where $\mathcal{A}_1$ is the cross-sectional area of the wire and $T_{11}$ the force applied to the
wire.

Poisson’s ratio $\nu$ describes the associated change in size in the transverse direction:


\[ u_{22} = \frac{\Delta x_2}{x_2} = -\nu_{12} \frac{\Delta x_1}{x_1} = -\nu_{12} s_1 \tag{17} \]











A priori, Poisson’s ratios are different if the structure has no cubic symmetry.
Writing Eq. (18),


\[ S_1 = E_1 s_1 = c_{11} s_1 + c_{12} s_2 + c_{13} s_3 = (c_{11} - \nu_{12} c_{12} - \nu_{13} c_{13})\, s_1 \tag{18} \]


leads to Eq. (19)


\[ E_1 = c_{11} - \nu_{12} c_{12} - \nu_{13} c_{13} \tag{19} \]


and equivalent relationships for the other directions.
From Eq. (20) we get further relationships, making it possible in simple cases to
express the E’s and $\nu$’s in terms of the c’s.


\[ S_2 = c_{21} s_1 + c_{22} s_2 + c_{23} s_3 = (c_{21} - \nu_{12} c_{22} - \nu_{13} c_{23})\, s_1 = 0 \tag{20} \]
In a cubic structure, we get Eqs. (21), (22), and finally (23).


\[ E = c_{11} - 2\nu c_{12} \tag{21} \]


\[ \nu = \frac{c_{12}}{c_{11} + c_{12}} \tag{22} \]


\[ E = \frac{(c_{11} - c_{12})(c_{11} + 2c_{12})}{c_{11} + c_{12}} \tag{23} \]





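Young’s modulus and bulk modulus can be related by writing that the change
in size in direction $\rho$ is due to the stress S applied on all three faces, with
additivity of the strains:


\[ s_\rho = \frac{S}{E_\rho} - \sum_{\sigma \neq \rho} \nu_{\sigma\rho} \frac{S}{E_\sigma} \tag{24} \]

and

\[ \frac{1}{K} = \sum_\rho \frac{s_\rho}{S} = \sum_\rho \frac{1}{E_\rho} \Bigl( 1 - \sum_{\sigma \neq \rho} \nu_{\rho\sigma} \Bigr) \tag{25} \]

In the cubic symmetry case, this leads to:

\[ K = \frac{E}{3(1 - 2\nu)} \tag{26} \]


In Pt we get $\nu = 0.42$, in good agreement with the values found in tables (Le
Neindre): $\nu = 0.396$.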
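
The cubic relations (21) to (23) and (26) are simple enough to evaluate directly. The Python sketch below is an illustration only: the function name `cubic_moduli` is hypothetical, $c_{11}$ is the Pt value quoted in Section 4.2.3, and $c_{12}$ is an assumed illustrative value that does not come from the text.

```python
def cubic_moduli(c11: float, c12: float):
    """Return (E, nu, K) for a stress applied along a cube axis of a cubic crystal."""
    nu = c12 / (c11 + c12)                                  # Eq. (22)
    young = (c11 - c12) * (c11 + 2.0 * c12) / (c11 + c12)   # Eq. (23)
    bulk = young / (3.0 * (1.0 - 2.0 * nu))                 # Eq. (26)
    return young, nu, bulk


# Units of 1e11 N m^-2. C11 is quoted in the text; C12 is an assumed value.
C11, C12 = 3.467, 2.51
young, nu, bulk = cubic_moduli(C11, C12)
print(f"E = {young:.3f}, nu = {nu:.3f}, K = {bulk:.3f}  (E, K in 1e11 N m^-2)")
```

Substituting Eqs. (22) and (23) into Eq. (26) gives $K = (c_{11} + 2c_{12})/3$, which provides a convenient consistency check on the three relations.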

When a shear stress is applied to a cube of the crystal, it is deformed according
to Fig. 4.4(c). The shear modulus G quantifies the resistance to this deformation, which
is described by the angle $\theta$: $s_6 = u_{12} + u_{21} = 2\theta$. To a first approximation, the
deformation occurs at constant volume and we can define the shear modulus G by:


\[ S_6 = \frac{T_{12}}{\mathcal{A}_2} = G \frac{\Delta x_1}{x_2} = 2G u_{12} = G s_6 \tag{27} \]


To describe the shear modulus correctly in a simulation, we have to take into account
more neighbors than the first two coordination shells.


4.2.5
Link with Sound Propagation


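The pressure modulations of sound induce vibrations in solids. These vibrations
can propagate thanks to the elastic properties of materials.

Consider an elemental rod of cross-section $\mathcal{A}$ made of a material with density $\rho$ (Fig.
4.5). The elemental mass $\rho \mathcal{A}\, dx_\alpha$, positioned between $x_\alpha$ and $x_\alpha + dx_\alpha$ at rest, is
subjected to $-T_{\alpha\alpha}(x_\alpha)$ and $T_{\alpha\alpha}(x_\alpha + dx_\alpha)$ at $x_\alpha$ and $x_\alpha + dx_\alpha$, respectively. We can
write the equation of motion for this mass element:


\[ T_{\alpha\alpha}(x_\alpha + dx_\alpha) - T_{\alpha\alpha}(x_\alpha) = \rho \mathcal{A}\, dx_\alpha \frac{\partial^2 x'_\alpha}{\partial t^2} \tag{28} \]


and the elasticity law, assuming that the only deformation is along $\alpha$, is:


\[ T_{\alpha\alpha}(x_\alpha) = \mathcal{A}\, c_{\alpha\alpha} \frac{\partial (x'_\alpha - x_\alpha)}{\partial x_\alpha} \tag{29} \]


Thus we get the propagation equation:


\[ \frac{\partial^2 x'_\alpha}{\partial x_\alpha^2} = \frac{1}{v_\alpha^2} \frac{\partial^2 x'_\alpha}{\partial t^2} \tag{30} \]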
where $v_\alpha = (c_{\alpha\alpha}/\rho)^{1/2}$ is the sound propagation velocity in the $\alpha$ direction of the
solid. We will see in Section 4.3 that it is related to the phase velocity of longitudinal
phonons in the $\alpha$ direction. In a system without cubic symmetry, the sound
propagates with different speeds in different directions, as shown by the different
curves of total energy $\mathcal{U}(x)$ when an atom is moved to different positions around
its equilibrium position in the lattice.

Fig. 4.5 Top: elementary rod at rest; definition of the parameters before
the deformation. Middle: changes in the case of a longitudinal wave.
Bottom: case of a transversal wave.

Fig. 4.6 Variation of the total energy when a Pt atom moves in pure Pt.
We can see that the relaxation has an important effect on the total
energy (black: without relaxation; white: with relaxation) and that the
energy well is quite isotropic (circles: toward first neighbor along
[1/2 1/2 0] and squares: toward second neighbor along [1 0 0]).

Whereas in pure Pt the curve is identical in any direction (Fig. 4.6), in an anisotropic
alloy such as CoPt the potential varies very differently according to
whether the atom moves in the plane or out of the plane (Fig. 4.7). Nevertheless, in the (001)
planes, the [110] (toward first neighbors) and [100] (toward second neighbors) directions
are equivalent.

Fig. 4.7 Variation of the total energy in CoPt when a Pt (a) or a Co (b) atom moves in
plane (white) or out of plane (black), toward a first (squares) or second (circles) neighbor.
(a) For Pt atoms, the energy well is quite isotropic in the plane, but not out of the
plane. The flattest is toward second neighbors along [0 0 1]: toward the window
of four small cobalt atoms. (b) For Co atoms, the energy well is quite isotropic, except
toward first neighbors out of the plane. The elastic energy is the largest when cobalt
atoms move toward neighboring platinum atoms.

Whereas gases and liquids can propagate only longitudinal waves, solids can
also propagate transversal waves, thanks to their shear properties (Fig. 4.5, bottom).
The calculations are very similar to the case of a compression wave. We
get a propagation velocity $(c_{\alpha\beta}/\rho)^{1/2}$ for transversal wave propagation along $\alpha$
with a vibration along $\beta$, where $c_{\alpha\beta}$ denotes the shear constant associated with the
index pair $\alpha\beta$ (for example, $c_{44}$ for propagation along a cube axis of a cubic crystal).
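
To give an order of magnitude for these velocities, the Python sketch below evaluates $v = (c/\rho)^{1/2}$ for Pt along a cube axis. It is a rough illustration, not a result from the chapter: $c_{11}$ is the value quoted in Section 4.2.3, whereas the density and the shear constant $c_{44}$ are assumed handbook-type values, and the function name `sound_velocity` is hypothetical.

```python
import math


def sound_velocity(elastic_constant: float, density: float) -> float:
    """v = sqrt(c / rho), with c in N m^-2 and rho in kg m^-3; result in m/s."""
    return math.sqrt(elastic_constant / density)


RHO_PT = 21450.0   # kg m^-3, assumed room-temperature density of Pt
C11_PT = 3.467e11  # N m^-2, value quoted in the text
C44_PT = 0.765e11  # N m^-2, assumed value, not taken from the text

v_long = sound_velocity(C11_PT, RHO_PT)   # longitudinal wave along [100]
v_trans = sound_velocity(C44_PT, RHO_PT)  # transverse wave along [100]
print(f"v_L = {v_long:.0f} m/s, v_T = {v_trans:.0f} m/s")
```

Under these assumptions the longitudinal velocity comes out near 4 km/s and the transverse velocity near 2 km/s, the shear wave being slower because $c_{44} < c_{11}$.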


4.3 
Lattice Dynamics and Thermal Properties of Alloys 


In this section, we present an overview of the theory of lattice vibrations in crys- 
tals. First, we describe the semiclassical approach to the lattice dynamics for a 
three-dimensional crystal in the harmonic approximation. The lattice dynamics 
theory being relatively heavy with a multitude of indices (Cartesian coordinates, 
atom numbers, unit cell numbers, etc.), the theory is also applied in the case of 
a simple diatomic linear chain to point out the important features. The concept of 
the normal modes of vibration or phonons is then deduced from the quantum 
description of the lattice vibrations. Calculations of the related thermodynamic 
quantities such as specific heat, thermal conductivity, and thermal expansion are 
also presented. 

At first glance, a theory of lattice vibrations would appear daunting for a system
containing typically $10^{23}$ ions interacting with each other and with an even larger
number of electrons. However, there is a natural expansion parameter for this
problem, the ratio of the electron mass to the ionic mass ($m_e/M_I \ll 1$), which allows
us to make reliable approximations and to derive a good theory. If we imagine
that, at least for small displacements, the forces binding the electrons and the
ions can be modeled as harmonic oscillators, then


\[ \text{Force} \propto e^2/a^2 \propto m_e \omega_e^2 a \sim M_I \omega_I^2 a \tag{31} \]


where e is the elementary charge; $\omega_e$ and $\omega_I$ are the electron and the ion frequencies,
respectively; a is the lattice parameter. The ratio of the ion frequency to the
electron frequency would then be


\[ \left( \frac{\omega_I}{\omega_e} \right)^2 \approx \frac{m_e}{M_I} \approx 10^{-3}\ \text{to}\ 10^{-4} \tag{32} \]


This means that ions are essentially stationary during the period of electronic motion.
One can therefore assume that the ions are stationary at fixed locations in
the lattice and that the electrons will be in their ground state for this particular
instantaneous ionic configuration. This approximation, formulated by Born and
Oppenheimer (1927), is known as the adiabatic approximation. It is based on the
fact that the typical electronic velocities are much greater than the ion velocities,
as we will see in Chapter 11. At this stage we can decouple the motion of the electrons
from that of the ions. In the following section we will derive the equation of
motion of the ions.


4.3.1 
Normal Modes of Vibration in the Harmonic Approximation 


4.3.1.1 Classical Theory 

We consider a general three-dimensional lattice containing N unit cells and p
atoms per unit cell. A unit cell, indexed by l, is situated at position
$\mathbf{r}_l = l_1 \mathbf{a}_1 + l_2 \mathbf{a}_2 + l_3 \mathbf{a}_3$, where $\mathbf{a}_1$, $\mathbf{a}_2$, $\mathbf{a}_3$ are the primitive translation vectors. The equilibrium
position of the kth atom in the lth unit cell is given by $\mathbf{r}_{lk} = \mathbf{r}_l + \mathbf{r}_k$, where $\mathbf{r}_k$
gives the equilibrium position of the kth atom in the considered unit cell. If
the atom is now vibrating around its equilibrium position, its coordinate will be
defined by $\mathbf{R}_{lk} = \mathbf{r}_{lk} + \mathbf{u}_{lk}$, where $\mathbf{u}_{lk}$ is the displacement of the kth atom
in the lth unit cell relative to its equilibrium position.

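Let us now derive the equation of motion of the atom considered. If we assume
that the total potential energy $\mathcal{U}$ of the crystal is a function of the instantaneous
positions of all the atoms, we can expand the potential energy in a Taylor series in
powers of small displacements around its minimum $\mathcal{U}_0$:


\[ \begin{aligned}
\mathcal{U} &= \mathcal{U}_0 + \sum_{lk,\alpha} \left. \frac{\partial \mathcal{U}}{\partial u_\alpha(lk)} \right|_0 u_\alpha(lk) + \frac{1}{2} \sum_{lk,\,l'k'} \sum_{\alpha\beta} \Phi_{\alpha\beta}(lk, l'k')\, u_\alpha(lk)\, u_\beta(l'k') + \cdots \\
&= \mathcal{U}_0 + \mathcal{U}_1 + \mathcal{U}_2 + \cdots
\end{aligned} \tag{33} \]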


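where the indices $\alpha$ and $\beta$ stand for the Cartesian coordinates x, y, and z. The
equilibrium value, $\mathcal{U}_0$, is a constant and is unimportant for the dynamical problems.
The second term of the expansion vanishes in the equilibrium configuration
and


\[ \Phi_{\alpha\beta}(lk, l'k') = \left. \frac{\partial^2 \mathcal{U}}{\partial u_\alpha(lk)\, \partial u_\beta(l'k')} \right|_0 \tag{34} \]
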
If the atomic displacements are now assumed to be small, so that the forces may
be regarded as essentially linear functions of the atomic displacements (Hooke’s
law), the higher orders in Eq. (33) can be neglected. Retaining only the
quadratic term in the above expansion is known as the harmonic approximation.
In this scenario, the lattice is treated as a collection of coupled simple harmonic
oscillators and the potential energy of the crystal reduces to


\[ \mathcal{U} = \frac{1}{2} \sum_{lk,\,l'k'} \sum_{\alpha\beta} \Phi_{\alpha\beta}(lk, l'k')\, u_\alpha(lk)\, u_\beta(l'k'). \tag{35} \]


The force constant matrix elements $\Phi_{\alpha\beta}(lk, l'k')$ have to obey a number of symmetry
requirements. First of all, the infinitesimal translational invariance of the
crystal (i.e., when all the atoms are equally displaced, there is no force on any
atom) leads to


\[ \sum_{l'k'} \Phi_{\alpha\beta}(lk, l'k') = 0 \tag{36} \]


The invariance of the potential energy $\mathcal{U}$ and its derivatives under symmetry operations,
such as translations, ensures $\Phi_{\alpha\beta}(lk, l'k') = \Phi_{\alpha\beta}(0k, (l'-l)k')$. The symmetry
operations leave invariant not only the potential and its derivatives but
also the undistorted lattice. Under these symmetry operations, therefore, not only
the numerical value of the potential energy but also the form of the expression for
it, Eq. (35), is invariant. This invariance implies that the force constants of the
ideal lattice do not depend on the absolute positions of the primitive cells
but only on their distance vector. The symmetry of the force constants is technically
very important because it significantly reduces the computational effort.

If we take the derivative of $\mathcal{U}$ with respect to the atomic displacement
$u_\alpha(lk)$, we get, up to a sign, the force $F^\alpha_{lk}$ acting on the atom (lk) in the direction $\alpha$. In the
absence of any external applied field (stress, electric or magnetic field) the equation
of motion reads


\[ F^\alpha_{lk} = m_k \ddot{u}_\alpha(lk) = -\sum_{l'k'\beta} \Phi_{\alpha\beta}(lk, l'k')\, u_\beta(l'k') \tag{37} \]


where $m_k$ is the mass of the kth atom and $\Phi$ is the interatomic force constant
matrix, also called the matrix of coupling parameters, defined in Eq. (34). In
fact, $-\Phi_{\alpha\beta}(lk, l'k')$ represents the force, in the direction $\alpha$, acting on the kth atom
of the lth unit cell due to a unit displacement of the k′th atom of the l′th unit
cell in the direction $\beta$ with all other atoms in their equilibrium positions.

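The equation of motion (37) can be written in a symmetrized form as


\[ \ddot{w}_\alpha(lk) = -\sum_{l'k'\beta} \mathcal{D}^{\alpha\beta}_{lk,\,l'k'}\, w_\beta(l'k') \tag{38} \]


where $w_\alpha(lk) = \sqrt{m_k}\, u_\alpha(lk)$ and the matrix


\[ \mathcal{D}^{\alpha\beta}_{lk,\,l'k'} = \frac{1}{\sqrt{m_k m_{k'}}} \frac{\partial^2 \mathcal{U}}{\partial u_\alpha(lk)\, \partial u_\beta(l'k')} \tag{39} \]


is called the dynamical matrix. The matrix $\mathcal{D}$ is real and symmetric. For the same
symmetry reasons as for the force constant matrix, the matrix elements $\mathcal{D}^{\alpha\beta}_{lk,\,l'k'}$ do
not depend on the absolute positions of the primitive cells but on their relative
distance. Thus $\mathcal{D}^{\alpha\beta}_{lk,\,l'k'} = \mathcal{D}^{\alpha\beta}_{kk'}(\mathbf{r}_{l'} - \mathbf{r}_l) = \mathcal{D}^{\alpha\beta}_{kk'}(l' - l)$.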

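To solve the system of Eqs. (38), we try a solution in the form of plane waves:


\[ w_\alpha(lk) = e_{k\alpha}(\mathbf{q}) \exp[i(\mathbf{q} \cdot \mathbf{r}_l - \omega t)] \tag{40} \]

Inserting Eq. (40) in Eq. (38), we get


\[ \omega^2 e_{k\alpha}(\mathbf{q}) = \sum_{l'k'\beta} \mathcal{D}^{\alpha\beta}_{kk'}(l' - l) \exp[i \mathbf{q} \cdot (\mathbf{r}_{l'} - \mathbf{r}_l)]\, e_{k'\beta}(\mathbf{q}). \tag{41} \]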


As we can see, the plane waves make it possible to simplify the problem from a
coupled system of 3pN differential equations to N systems of 3p linear equations.
For a crystal with p atoms per unit cell and for each value of q, there are 3p solutions
with eigenvalues $\omega_j^2(\mathbf{q})$ and eigenvectors $\mathbf{e}_k(j, \mathbf{q})$ to this problem. A complete
solution of the eigenproblem of Eq. (41) is sought in terms of $\omega = \omega_j(\mathbf{q})$,
called the phonon dispersion relation for eigenvalues, and $\mathbf{e} = \mathbf{e}_j(\mathbf{q})$ (also denoted
$\mathbf{e} = \mathbf{e}(j, \mathbf{q})$), called the dispersion relation for eigenvectors. An eigenmode with
frequency $\omega_j$ and eigenvector $\mathbf{e}_k(j, \mathbf{q})$ is called the jth normal mode, or phonon,
of the system. The graphical representation of the 3p functions $\omega_j(\mathbf{q})$ as a function
of q is called the vibrational dispersion spectrum of the system. The index
j = 1, 2, 3, …, 3p stands for the phonon branch. Since the dynamical matrix is
real and symmetric, the eigenvalues are real. The harmonic system is stable if
all eigenvalues are positive. It is sometimes convenient to use complex eigenvectors
instead of real ones. For example, in a translationally invariant lattice, the
normal modes are plane waves, which are mostly characterized by eigenvectors
of the form $\exp(i \mathbf{q} \cdot \mathbf{r})$. The fact that the atomic displacement is real requires the
condition $\mathbf{e}^*_k(j, \mathbf{q}) = \mathbf{e}_k(j, -\mathbf{q})$; $\omega_j(\mathbf{q}) = \omega_j(-\mathbf{q})$. Orthogonal eigenvectors can be
chosen:


\[ \sum_{k\alpha} e^*_{k\alpha}(j, \mathbf{q})\, e_{k\alpha}(j', \mathbf{q}) = \delta_{jj'} \tag{42} \]

and

\[ \sum_j e^*_{k\alpha}(j, \mathbf{q})\, e_{k'\alpha'}(j, \mathbf{q}) = \delta_{kk'} \delta_{\alpha\alpha'} \tag{43} \]


Out of the 3p phonon branches, there will be three acoustic branches such that
$\omega(\mathbf{q}) \to 0$ when $q \to 0$ and 3p − 3 optical branches such that $\omega(\mathbf{q}) \to$ constant
$\neq 0$ when $q \to 0$. Atomic vibrations corresponding to any of the branches, acoustic
or optical, can be longitudinal, such that $\mathbf{e} \parallel \mathbf{q}$, transverse, such that
$\mathbf{e} \perp \mathbf{q}$, or a mixture of longitudinal and transverse. The elastic continuum theory
shows that in an isotropic crystal, it is always possible to construct three mutually
independent polarization modes for a given q: $\mathbf{e}_L \parallel \mathbf{q}$ and $\mathbf{e}_{T_1} \perp \mathbf{e}_{T_2} \perp \mathbf{q}$. In an
anisotropic crystal, a clear relationship between e and q does not exist, except
when q is along a high-symmetry direction. For example, in cubic crystals the
concept of pure longitudinal and transverse polarization modes is defined only
when q is along the symmetry directions [100], [110], and [111]. In tetragonal
crystals even the [111] branches become mixed.

Fig. 4.8 Phonon dispersion for [001], [110], and [111] directions in $L1_0$
ordered FePd at 300 K (a) and in disordered FePd at 1020 K (b). Solid
and broken curves indicate dispersion curves of longitudinal and
transverse branches. The thin solid curves indicate branches of mixed
polarization.


Figure 4.8 shows a vibrational spectrum of long-range ordered $L1_0$ FePd (see
insert of Fig. 4.2a) measured at room temperature by inelastic neutron scattering
along the [001], [110], and [111] directions (Mehaddene et al. 2004). The underlying
lattice of the $L1_0$ structure is tetragonal with two atoms per unit cell; thus, we
get six phonon branches (3 × 2 = 6), three acoustic and three optical, for each
value of q. Longitudinal and transverse branches are displayed as thick solid and
dashed curves, respectively. In crystals of tetragonal structure, however, some of
the branches are mixed in character, and those branches are plotted as thin solid
curves. In some high-symmetry directions such as [001] and [110], all branches
are purely either longitudinal or transverse. In the [001] direction, the transverse
branches are doubly degenerate, as in the case of cubic crystals. The vibrational
spectrum is much simpler in the same FePd sample measured at
1020 K (Fig. 4.8b), where the structure is disordered. In the disordered state, all
atoms are equivalent; the underlying lattice is thus an fcc Bravais lattice with a
single atom per unit cell, giving rise to only three normal modes for each q, two
transverse (degenerate along [100] and [111]) and one longitudinal.


4.3.1.2 Diatomic Linear Chain 

The essential features of the lattice dynamics can be understood most simply
by studying dimensionally reduced systems such as planes or atomic chains.
The latter case is equivalent to looking along a given direction of space
in the three-dimensional system. In the case of FePd, for instance, the [011] direction
can be viewed as a linear chain with a basis of two atoms with masses $m_1$
and $m_2$. Considering that each plane interacts only with its nearest-neighbor
planes and that the force constants are identical between all pairs of nearest-neighbor
planes, the system can be modeled by an infinite chain of 2N atoms
forming N unit cells, each of length 2a (Fig. 4.9a), where a is related to the lattice
parameters of the $L1_0$ structure by $a = \sqrt{a_{L1_0}^2 + c_{L1_0}^2}/2$.

Fig. 4.9 (a) Diatomic linear chain with alternating $m_1$ and $m_2$ masses.
The unit cell has length 2a. (b) Phonon dispersion spectrum of a
diatomic chain showing the acoustic and the optical branches. The gap
in between is due to the different atomic masses ($m_1 < m_2$) in the unit
cell.

Suppose that, at a particular time, the neighboring 2nth and (2n + 1)th atoms in
the chain have displacements $u_{2n}$ and $u_{2n+1}$ from their equilibrium positions, respectively.
The equations of motion then read, in the harmonic approximation:


\[ m_1 \ddot{u}_{2n} = k\,[u_{2n+1} + u_{2n-1} - 2u_{2n}] \tag{44} \]


\[ m_2 \ddot{u}_{2n+1} = k\,[u_{2n+2} + u_{2n} - 2u_{2n+1}] \tag{45} \]


where k is the force constant between the atoms $m_1$ and $m_2$. Let us look for solutions
of the form


\[ u_{2n} = e_1 \exp[i(2nqa - \omega t)] \tag{46} \]


\[ u_{2n+1} = e_2 \exp\{i[(2n+1)qa - \omega t]\} \tag{47} \]

Combining Eqs. (44)–(47), we get the coupled eigenvalue equations


\[ -\omega^2 m_1 e_1 = k\,[e_2 e^{iqa} + e_2 e^{-iqa} - 2e_1] \tag{48} \]

\[ -\omega^2 m_2 e_2 = k\,[e_1 e^{iqa} + e_1 e^{-iqa} - 2e_2] \tag{49} \]


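Nontrivial solutions to this system are found by solving the secular equation
$|\mathcal{D}_{ij} - \omega^2 \delta_{ij}| = 0$, where $\mathcal{D}_{ij}$ is the (2 × 2) dynamical matrix given by


\[ \mathcal{D} = \begin{pmatrix} 2k/m_1 & -(2k/m_1) \cos qa \\ -(2k/m_2) \cos qa & 2k/m_2 \end{pmatrix} \tag{50} \]

The solutions are

\[ \omega^2 = k\, \frac{m_1 + m_2}{m_1 m_2} \pm k \left[ \left( \frac{m_1 + m_2}{m_1 m_2} \right)^2 - \frac{4 \sin^2 qa}{m_1 m_2} \right]^{1/2} \tag{51} \]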


The two signs in Eq. (51) correspond to two branches of the phonon dispersion
curve. From Eqs. (48) and (49), the ratio of the vibration amplitudes $e_1$ and $e_2$ is


\[ \frac{e_1}{e_2} = \frac{2k \cos qa}{2k - m_1 \omega^2} = \frac{2k - m_2 \omega^2}{2k \cos qa} \tag{52} \]


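For small q values (long wavelength), we have $\cos qa \approx 1 - \frac{1}{2} q^2 a^2 + \cdots$ The two
roots are


\[ \omega^2 = 2k \left( \frac{1}{m_1} + \frac{1}{m_2} \right) \quad \text{and} \quad \omega^2 = \frac{2k}{m_1 + m_2}\, q^2 a^2 \tag{53} \]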
which correspond to the upper and lower branches in Fig. 4.9(b), respectively. The
upper branch in Fig. 4.9(b) corresponds to a negative ratio $e_1/e_2$, meaning that
the two atoms are moving in opposite directions. Such a mode can be excited by
an electric field of the appropriate frequency in the case of ionic crystals; this
branch is thus called an optical branch. The lower branch has a positive ratio
$e_1/e_2$, equal to 1 at small q, meaning that the two atoms in the unit cell are moving
in phase with each other. This is a characteristic of a sound wave. Hence
the lower branch is called an acoustic branch. We can deduce from the above results
the shape of the dispersion curve in the case of monoatomic chains. When
$m_2 \to m_1$, the gap between the optical and the acoustic branch disappears at the
zone boundary and the upper branch can be unfolded to cover the region $\pi/2a$ to
$\pi/a$ and $-\pi/2a$ to $-\pi/a$, which is indeed the Brillouin zone of the monoatomic
lattice.
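
The behavior of the two branches can be explored numerically. The Python sketch below evaluates the dispersion relation of Eq. (51) and the amplitude ratio of Eq. (52) for a diatomic chain. It is an illustration under assumed parameters: the masses are the atomic masses of Fe and Pd in atomic mass units, while k = 1 and a = 1 are arbitrary units and are not fitted to FePd data; the function names are hypothetical.

```python
import math


def chain_frequencies_squared(q, k, m1, m2, a):
    """Return (acoustic, optical) values of omega**2 from Eq. (51)."""
    s = (m1 + m2) / (m1 * m2)
    # max(..., 0) guards against tiny negative values caused by rounding.
    root = math.sqrt(max(s * s - 4.0 * math.sin(q * a) ** 2 / (m1 * m2), 0.0))
    return k * (s - root), k * (s + root)


def amplitude_ratio(q, k, m1, a, omega_sq):
    """e1/e2 from the first form of Eq. (52)."""
    return 2.0 * k * math.cos(q * a) / (2.0 * k - m1 * omega_sq)


# Assumed illustrative parameters (arbitrary units for k and a).
K_FORCE, M1, M2, A = 1.0, 55.845, 106.42, 1.0

# Long-wavelength limit: acoustic atoms move in phase, optical in antiphase.
q_small = 1.0e-3
acoustic, optical = chain_frequencies_squared(q_small, K_FORCE, M1, M2, A)
ratio_acoustic = amplitude_ratio(q_small, K_FORCE, M1, A, acoustic)
ratio_optical = amplitude_ratio(q_small, K_FORCE, M1, A, optical)

# Zone boundary of the diatomic chain, q = pi / 2a.
q_edge = math.pi / (2.0 * A)
edge_acoustic, edge_optical = chain_frequencies_squared(q_edge, K_FORCE, M1, M2, A)

print(f"small q: e1/e2 = {ratio_acoustic:.6f} (acoustic), {ratio_optical:.4f} (optical)")
print(f"zone boundary: omega^2 = {edge_acoustic:.6f} and {edge_optical:.6f}")
```

At the zone boundary $q = \pi/2a$ the two branches of Eq. (51) take the values $\omega^2 = 2k/m_2$ and $\omega^2 = 2k/m_1$ (for $m_1 < m_2$), so the gap closes only when the two masses become equal.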


4.3.1.3 Quantum Theory 

So far we have treated the atomic vibrations within classical mechanics. For many
applications, however, quantum properties are needed. In this case, the energy
function [Eq. (35)] is the potential part of the Hamiltonian of the system, $\mathcal{H}$.
The vibrational wavefunctions can then be derived by the standard methods used to
treat the harmonic oscillator. The Hamiltonian of the system in the harmonic approximation
reads:


\[ \mathcal{H} = \sum_{lk} \frac{p^2(lk)}{2m_k} + \frac{1}{2} \sum_{lk,\,l'k'} \sum_{\alpha\beta} \Phi_{\alpha\beta}(lk, l'k')\, u_\alpha(lk)\, u_\beta(l'k') \tag{54} \]


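where p(lk) is the momentum operator of the kth atom in the lth unit cell.
The form (54) of the Hamiltonian is quite complicated; it can be simplified by
introducing normal coordinate operators X(q) and P(q), Fourier transforms of
the coordinates u and p:


\[ \mathbf{u}(lk) = \frac{1}{\sqrt{V}} \sum_{\mathbf{q}} \mathbf{X}(\mathbf{q}, k) \exp(i \mathbf{q} \cdot \mathbf{r}_l) \tag{55} \]

\[ \mathbf{p}(lk) = \frac{1}{\sqrt{V}} \sum_{\mathbf{q}} \mathbf{P}(\mathbf{q}, k) \exp(-i \mathbf{q} \cdot \mathbf{r}_l) \tag{56} \]


where V is the volume of the crystal. Since u(lk) and p(lk) are Hermitian, we
have


\[ \mathbf{X}^\dagger(\mathbf{q}, k) = \mathbf{X}(-\mathbf{q}, k) = \frac{1}{\sqrt{V}} \sum_l \mathbf{u}(lk) \exp(i \mathbf{q} \cdot \mathbf{r}_l) \tag{57} \]

\[ \mathbf{P}^\dagger(\mathbf{q}, k) = \mathbf{P}(-\mathbf{q}, k) = \frac{1}{\sqrt{V}} \sum_l \mathbf{p}(lk) \exp(-i \mathbf{q} \cdot \mathbf{r}_l) \tag{58} \]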


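i.e., the new coordinate operators are non-Hermitian. These operators satisfy the
commutation relations:


\[ [\mathbf{X}(\mathbf{q}, k), \mathbf{P}(\mathbf{q}', k')] = i\hbar\, \delta_{\mathbf{q}\mathbf{q}'} \delta_{kk'}\, \mathbf{I} \tag{59} \]


where I is the identity operator and $\hbar = h/2\pi$ is the reduced Planck constant. The
substitution of Eqs. (55) and (56) into Eq. (54) gives


\[ \begin{aligned}
\mathcal{H} = {} & \frac{1}{V} \sum_{\mathbf{q}\mathbf{q}'} \sum_{lk} \frac{\mathbf{P}(\mathbf{q}, k) \cdot \mathbf{P}(\mathbf{q}', k)}{2m_k} \exp[-i(\mathbf{q} + \mathbf{q}') \cdot \mathbf{r}_l] \\
& + \frac{1}{2V} \sum_{\mathbf{q}\mathbf{q}'} \sum_{lk,\,l'k'} \sum_{\alpha\beta} \Phi_{\alpha\beta}(lk, l'k')\, X_\alpha(\mathbf{q}, k)\, X_\beta(\mathbf{q}', k') \exp[i(\mathbf{q} \cdot \mathbf{r}_l + \mathbf{q}' \cdot \mathbf{r}_{l'})]
\end{aligned} \tag{60} \]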


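The Hamiltonian (60) can be simplified by performing the summation over l
in the first term, introducing h = l′ − l, and defining in the second term
$\Phi_{\alpha\beta}(kk', \mathbf{q})$:


\[ \Phi_{\alpha\beta}(kk', \mathbf{q}) = \Phi^*_{\alpha\beta}(kk', -\mathbf{q}) = \sum_h \Phi_{\alpha\beta}(0k, hk') \exp(-i \mathbf{q} \cdot \mathbf{r}_h) \tag{61} \]


The Hamiltonian can then be written


\[ \mathcal{H} = \sum_{\mathbf{q}k} \frac{\mathbf{P}(\mathbf{q}, k) \cdot \mathbf{P}^\dagger(\mathbf{q}, k)}{2m_k} + \frac{1}{2} \sum_{\mathbf{q}} \sum_{kk'\alpha\beta} \Phi_{\alpha\beta}(kk', \mathbf{q})\, X_\alpha(\mathbf{q}, k)\, X^\dagger_\beta(\mathbf{q}, k') \tag{62} \]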


At this stage, the crystal Hamiltonian is viewed in terms of the coordinates
$\mathbf{X}(\mathbf{q}, k)$ and the momenta $\mathbf{P}(\mathbf{q}, k)$ of pN atoms, coupled by a set of harmonic force
constants $\Phi(kk', \mathbf{q})$. Thus for each value of q the problem of finding the normal
modes of the system is equivalent to finding the eigenstates of the Hamiltonian.
For this purpose, following Section 4.3.1.1, we introduce the polarization vector
$\mathbf{e}_k(j, \mathbf{q})$ to represent the magnitude and direction of vibration mode (j, q), where
j denotes the polarization branch. The $\mathbf{e}_k(j, \mathbf{q})$ obey the orthogonality relation
(42). With the introduction of $\mathbf{e}_k(j, \mathbf{q})$, we perform another set of normal coordinate
transformations:


\[ X(j, \mathbf{q}) = \sum_k \sqrt{m_k}\, \mathbf{e}^*_k(j, \mathbf{q}) \cdot \mathbf{X}(\mathbf{q}, k) \quad \text{and} \quad P(j, \mathbf{q}) = \sum_k \frac{1}{\sqrt{m_k}}\, \mathbf{e}_k(j, \mathbf{q}) \cdot \mathbf{P}(\mathbf{q}, k) \tag{63} \]


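We introduce the phonon annihilation and creation operators defined respectively
by Eqs. (64) and (65):


\[ b_{j,\mathbf{q}} = \frac{1}{\sqrt{2\hbar\omega_j(\mathbf{q})}}\, P(j, \mathbf{q}) - i \sqrt{\frac{\omega_j(\mathbf{q})}{2\hbar}}\, X^\dagger(j, \mathbf{q}) \tag{64} \]


\[ b^\dagger_{j,\mathbf{q}} = \frac{1}{\sqrt{2\hbar\omega_j(\mathbf{q})}}\, P^\dagger(j, \mathbf{q}) + i \sqrt{\frac{\omega_j(\mathbf{q})}{2\hbar}}\, X(j, \mathbf{q}) \tag{65} \]


It can be verified that these operators obey the commutation relations
$[b_{j,\mathbf{q}}, b^\dagger_{j',\mathbf{q}'}] = \delta_{\mathbf{q}\mathbf{q}'} \delta_{jj'}$. Equation (64) can be inverted. Using $\omega_j(\mathbf{q}) = \omega_j(-\mathbf{q})$ and Eqs. (58) and
(63), we have:


\[ \mathbf{X}(\mathbf{q}, k) = -i \sum_j \sqrt{\frac{\hbar}{2 m_k \omega_j(\mathbf{q})}}\, \mathbf{e}_k(j, \mathbf{q}) \left( b^\dagger_{j,\mathbf{q}} - b_{j,-\mathbf{q}} \right) \tag{66} \]

\[ \mathbf{P}(\mathbf{q}, k) = \sum_j \sqrt{\frac{m_k \hbar \omega_j(\mathbf{q})}{2}}\, \mathbf{e}^*_k(j, \mathbf{q}) \left( b_{j,\mathbf{q}} + b^\dagger_{j,-\mathbf{q}} \right) \tag{67} \]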


We now have expressions for the coordinate and momentum vectors $\mathbf{X}(\mathbf{q}, k)$ and
$\mathbf{P}(\mathbf{q}, k)$ in terms of the phonon creation and annihilation operators and the polarization
vectors. If we substitute Eqs. (66) and (67) in Eq. (62), we get a simple expression of
the Hamiltonian:


\[ \mathcal{H} = \sum_{\mathbf{q}, j} \hbar \omega_j(\mathbf{q}) \left( b^\dagger_{j,\mathbf{q}} b_{j,\mathbf{q}} + \frac{1}{2} \right) \tag{68} \]


At this stage we have expressed the Hamiltonian of the system as a sum of 3pN
Hamiltonians of independent harmonic oscillators. The eigenvalues of H are
well known from the quantum treatment of the harmonic oscillator, which can
be found in many textbooks (Cohen-Tannoudji et al. 1977). Let us denote by $|n_{jq}\rangle$
an eigenstate which has n phonons of wave vector q and polarization j. The effects
of the operators $b_{j,q}$, $b^{\dagger}_{j,q}$, and $b^{\dagger}_{j,q}b_{j,q}$ on the state $|n_{jq}\rangle$ are given as follows:


$$\begin{aligned}
b^{\dagger}_{j,q}\,|n_{jq}\rangle &= \sqrt{n_{jq}+1}\;|n_{jq}+1\rangle \\
b_{j,q}\,|n_{jq}\rangle &= \sqrt{n_{jq}}\;|n_{jq}-1\rangle \\
b^{\dagger}_{j,q}b_{j,q}\,|n_{jq}\rangle &= n_{jq}\,|n_{jq}\rangle
\end{aligned} \tag{69}$$


$b^{\dagger}_{j,q}b_{j,q}$ is a phonon number operator which gives the number of phonons in the
(j,q) state. Using Eqs. (68) and (69), we get


$$H\,|n\rangle = \sum_{j,q}\hbar\omega_j(q)\left(n_{jq} + \frac{1}{2}\right)|n\rangle = \sum_{j,q}\varepsilon_{jq}\,|n\rangle \tag{70}$$


and the average energy of the phonons in the mode (j,q): $\bar{\varepsilon}_{jq} = \hbar\omega_j(q)\,\bar{n}_{jq}$, where
the thermal average $\bar{n}_{jq}$ is the Bose-Einstein distribution function. The term 1/2 in
Eq. (70) is due to the zero-point energy.
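
The ladder-operator relations of Eq. (69) can be checked numerically for a single mode (j,q). The following is a minimal sketch, assuming a number basis truncated at n_max phonons; the function names and the truncation are illustrative choices and do not come from the text.

```python
import numpy as np

def ladder_operators(n_max):
    """Matrices of b and b_dagger in the number basis |0>, ..., |n_max>.

    The basis is truncated at n_max (an assumption of this sketch), so the
    commutator [b, b_dagger] = 1 only holds below the truncation edge.
    """
    n = np.arange(1, n_max + 1)
    b = np.diag(np.sqrt(n), k=1)      # b|n> = sqrt(n)|n-1>
    b_dag = b.T.copy()                # b_dagger|n> = sqrt(n+1)|n+1>
    return b, b_dag

def mode_energies(hbar_omega, n_max):
    """Eigenvalues hbar*omega*(n + 1/2) of one oscillator term of Eq. (68)."""
    b, b_dag = ladder_operators(n_max)
    number = b_dag @ b                # phonon number operator
    hamiltonian = hbar_omega * (number + 0.5 * np.eye(n_max + 1))
    return np.linalg.eigvalsh(hamiltonian)
```

The number operator comes out diagonal with entries 0, 1, ..., n_max, and the energies are equally spaced by hbar_omega above the zero-point value hbar_omega/2.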


4.3.1.4 Phonon Density of States 

In the following sections, we will perform sums or integrals of functions of the 
dispersion over the crystal momentum state q within the reciprocal lattice. How- 
ever, the translational symmetry of the crystal often greatly reduces the set of 
points for the summation. Let us once more consider a three-dimensional crystal
with N = L³ unit cells, where L is the linear size of the crystal along x, y, and z. If
we apply the periodic boundary conditions along the three directions, we seek
phonon states of the type $\psi(r) = A\exp[i(q\cdot r - \omega t)]$ obeying $\psi(r) = \psi(r + r_i)$
where $r_i = La_i$. Thus


$$q_x,\ q_y,\ q_z = 0;\ \pm\frac{2\pi}{L};\ \pm\frac{4\pi}{L};\ \dots;\ \frac{N\pi}{L} \tag{71}$$


i.e., the allowed q values form a cubic mesh in q space, with one q confined to a
volume $(2\pi/L)^3$. Therefore, for each phonon polarization index, a unit volume in
the q space contains $(L/2\pi)^3$ values of q. $(L/2\pi)^3$ is called the “phonon density of
states,” DOS, in the q space and gives the number of vibrational states or normal
modes between wave vectors q and q + dq:


$$g(q)\,dq = \frac{V}{(2\pi)^3}\,dq \tag{72}$$


where V = L³ is the volume of the crystal. Equivalently, we can define g(ω), the
number of normal modes which have a frequency between ω and ω + dω. The
partial phonon density of states of the branch j is


$$g(\omega_j) = \frac{V}{(2\pi)^3}\int\frac{dS_\omega}{|\nabla_q\omega_j|}$$


To get the total density of states, we sum over all branches:


$$g(\omega) = \sum_j g(\omega_j) = \frac{V}{(2\pi)^3}\sum_j\int\frac{dS_\omega}{|\nabla_q\omega_j|} \tag{73}$$


$v_g = \nabla_q\omega_j$ is the group velocity. For some q values, $v_g$ approaches zero and leads
to $g(\omega) \to \infty$. Such points in the q space are called critical points and the
singularities in the density of states are known as van Hove singularities.
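
The pile-up of modes where the group velocity vanishes can be illustrated with a histogram estimate of g(ω). The sketch below is not the three-dimensional surface integral of Eq. (73): it assumes a one-dimensional monatomic chain with the nearest-neighbour dispersion ω(q) = ω_max|sin(qa/2)|, chosen only because its allowed q values are easy to enumerate. All names and parameter values are illustrative.

```python
import numpy as np

def chain_frequencies(n_cells, omega_max=1.0, a=1.0):
    """Frequencies of a monatomic linear chain (illustrative 1D model).

    Uses omega(q) = omega_max*|sin(q*a/2)| on the n_cells allowed q values
    of the first Brillouin zone given by periodic boundary conditions.
    """
    m = np.arange(-n_cells // 2, n_cells // 2)
    q = 2.0 * np.pi * m / (n_cells * a)
    return omega_max * np.abs(np.sin(q * a / 2.0))

def density_of_states(omegas, n_bins, omega_max):
    """Histogram estimate of g(omega): number of modes per frequency bin."""
    counts, edges = np.histogram(omegas, bins=n_bins, range=(0.0, omega_max))
    centers = 0.5 * (edges[:-1] + edges[1:])
    return centers, counts

centers, counts = density_of_states(chain_frequencies(10000), 50, 1.0)
```

In this model the group velocity goes to zero at the zone boundary, so the bin next to ω_max collects far more modes than the bin next to ω = 0, which is the one-dimensional analogue of a van Hove singularity.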

The phonon density of states is one of the most important quantities in lattice 
dynamical studies. Its determination, however, requires the calculation of the in- 
tegral which appears in Eq. (73) over the whole Brillouin zone. In general, such 
an integration requires knowledge of phonon frequencies of wave vectors q in the 
entire Brillouin zone. However, in most experimental determinations of the nor- 
mal mode of vibration such a task is not feasible with regard to the time and to 
the experimental effort required. In practice we proceed by an interpolation of the 
normal modes of vibration using some phenomenological models. Furthermore, 
the point symmetry of the system is taken into account to reduce the phonon fre- 
quency calculation to wave vectors within the irreducible part of the Brillouin 
zone. 

Figure 4.10 shows an example of the phonon density of states in L1₀-ordered
FePd at 300 K and in the fcc disordered state at 1020 K (Mehaddene et al. 2004).
The occurrence of van Hove singularities is due, as explained below, to the
maxima in the dispersion curves, which are numerous in the ordered phase because
of the greater number of branches (Fig. 4.8a). Actually, in the fcc disordered
state it is easy to link the van Hove singularities to the responsible phonon
modes in the dispersion curve, which are mainly due to the zone boundary
modes. The cut-off frequency decreases from 7.5 THz at room temperature to
6.8 THz at 1020 K, reflecting the increasing anharmonicity and the softening of
the lattice at high temperature. The shape of the DOS at 1020 K is common to all
fcc crystals.

Fig. 4.10 Phonon DOS of FePd in the L1₀ ordered state at 300 K and in
the disordered state at 1020 K.


4.3.1.5 Lattice Specific Heat 

The heat capacity at constant volume is an important quantity when dealing with
the thermal properties of alloys. It defines the change in the internal energy $\mathcal{U}$ of
the system when the temperature T is increased by an infinitesimal amount:


$$C_V = \left(\frac{\partial\mathcal{U}}{\partial T}\right)_V \tag{74}$$


At this stage, we should keep in mind that the phonons are not the only contribution to the
internal energy $\mathcal{U}$. The electrons, as well as the magnetic moments, also contribute
to $\mathcal{U}$, but their dependence on T is different. In the following, we derive an
expression for the phonon contribution to $C_V$.

We consider first the high-temperature limit. According to the equipartition
theorem, which holds at high temperature, the thermal energy of a given system
is simply given by $\frac{1}{2}k_BT$ per quadratic term in the Hamiltonian ($\frac{1}{2}k_BT$ each for the
kinetic and potential energies). In the case of a crystal with pN atoms, each of
which is free to move in the three directions of space, we get $\mathcal{U} = 3pNk_BT$ and
$C_V = 3pNk_B$. The molar specific heat of any solid at high temperature is thus
$C_V^{mol} = 3N_Ak_B = 3R$, which is constant and independent of T. $N_A$ is Avogadro's
number and R is the ideal gas constant. This is known as the law of Dulong and
Petit.

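To derive an expression for $C_V$ at low temperatures, where the classical theory
no longer holds, we consider the phonons as a set of indistinguishable
particles that a priori can occupy any possible (j, q) state with energy $\hbar\omega_j(q)$. The
partition function of such a system is $Z = \sum_s\exp(-\beta\mathcal{U}_s)$ where $\beta = (k_BT)^{-1}$ and
the index s stands for the different states. The energy $\mathcal{U}_s$ of the phonon gas is
given by Eq. (68); we have


$$Z = \prod_{j,q}\sum_{n_{jq}=0}^{\infty}\exp\left[-\beta\hbar\omega_j(q)\left(n_{jq}+\frac{1}{2}\right)\right] = \prod_{j,q}\frac{\exp(-\beta\hbar\omega_j(q)/2)}{1-\exp(-\beta\hbar\omega_j(q))} \tag{75}$$

From statistical mechanics (Gallavotti, 1999) we know that the internal energy
can be deduced from the partition function as


$$\mathcal{U} = -\frac{\partial}{\partial\beta}\ln Z = \sum_{j,q}\frac{\hbar\omega_j(q)}{2} + \sum_{j,q}\bar{n}_{jq}\,\hbar\omega_j(q) \tag{76}$$


where the thermal average $\bar{n}_{jq}$ is the Bose-Einstein distribution function and
represents the average number of phonons in the state $(j, \omega_j(q))$:


$$\bar{n}_{jq} = \frac{1}{\exp(\beta\hbar\omega_j(q)) - 1} \tag{77}$$

From Eq. (74), we can deduce $C_V$:

$$C_V = \left(\frac{\partial\mathcal{U}}{\partial T}\right)_V = \sum_{j,q}\frac{(\hbar\omega_j(q))^2}{k_BT^2}\,\frac{\exp(\beta\hbar\omega_j(q))}{[\exp(\beta\hbar\omega_j(q)) - 1]^2} \tag{78}$$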


In the previous derivations, we have assumed a finite model crystal. For an
infinite periodic lattice, the allowed q values become dense and q becomes a continuous
variable. The summations over q are replaced by integrals:


$$\sum_q \;\to\; \frac{V}{(2\pi)^3}\int d^3q \;\to\; \int g(\omega)\,d\omega \tag{79}$$


The final expression for $C_V$ is


$$C_V = \int_0^{\infty} g(\omega)\,\frac{(\hbar\omega)^2}{k_BT^2}\,\frac{\exp(\beta\hbar\omega)}{[\exp(\beta\hbar\omega) - 1]^2}\,d\omega \tag{80}$$

If we now consider the high-temperature limit, we recover the Dulong and
Petit law obtained using the classical theory:


$$C_V \simeq k_B\int_0^{\infty} g(\omega)\,d\omega = 3pNk_B \tag{81}$$


The integral in Eq. (81) gives the total number of normal modes of vibration, 
which is equal to the total number of degrees of freedom of the crystal. 
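
The mode sum of Eq. (78) and its high-temperature limit can be evaluated directly once a list of mode frequencies is available. The sketch below assumes SI units and strictly positive angular frequencies; the function name and the test frequencies are hypothetical and serve only as an illustration.

```python
import numpy as np

HBAR = 1.054571817e-34   # reduced Planck constant, J s
K_B = 1.380649e-23       # Boltzmann constant, J/K

def mode_heat_capacity(omegas, temperature):
    """C_V of a set of harmonic modes, following the mode sum of Eq. (78).

    omegas: angular frequencies in rad/s (all > 0); temperature in K.
    Uses x^2 e^x/(e^x-1)^2 = x^2 e^(-x)/(1-e^(-x))^2 to avoid overflow
    at low temperature, with x = hbar*omega/(k_B*T).
    """
    x = HBAR * np.asarray(omegas, dtype=float) / (K_B * temperature)
    weight = x**2 * np.exp(-x) / np.expm1(-x) ** 2
    return K_B * weight.sum()
```

Each mode contributes a weight between 0 and 1 in units of k_B. The weight tends to 1 when k_BT is much larger than ħω, so the sum tends to the number of modes times k_B, which is the content of Eq. (81).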


4.3.1.6 Debye’s Model 

The Dulong and Petit law has been corroborated by many experimental
observations performed on metals and insulators as well as on semiconductors at high
temperature. In the low-temperature limit, the experimental observations show
that $C_V$ varies with T³ in insulators and with T in metals. The behavior in metals
is dominated by the electronic contribution. A satisfactory explanation of the
experimental results, for both high- and low-temperature variations of $C_V$ in
insulators, emerged from the model of Debye for the phonon density of states.

In 1912, Debye used the isotropic continuum approximation of the phonon
dispersion and considered the contributions from acoustic phonons in all
polarization branches with a frequency lower than a cut-off frequency $\omega_D$, which is called
the Debye frequency (Debye, 1912). In this approximation, the velocity of sound
is taken as constant and equal for all polarization types, as it would be for a
classical elastic continuum. Thus the dispersion relation reduces to


$$\omega = vq \tag{82}$$


where v is the constant velocity of sound. In the Debye model, considering a
uniform density in the reciprocal space and taking Eq. (82) as valid, the phonon
density of states reads


$$g_D(\omega) = \begin{cases} f\omega^2 & :\ \omega < \omega_D \\ 0 & :\ \omega > \omega_D \end{cases} \tag{83}$$


where f is a constant. The singularities in the real density of states are smeared
out. The value of the constant f is chosen so that the integration in ω gives the
total number of normal modes: we get $f = 9pN/\omega_D^3$.

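If we insert the expression for the Debye density of states (83) in Eq. (80), we
get the specific heat in the Debye model:


$$C_V = 9pNk_B\left(\frac{T}{\theta_D}\right)^3\int_0^{\theta_D/T}\frac{x^4e^x}{(e^x-1)^2}\,dx = 9pNk_B\left(\frac{T}{\theta_D}\right)^3\zeta(\theta_D/T) \tag{84}$$


where $x = \hbar\omega/k_BT$; $\theta_D = \hbar\omega_D/k_B$ is called the Debye temperature and ζ denotes
the integral, which is related to the Riemann “zeta” function. At high temperature, x ≪ 1, we get $C_V = 3pNk_B$, in
agreement with the Dulong and Petit law. At low temperature, x ≫ 1, $\theta_D/T \to \infty$
and $\zeta(\infty) = 4\pi^4/15$ give


$$C_V = \frac{36\pi^4}{15}\,pNk_B\left(\frac{T}{\theta_D}\right)^3$$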


The Debye model successfully explains both low- and high-temperature
dependence of the lattice specific heat of insulating crystals. It should be emphasized
that real crystals cannot be treated adequately as an elastic continuum and that
their atomic nature must be considered. This requires an accurate evaluation of
$C_V$ from g(ω) using a realistic lattice dynamical model. The Debye formula for
$C_V$ can nevertheless be used; $\theta_D$ is then treated as an empirical parameter in Eq.
(84). A usual way to choose the Debye temperature as a temperature-dependent
parameter $\theta_D(T)$ is to fit Eq. (84) to the measured heat capacity.
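
The Debye integral of Eq. (84) has no closed form at intermediate temperatures but is easy to evaluate numerically. The following is a minimal sketch using a midpoint rule; the function name, the number of integration steps, and the choice to return $C_V$ in units of $pNk_B$ are assumptions made for illustration.

```python
import numpy as np

def debye_heat_capacity(temperature, theta_d, n_steps=20000):
    """C_V/(p*N*k_B) from Eq. (84), integrated with the midpoint rule.

    theta_d is the Debye temperature; both arguments are in kelvin.
    The integrand x^4 e^x/(e^x-1)^2 is written as
    x^4 e^(-x)/(1-e^(-x))^2 so that large x does not overflow.
    """
    upper = theta_d / temperature
    h = upper / n_steps
    x = (np.arange(n_steps) + 0.5) * h            # midpoints avoid x = 0
    integrand = x**4 * np.exp(-x) / np.expm1(-x) ** 2
    return 9.0 * (temperature / theta_d) ** 3 * integrand.sum() * h
```

For T much larger than $\theta_D$ the returned value approaches 3 (Dulong and Petit), and for T much smaller than $\theta_D$ it follows the T³ law with the prefactor $36\pi^4/15$. Fitting $\theta_D$ to a measured heat capacity at each temperature, as described above, amounts to solving this function for theta_d.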


4.3.1.7 Elastic Waves in Cubic Crystals 

In the long-wavelength limit, q → 0, the displacement of the atoms will vary
slowly in space. In this limit, the acoustic phonons, for which the atoms move
in phase as shown in the case of the diatomic chain, correspond to sound waves,
which are usually described by differential equations on an elastic continuum
instead of the equation of motion (44) of lattice dynamics. In this limit, it can be shown that,
for cubic crystals, the equation of motion can be written (Kittel 1976) as:


$$\rho\,\ddot{u}_x = c_{11}\frac{\partial^2u_x}{\partial x^2} + c_{44}\left(\frac{\partial^2u_x}{\partial y^2} + \frac{\partial^2u_x}{\partial z^2}\right) + (c_{12}+c_{44})\left(\frac{\partial^2u_y}{\partial x\,\partial y} + \frac{\partial^2u_z}{\partial x\,\partial z}\right) \tag{85}$$





where ρ is the mass density and $u_x$ is the x component of the displacement u.
The corresponding equations of motion along y and z can be found by cyclic
permutation.

We can look for the solution corresponding to a longitudinal wave. For a wave
along [100], i.e., q ∥ [100], we can write: $u_x = u_0\exp[i(q_xx - \omega t)]$; $u_y = 0$; $u_z = 0$.
Substituting in Eq. (85), we get $\omega^2\rho = c_{11}q^2$. The velocity of a longitudinal wave in
the [100] direction is thus: $v_L[100] = \omega/q = (c_{11}/\rho)^{1/2}$.

To obtain the speed of a transverse wave, we can try either
$u_y = u_0\exp[i(q_xx - \omega t)]$; $u_x = 0$; $u_z = 0$ or $u_z = u_0\exp[i(q_xx - \omega t)]$; $u_x = 0$; $u_y = 0$.
On substitution in Eq. (85) we get the same velocity of a transverse wave in the
[100] direction, $v_T[100] = (c_{44}/\rho)^{1/2}$.

Thus, for a wave propagating along the x axis, we have a longitudinal wave with 
particle displacement along the x axis, and two identical but independent trans- 
verse waves with particle displacements along y and z. 

For a wave along [110], $q_x = q_y = q/\sqrt{2}$, a little algebra is required to
determine the different velocities. The longitudinal wave with atomic displacement
along [110] has the speed $v_L[110] = [(c_{11} + c_{12} + 2c_{44})/2\rho]^{1/2}$ and there are two
non-degenerate transverse waves with the speeds $v_{T_1}[110] = (c_{44}/\rho)^{1/2}$;
$v_{T_2}[110] = [(c_{11} - c_{12})/2\rho]^{1/2}$, which correspond to atomic displacements along [001] and
$[1\bar{1}0]$ respectively. The wave propagation in the [110] direction is of special
interest because the three cubic elastic constants, $c_{11}$, $c_{12}$, and $c_{44}$, can be deduced from
the propagation velocities in this direction: $c_{44}$, for example, can be obtained from
the slope at the origin of the Brillouin zone of the phonon dispersion curves in
the [110] direction or it can be measured by ultrasonic resonance spectrometry.

For a wave along [111], $q_x = q_y = q_z = q/\sqrt{3}$, the longitudinal wave has the
speed $v_L[111] = \omega/q = [(c_{11} + 2c_{12} + 4c_{44})/3\rho]^{1/2}$. The two transverse waves have
the identical speed $v_{T_1}[111] = v_{T_2}[111] = [(c_{11} - c_{12} + c_{44})/3\rho]^{1/2}$ and correspond
to displacements along $[1\bar{1}0]$ and $[11\bar{2}]$.
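
The velocities quoted above for [100], [110], and [111] can be cross-checked without doing the algebra by hand. Substituting a plane wave into Eq. (85) and its y and z counterparts gives a 3×3 eigenvalue problem (the Christoffel matrix) whose eigenvalues are $\rho v^2$. The sketch below assumes this standard continuum formulation; the function name and the elastic constants used in any call are hypothetical.

```python
import numpy as np

def sound_velocities(direction, c11, c12, c44, rho):
    """Sound velocities (ascending) in a cubic crystal along `direction`.

    Builds the Christoffel matrix for plane-wave solutions of Eq. (85) and
    its y, z counterparts; its eigenvalues are rho*v^2.
    """
    n = np.asarray(direction, dtype=float)
    n = n / np.linalg.norm(n)                      # unit propagation vector
    gamma = (c12 + c44) * np.outer(n, n)           # off-diagonal terms
    for i in range(3):
        # diagonal: c11*n_i^2 + c44*(n_j^2 + n_k^2)
        gamma[i, i] = c11 * n[i] ** 2 + c44 * (1.0 - n[i] ** 2)
    return np.sqrt(np.linalg.eigvalsh(gamma) / rho)
```

Calling the function with direction (1, 1, 0) returns the two transverse velocities followed by the longitudinal one, which can be compared with the closed-form expressions in the text. For a general direction the eigenvectors are neither parallel nor perpendicular to the propagation vector.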

In directions other than [100], [110], and [111], the waves in a cubic crystal
cannot be interpreted as purely longitudinal or purely transverse unless the material
is isotropic.


4.3.1.8 Vibrational Entropy

From a structural point of view, lattice vibrations or phonons can be viewed as 
defects inducing a disorder in a perfect crystal. This disorder can be quantified 
through the vibrational entropy, which plays an important role in phase stability. 
To understand the effect of lattice vibrations on phase stability, it is useful to write
the free energy of a phase $F = \mathcal{U} - TS$ as the sum of its configurational and
vibrational parts:


$$F = \mathcal{U}_{conf} + \mathcal{U}_{vib} - T(S_{conf} + S_{vib}) \tag{86}$$


Doing this, we neglect any other contribution, such as magnetic and electronic
entropies. From statistical mechanics, we know that the vibrational entropy $S_{vib}$
is the negative temperature derivative of the vibrational free energy (Dalvit et al.
1999), $S_{vib} = -\partial F_{vib}/\partial T$, where $F_{vib} = -k_BT\ln Z$ is the free energy of the system
and Z the partition function, given in the case of a phonon gas by Eq. (75). The
vibrational entropy per atom in the harmonic approximation reads


$$S_{vib} = -3k_B\int g(\omega)\left[\bar{n}_{jq}(\omega)\ln(\bar{n}_{jq}(\omega)) - (1 + \bar{n}_{jq}(\omega))\ln(1 + \bar{n}_{jq}(\omega))\right]d\omega \tag{87}$$


where $\bar{n}_{jq}(\omega) = [\exp(\beta\hbar\omega_j(q)) - 1]^{-1}$. Basically, the vibrational contribution
$F_{vib} = \mathcal{U}_{vib} - TS_{vib}$ has two distinct effects: change in the phase stability of an alloy or
ordered compound in a given configuration with respect to the pure solid constit- 
uents (formation energy), and change in the stability of ordered compound with 
respect to the random alloy at the same composition (ordering energy). The first 
effect will affect the free energy of formation and may lead to changes in the 
structure. Previous studies have shown that in phase-separating alloys such as 
GaP-InP (Silverman et al. 1995), Cu-Ag (Sanchez et al. 1991; Asta and Foiles 
1996), or Ni-Cr (Craivich and Sanchez 1997), one often finds a vibrational stabili- 
zation of compounds, leading to increased solubilities and lowered miscibility 
gap temperatures. The second effect will affect the order-disorder transition
temperature. One can obtain an expression that relates the transition temperature
with configurational entropy only ($T^{p_1\to p_2}_{conf}$) to the transition temperature with
configurational and vibrational entropy ($T^{p_1\to p_2}_{conf+vib}$) by writing the free energy for
phases p₁ and p₂, with and without vibrations, and solving the two sets of
equations. This final expression, which does not include thermal expansion effects, is
(Ozolins et al. 1998):


$$T^{p_1\to p_2}_{conf+vib} = T^{p_1\to p_2}_{conf}\left(1 + \frac{\Delta S^{p_1\to p_2}_{vib}}{\Delta S^{p_1\to p_2}_{conf}}\right)^{-1} \tag{88}$$

As expected, $\Delta S^{p_1\to p_2}_{vib}/\Delta S^{p_1\to p_2}_{conf}$ determines the effect of lattice vibrations on
calculated phase diagrams. The configurational entropy per atom for a binary solid
solution depends on the state of short-range order. Its maximum value for an ideal
(random) solution is

$$S_{conf} = -k_B[c\ln(c) + (1 - c)\ln(1 - c)] \tag{89}$$


The maximum value of $\Delta S_{conf}$ is 0.693$k_B$/atom: this value occurs when a fully
ordered state ($S_{conf} = 0$) at c = 0.5 transforms into a fully random solid solution. In
most cases, $\Delta S^{p_1\to p_2}_{conf}$ will be smaller, owing to short-range order in the disordered
state and also to some disorder in the low-temperature phase. Thus, even a
relatively small $\Delta S^{p_1\to p_2}_{vib}$ can lower $T^{p_1\to p_2}_{conf+vib}$ by a significant amount. Indeed,
experimental values of $\Delta S_{vib}$ = 0.14$k_B$/atom for Cu₃Au have led to the expectation of a
large lowering of the order-disorder transition temperature. However, if there is
a large contribution of anharmonicity involved in $\Delta S_{vib}$, then the vibrational
entropy increases rapidly with temperature and the effect of vibration will be
smaller. Equation (88) is then no longer valid.
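
The size of the effect can be estimated with a few lines of arithmetic. The sketch below assumes that Eq. (88) has the form $T_{conf+vib} = T_{conf}/(1 + \Delta S_{vib}/\Delta S_{conf})$, which follows from balancing the same energy difference against a larger total entropy change, and that Eq. (89) carries the usual minus sign of the ideal mixing entropy. Function names are illustrative, and entropies are in units of $k_B$ per atom.

```python
import math

def configurational_entropy(c):
    """Ideal mixing entropy per atom in units of k_B, as in Eq. (89)."""
    if c in (0.0, 1.0):
        return 0.0                     # pure element: no mixing entropy
    return -(c * math.log(c) + (1.0 - c) * math.log(1.0 - c))

def transition_temperature_with_vibrations(t_conf, ds_vib, ds_conf):
    """Order-disorder temperature including vibrational entropy.

    Assumes Eq. (88) in the form T_conf+vib = T_conf / (1 + dS_vib/dS_conf),
    i.e. the same energy difference balanced by a larger entropy change.
    """
    return t_conf / (1.0 + ds_vib / ds_conf)
```

With the maximum configurational entropy at c = 0.5 (ln 2, about 0.693) and a vibrational entropy difference of 0.14, the transition temperature is reduced to roughly 83% of its configurational-only value under these assumptions. A smaller $\Delta S_{conf}$, as expected with short-range order, makes the reduction larger.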

One typical example of the major role played by the vibrational entropy is
the stability of the β-phase of many metals and intermetallic compounds at high
temperature. Because metallic bonding is essentially nondirectional, the greatest
stability is expected for those crystallographic arrangements that maximize
the coordination number of each atom (Zangwill and Bruinsma 1987). It may be
concluded therefore that the bcc phase is energetically unstable in comparison to
close-packed structures. However, the large entropy of the bcc phase results in a
lower free energy, which is responsible for the stability of this phase. In the late
1940s, Zener (1947) predicted that the large entropy of the β-phase has a mostly
vibrational origin. Forty years later, phonon dispersions of group III and IV
metals were investigated by Petry et al. by growing single crystals in situ at high
temperature in the neutron scattering furnace (Flottmann et al. 1987). The
dispersion curves are characterized by low-energy phonons in the [112] direction
(Petry et al. 1991a,b; Trampenau et al. 1991; Güthoff et al. 1993). This 
makes them different from group V and VI metals, where low-energy phonons 
are absent and where the bcc phase stability has an electronic origin. In terms of 
free energy, it is mainly the increased entropy due to these low-energy phonon 
modes which stabilizes the bcc structure of group IV metals. The anomalous de- 
crease in entropy in the bcc phase is caused by the increase in frequency of the 
low-energy phonons with increasing temperature. We note that this behavior is 
anomalous because with increasing temperature metals usually become softer 
and their phonon frequencies decrease. Phonon softening with decreasing temper- 
ature is a common behavior of many bcc materials and reflects their ability to un- 
dergo a phonon-triggered martensitic transformation from the high-temperature 
bcc phase to a close-packed structure at low temperature. 

The link between the phonon softening and the martensitic transition will
be discussed in Section 4.4.3. The martensitic transition takes place when the
entropic contribution to the free energy is compensated by the energy difference
between open and close-packed structures. For instance, the gains in vibrational
entropy $\Delta S_{vib}$ at the transition for Ti and Zr are found to be equal to 0.29 and
0.26$k_B$/atom, respectively. For comparison, the total change in entropy is ΔS =
0.42 and 0.40$k_B$/atom, i.e., the electronic contribution can be estimated to be
$\Delta S_{el}$ = 0.13 and 0.14$k_B$/atom, respectively. This means that for Ti and Zr the
lattice vibrational entropy accounts for 65-70% of the excess entropy at the
martensitic transition.

The same tendency has been observed in the Cu-based martensitic alloys.
Assuming that the phonon modes in the close-packed phase can be described
within the Debye theory, Romero and Pelegrina (1994) have calculated
$\Delta S_{vib}$ = 0.15$k_B$/atom for the Cu-based martensitic alloys. Owing to the small
value of the electronic contribution (Mañosa et al. 1993a), to a very good
approximation, this excess entropy of the β-phase, responsible for its high-temperature
stability, has a vibrational origin that arises from the low-energy TA₂[110]
phonons and is practically independent of composition. Therefore, the martensitic
transition in Cu-based alloys can be considered as a transition driven purely by
vibrational entropy.


4.4 
Beyond the Harmonic Approximation 


Up to now, we have treated the lattice vibrations in the harmonic approximation, 
which gives a satisfactory description of vibration properties in solids at low tem- 
perature when the vibrational amplitudes are small in comparison to the nearest- 
neighbor atomic distance. The harmonic approximation gives the simple picture 
of non-interacting phonons but has, however, a number of consequences. Being 
eigenstates of the harmonic Hamiltonian, the phonons have infinite lifetime, 
their energy linewidths are zero, and their mean free paths are infinite. In reality, 
lattice forces are anharmonic for all crystals at finite temperature, a fact which 
corresponds to the experimental observations of thermal expansion and finite 
values of the thermal conductivity due to multiphonon scattering. To correct for 
these shortcomings, anharmonicity has to be taken into account. However, the 
well-resolved experimental techniques used for lattice vibration investigations, 
such as neutron scattering, indicate that one-phonon scattering processes are 
clearly dominant. This suggests that anharmonicity, expressed by the third and 
higher-order terms in the expansion (33), can be viewed as a perturbation of the 
non-interacting phonon states of a crystal. 

The third term $\mathcal{U}_3$ in the expansion (33) can be transformed using Eq. (55) into
Eq. (90),


$$\mathcal{U}_3 = \frac{1}{3!\sqrt{N}}\sum_{q,q',q''}\sum_{\alpha\beta\gamma}\delta_{G,\,q+q'+q''}\,\phi_{\alpha\beta\gamma}(q,q',q'')\,X_\alpha(q)X_\beta(q')X_\gamma(q'') \tag{90}$$


where


$$\phi_{\alpha\beta\gamma}(q,q',q'') = \sum_{h',h''}\phi_{\alpha\beta\gamma}(0,h',h'')\exp(iq'\cdot r_{h'})\exp(iq''\cdot r_{h''}) \tag{91}$$

and G is a reciprocal lattice vector. The anharmonic term $\mathcal{U}_3$ involves three
phonon wave vectors, which, due to the translational invariance, have to satisfy the
conservation of momentum:


$$G = q + q' + q'' \tag{92}$$


The first anharmonic term introduces a coupling process between three phonons.
Processes with G = 0 are called “normal” processes and with G ≠ 0 “umklapp”
processes. Similarly, the fourth term, $\mathcal{U}_4$, causes in first-order perturbation
four-phonon interactions, and so on. For many purposes, the temperature-dependent
properties of anharmonic crystals can be studied using the quasi-harmonic ap- 
proximation, in which the equilibrium positions are taken as parameters which 
are determined by the minimum of the temperature-dependent free energy. The 
expansion is then, as in the harmonic approximation, limited to the quadratic 
terms in the displacement from these minimum positions. The free energy takes 
the same form as in the harmonic approximation, i.e., the sum of potential en- 
ergy and vibrational free energy. 


4.4.1 
Thermal Expansion 


In the previous section, we discussed the effect of anharmonicity in terms of 
phonon-phonon coupling. In this section we show that anharmonicity has 
another effect too: it gives rise to the thermal expansion of the crystal. 

The thermal expansion $\alpha_P = (1/V_0)(\partial V/\partial T)_P$ is due to the anharmonicity of
the total energy: the curve of Fig. 4.1(b) is slightly flatter toward the high values
of a than at the small values of a. For a given energy increase $k_BT$, the average
lattice parameter will therefore be larger than $r_0$. In the simple model of Eq. (14),


$$\langle\delta\rangle = \frac{\int_{-\infty}^{\infty}\delta\,\exp(-\mathcal{U}(\delta)/k_BT)\,d\delta}{\int_{-\infty}^{\infty}\exp(-\mathcal{U}(\delta)/k_BT)\,d\delta}$$

is calculated by expanding exp((6?/kgT) ~ 1+ (6?/kgT for small 6; thus we get 
the thermal expansion coefficient «p = (9¢/4é)kg (Kittel 1976; Philibert et al. 
1998). In pure Pt, we can compare this with the tabulated value: $\alpha_P$ = 8.90 ×
10⁻⁶ K⁻¹. From the adjustment of the curve (a) in Fig. 4.1(b), we get
7.50 × 10⁻⁶ K⁻¹.

The thermal expansion can also be related to the specific heat via the bulk
modulus. The pressure of the system, expressed in the frame of the canonical
description in statistical mechanics, is $P = -(\partial F/\partial V)|_T$ where $F = -k_BT\ln Z$ is the
free energy of the system and Z is the partition function. Thus the vibrational
free energy is


$$F = k_BT\sum_{j,q}\ln[1 - \exp(-\beta\hbar\omega_j(q))] + \frac{1}{2}\sum_{j,q}\hbar\omega_j(q) \tag{93}$$


The derivative of F with respect to V gives

$$P = -\left(\frac{\partial F}{\partial V}\right)\bigg|_T = -\sum_{j,q}\frac{\partial F}{\partial\omega_j(q)}\,\frac{\partial\omega_j(q)}{\partial V} \tag{94}$$

From Eq. (94), we see that the pressure and the volume of the system depend on
T only if the term $(\partial\omega_j(q)/\partial V)$ is not equal to zero. Thus the harmonic
approximation, in which the frequencies $\omega_j(q)$ are volume-independent, cannot account
for the thermal expansion.
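Inserting Eq. (93) in Eq. (94) we get


$$P = -\sum_{j,q}\hbar\left(\bar{n}_{jq} + \frac{1}{2}\right)\frac{\partial\omega_j(q)}{\partial V} = \frac{1}{V}\sum_{j,q}\gamma_j(q)\,\hbar\omega_j(q)\left(\bar{n}_{jq} + \frac{1}{2}\right), \qquad \gamma_j(q) = -\frac{V}{\omega_j(q)}\frac{\partial\omega_j(q)}{\partial V} \tag{95}$$

where $\bar{n}_{jq}$ is the equilibrium phonon distribution function. The Grüneisen
parameter $\gamma_j(q)$ for the mode (j,q) characterizes the variation of the normal mode
frequencies $\omega_j(q)$ with the volume V of the crystal. To obtain the Grüneisen
parameters of the mode, measurements or calculations of the phonon dispersion at
different temperatures are needed. Usually neither is available. Therefore,
averaged Grüneisen parameters are often used:


$$\gamma = -\frac{V}{\omega_D}\frac{\partial\omega_D}{\partial V} = -\frac{V}{\theta_D}\frac{\partial\theta_D}{\partial V} = -\frac{\partial(\ln\theta_D)}{\partial(\ln V)} \tag{96}$$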

The experimental values of γ typically vary between 1 and 2 for most solids.

The thermal expansion coefficient, defined by $\alpha = (1/V)(\partial V/\partial T)_P$, is
proportional to the specific heat and is given by (Ashcroft and Mermin 1976)
$\alpha = \gamma C_V/K$, where K is the bulk modulus defined by $K = -V(\partial P/\partial V)_T$. Grüneisen
considered γ as independent of temperature; the thermal expansion parameter
thus vanishes as T³ when T → 0. This normal behavior is violated in a number
of substances where, at increasing temperature, the lattice constant decreases at
low temperatures and increases at high temperatures. Such a behavior can occur
if some groups of phonons have opposite temperature dependences. This has
been observed even in simple structures, such as in the bcc phase of group IV
metals (β-Zr, β-Ti). In these systems, the majority of the phonons soften with
increasing temperature, as expected for a normal metal. The phonons in a certain
region of the Brillouin zone, however, show the opposite behavior: they harden 
considerably. This inverse temperature behavior is related to the martensitic tran- 
sition which, in this case, is a phonon-driven first-order displacive transition and 
will be discussed in detail in Section 4.4.3. 
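
The relation α = γC_V/K can be turned around to estimate an average Grüneisen parameter from measured quantities. The sketch below assumes that α is the volume expansion coefficient and that $C_V$ is expressed per unit volume, so that γ is dimensionless; the function names and any numerical inputs are hypothetical.

```python
def gruneisen_parameter(alpha, bulk_modulus, cv_per_volume):
    """Average Gruneisen parameter from alpha = gamma*C_V/K.

    alpha: volume thermal expansion coefficient (1/K)
    bulk_modulus: K in Pa
    cv_per_volume: heat capacity per unit volume in J/(m^3 K)
    """
    return alpha * bulk_modulus / cv_per_volume

def thermal_expansion(gamma, bulk_modulus, cv_per_volume):
    """Inverse relation: alpha from gamma, K and C_V per unit volume."""
    return gamma * cv_per_volume / bulk_modulus
```

Because K varies only weakly with temperature, a temperature-independent γ makes α follow the temperature dependence of $C_V$, which is the origin of the T³ behavior at low temperature mentioned above.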


4.4.2 
Thermal Conductivity 


If a temperature gradient is imposed across a solid, the elementary excitations
such as free electrons and phonons will acquire more energy in the hottest part
than in the coldest part. This will induce a spontaneous flow of energy (heat)
from the hottest to the coldest part of the specimen. In metals both electrons
and phonons contribute to the thermal conductivity. In insulators, most of the
heat is conducted by the phonons. If ∇T is the applied temperature gradient, in a
steady state the rate of heat flow, per unit area normal to the gradient, is given by


$$Q = -\kappa\nabla T \tag{97}$$


where κ is the thermal conductivity. The form of Eq. (97) implies that the
thermal energy transfer is a random process. The energy, carried by the phonons,
does not proceed directly in a straight path from the hottest spot to the coldest
spot, but diffuses through the specimen. If the energy were propagating directly,
the flux would not depend on the gradient but on the difference in temperature
between the hottest and the coldest spots of the specimen. We have seen in the
previous section that anharmonicity induces a phonon-phonon coupling process
which reduces the phonon lifetime. Besides, real crystals are of finite size and
contain defects and impurities. This leads to phonon scattering mechanisms
that can be taken into account by introducing a mean free path into the
expression of the thermal conductivity. If we assume that the gradient is applied along
the x axis only, we get $Q_x = -\kappa\,(dT/dx)$.

We assume a phonon gas is confined in a box in which the particles are free to
move. If $v_x$ is the x component of the particle velocity, the flux of particles in the
x direction is $\frac{1}{2}n\langle|v_x|\rangle$, where n is the particle density and the brackets ⟨ ⟩ stand
for the average value. At equilibrium, without a T gradient, there is a flux of equal
magnitude in the opposite direction. We suppose now that heat is provided at one
end of the box and we consider two isothermal cross-sections at T and T + ΔT.
Because of the gradient in temperature now established, the particles will move
from the region at temperature T + ΔT toward the region at T. If we assume
that the particle is scattered in the two regions T + ΔT and T, and is free between
them, then


$$\Delta T = \frac{dT}{dx}\,l_x = \frac{dT}{dx}\,v_x\tau \tag{98}$$

where τ is the average time between the two scattering processes. While moving,
the particle will give up the energy cΔT where c is the particle heat capacity. Thus
the flux balance between the two particle fluxes is


$$Q_x = -n\langle v_x^2\rangle c\tau\,\frac{dT}{dx} = -\frac{1}{3}\,n\langle v^2\rangle c\tau\,\frac{dT}{dx} = -\frac{1}{3}\,Cvl\,\frac{dT}{dx} \tag{99}$$


with l = vτ being the mean free path and C = nc. Thus $\kappa = \frac{1}{3}Cvl$. The specific
heat and the collision rate $\tau^{-1}$, related to the phonon mean free path by
$v = l\tau^{-1}$, are the main quantities which determine the temperature dependence
of the thermal conductivity. Let us discuss qualitatively the temperature
dependence of κ. At high temperature, the Bose-Einstein distribution reduces to an
expression proportional to the temperature; thus, the total number of phonons
present in the system is proportional to T. Since a given phonon is more likely to
be scattered if there are other phonons present to reduce the mean free path by
scattering processes, we expect that the collision rate $\tau^{-1}$ increases with
increasing temperature. Furthermore, since at high temperature the phonon specific
heat obeys the Dulong and Petit law and therefore is temperature-independent,
we expect the thermal conductivity to decrease with increasing temperature.
This statement is confirmed by many experimental results. Generally, the
decrease of κ at high temperature obeys a power law $\kappa \propto T^{-r}$ where r is between
1 and 2.

At low temperature, the average number of phonons with energy ħω, according
to the Bose-Einstein distribution, is given by exp(−βħω) [see Eq. (77)]. In this
temperature regime, the fraction of phonons with energies comparable to or
smaller than $k_BT$ will be dominant. In particular, when $T \ll \theta_D$, the phonons will
have $\omega_j(q) \ll \omega_D$ and $k \ll k_D$, where $k_D$ is the Debye wave vector defined by
$k_D = \omega_D/v$. In the case of three-phonon scattering, since the total energy is
conserved during the scattering process, the energy of the emerging phonon will also
be small compared to $\hbar\omega_D$. This requires that the wave vector of each phonon,
and hence the sum of the wave vectors, be small compared to $k_D$. Thus the
additive reciprocal lattice vector G appearing in the crystal conservation of
momentum law [Eq. (92)] is zero. At very low temperature, the only collisions occurring
with an appreciable probability are those conserving the total crystal momentum
exactly, the so-called “normal” processes. The freezing out of the umklapp
processes is of crucial significance for the low-temperature thermal conductivity.
Normal processes alone cannot bring the system to full thermodynamic
equilibrium because they leave the heat energy flow Q unchanged. If we start with a
distribution of hot phonons down a rod with Q ≠ 0, the distribution will propagate
down the rod with Q unchanged. Therefore there is no thermal resistance and the
thermal conductivity will be infinite. Crystals have finite thermal conductivity at
low temperature because there will still be some small probability of
crystal-momentum-destroying umklapp processes that degrade the thermal conductivity.
This requires the presence of phonons with wave vectors comparable to $k_D$. The
mean number of such phonons is given by the factor $\exp(-\theta_D/T)$ where $\theta_D$ is the
Debye temperature. Thus, as the temperature drops, the number of phonons that
can participate in umklapp processes drops exponentially, and the collision rate is
expected to vary as $\exp(-T_0/T)$ where $T_0$ is a temperature of the order of $\theta_D$. At
very low temperature, however, we expect the thermal conductivity to be limited 
by temperature-independent scattering processes determined by the geometry 
and purity of the sample. 


4.4.3 
Soft Phonon Modes and Structural Phase Transition 


Many metallic and intermetallic compounds undergo a martensitic transition, 
which is a first-order, diffusionless solid-solid transformation from a symmetric
high-temperature phase called austenite to the low-symmetry low-temperature
phase called martensite. Some of them have the ability to recover large stress- 
induced deformations by heating the alloy through the martensitic transition 
temperature. They are known as shape-memory materials (Funakubo 1987). This 
property makes them potential candidates for use as actuators, clamps, and sen- 
sors. Recent studies even suggest that the characteristic distortions of such mate- 
rials can be exploited to create micrometric machines (Bhattacharya and James 
2005). It is now well known that these displacive transitions are phonon-triggered 
and can be explained by the soft-phonon modes. The suggestion that certain 
kinds of solid-solid phase transitions might be triggered by phonon instabilities 
was first made by Anderson (1960) and Cochran (1964); since then many experi- 
mental studies on martensitic materials have been made. 

Usually, the frequencies of normal modes of vibration are expected to decrease 
for increasing temperature due to anharmonic interactions at high temperature, 
but in martensitic materials, such as in group III and IV metals, some phonon 
modes show the opposite behavior (Petry et al. 1991a,b; Trampenau et al. 1991). Pho- 
non dispersions of the bcc phase of group III and group IV metals have been in- 
vestigated using one high-temperature furnace for both growth of single crystals 
and neutron scattering measurement (Flottmann et al. 1987). The dispersions are 
dominated by (a) a valley of low-energy phonons with transverse polarization and 
propagation along [112] and [110] directions and (b) a strong dip in the longitudi- 
nal branch in the [111] direction for a value of two-thirds of the reduced wave vec- 
tor. The atomic motions corresponding to this mode can be viewed as bringing 
together two neighboring (111) planes, while the third stays at rest. This atomic
motion gives rise, for a specific wave vector, to the β → ω phase transition
(Fig. 4.11). Furthermore, the phonons along these branches are strongly damped
and show at large q a typical lifetime as short as one vibrational period. This
feature seems to be common to other bcc-based materials and is due to their
natural instability toward the formation of close-packed structures.

In the case of intermetallic compounds, Ni-based (Predel 1991; Saburi 1998)
and Cu-based (Mañosa et al. 1993a,b, 1999; Nicolaus 2000) materials are among
the most investigated ones. Shape-memory alloys have an open bcc structure,
denoted as the β phase, only stable at high temperatures. For kinetic reasons,
the β phase can be retained as a metastable phase below its stability region by
means of fast enough cooling. In this process, the bcc structure is
configurationally ordered in the B2, L2₁, or D0₃ superstructures. On further
cooling, the martensitic transition takes place at a temperature that depends
strongly on composition. Because this transition is diffusionless, martensite
inherits the ordered arrangement of the high-temperature β phase. Martensite is
a close-packed phase that can be described by application of a combination of a
shear mechanism and a shuffle to the high-temperature bcc phase (Burgers 1934;
Kajiwara 1986) together with a superimposed static modulation (or shuffle)
which corresponds to a phonon mode on the TA₂[110] branch with a specific wave
vector. The low-energy TA₂[110] phonon branch shows a hollow in these alloys.
For example, lattice dynamics of the Heusler alloy Ni₂MnGa has been
investigated by inelastic neutron

Fig. 4.11 (a) β → ω transformation scheme. The ω phase has a
hexagonal structure with three atoms per unit cell with a_ω = a_β√2 and
c_ω = a_β√3/2, where a_β is the lattice parameter of the bcc Bravais
lattice. The c and a axes of the ω phase point along the [111] and [110]
directions of the bcc lattice respectively. (b) Phonon dispersion in the
[110] direction of L2₁ Ni₂MnGa at different temperatures showing the
wiggle around ξ = 0.33 (Zheludev et al. 1995).


scattering (Zheludev et al. 1995). A wiggle in the TA₂[ξξ0] branch deepens with
decreasing temperature and results in a distinct minimum at ξ₀ = 0.33 below
300 K (Fig. 4.11b).

Furthermore, a diffuse elastic scattering develops at ξ₀ = 0.33 when the
martensitic transition temperature is approached, and is accompanied by an
elastic peak at the same position in q space. Such behavior, which has been
found in other Ni-based alloys such as NiAl (Shapiro et al. 1991) and NiTi
(Mercier et al. 1980), is known as a precursor effect of the martensitic
transition. From first-principles calculations (Zhao and Harmon 1992), the
elastic peak has been attributed to a strong electron-phonon coupling and
specific nesting properties of the Fermi surface, whereas Halperin and Varma
(1976) considered it as an extrinsic property attributed to crystal
imperfections. Although premonitory effects in the form of anomalously soft
phonons are frequently observed, the soft-phonon frequencies may remain finite
at the temperature where the austenitic phase becomes unstable (incomplete
phonon softening). The crystal surface can then play an important role.
Landmesser et al. (2003) have studied such phase transitions between two and
three dimensions by X-ray scattering under grazing angles in Ni₂MnGa. They have
shown that a free surface should be considered as a defect for nucleation if
the soft bulk phonon couples to the surface. The resulting surface phonon
promotes the phase transformation by lowering the nucleation barrier at the
surface.


4.5 
Experimental Investigation of the Normal Modes of Vibration 


4.5.1 
Raman Spectroscopy 


Raman scattering is a powerful light scattering technique used to diagnose the 
internal structure of molecules and crystals. In a light scattering experiment, light 
of a known frequency and polarization is scattered by a sample and analyzed in 
frequency and polarization. Raman scattered light is frequency-shifted with re- 
spect to the excitation frequency due to atomic or molecular vibrations, but the 
magnitude of the shift is independent of the excitation frequency. This “Raman 
shift” is an intrinsic property of the sample. Additional information, related to 
the spatial form of the excitation, derives from the polarization dependence of 
the Raman scattered light. The shape of an excitation in a material, for example 
a vibration pattern of the atoms in a crystal, and the polarization dependence 
of the scattering are determined by the equilibrium structure of the material 
through the rules of group theory. By this route one gets valuable and unambig- 
uous structural information from the Raman polarization dependence. 

The conservation laws for energy and crystal momentum during such processes
require


$$\hbar\omega_f = \hbar\omega_i \pm \hbar\omega(\mathbf{q}), \qquad \hbar n\mathbf{k}_f = \hbar n\mathbf{k}_i \pm \hbar\mathbf{q} + \hbar\mathbf{G} \qquad (100)$$


where G is a reciprocal lattice vector and ω(q) is the frequency of the phonon
of wave vector q. k_i, k_f and ω_i, ω_f are the wave vectors and frequencies of
the incident and the scattered photons respectively. Note that the photon wave
vector inside the crystal differs from its free-space value because of the
index of refraction n of the crystal. The upper (plus) sign in Eqs. (100)
refers to the scattering process in which a phonon is annihilated (the
anti-Stokes component of the scattered light) and the lower (minus) sign
refers to the process in which a phonon is created (known as the Stokes
component). If photons of visible light are used, their wave vectors are small
compared to the Brillouin zone dimensions, and the excited phonons lie only in
the immediate neighborhood of the center of the Brillouin zone (q = 0). In this
case, the crystal momentum conservation law (100) is obeyed only if the
reciprocal lattice vector G is zero. To a good approximation, Raman scattering
occurs from zero wave vector phonons.
However, to the extent that the phonon wave vector differs from zero, phonon se- 
lection rules will deviate from the zero wave vector rules and will depend on the 
angle between the directions of propagation of the incident and scattered light. If 
optical phonons, which have no dispersion at the zone center, are excited, the
direction dependence of the Raman shift is quite small. On the other hand, for
the acoustic phonons, which have a linear dispersion near the zone center, the 
angular dependence of the Raman shift is more pronounced. Because the pho- 
non wave vector is small, the acoustic phonon will have a small energy. Raman 
scattering from low-energy acoustic phonons is known as Brillouin scattering. 
The essential difference between Raman and Brillouin scattering is the sensitivity 
of the “Brillouin shift” to the relative angle of scattering. 
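
The angular sensitivity of the Brillouin shift can be illustrated with a short calculation. The sketch below assumes the usual quasi-elastic geometry, in which the photon frequency change is negligible so that |k_f| ≈ |k_i| and the phonon wave number is |q| = 2nk₀ sin(θ/2), with k₀ the free-space photon wave number and θ the scattering angle inside the crystal; it further assumes a linear acoustic branch ω = v|q|. The function names and the numerical values (wavelength, index of refraction, sound velocity) are hypothetical.

```python
import math


def phonon_wave_number(vacuum_wavelength_m, refractive_index, scattering_angle_rad):
    """|q| = 2 n k0 sin(theta/2), assuming |k_f| ~ |k_i| (small frequency shift)."""
    k0 = 2.0 * math.pi / vacuum_wavelength_m
    return 2.0 * refractive_index * k0 * math.sin(scattering_angle_rad / 2.0)


def brillouin_shift_hz(vacuum_wavelength_m, refractive_index,
                       scattering_angle_rad, sound_velocity_m_s):
    """Frequency shift v*|q|/(2*pi) for an acoustic branch with omega = v*|q|."""
    q = phonon_wave_number(vacuum_wavelength_m, refractive_index,
                           scattering_angle_rad)
    return sound_velocity_m_s * q / (2.0 * math.pi)


if __name__ == "__main__":
    # Hypothetical illustration values, not taken from the text.
    wavelength = 500e-9   # m, visible light
    n = 1.5               # index of refraction
    v = 5000.0            # m/s, sound velocity
    for angle_deg in (30.0, 90.0, 180.0):
        shift = brillouin_shift_hz(wavelength, n, math.radians(angle_deg), v)
        print(f"theta = {angle_deg:5.1f} deg   shift = {shift / 1e9:.2f} GHz")
```

For an optical branch that is flat near the zone center the same change of angle would leave the shift essentially unchanged, which is the distinction drawn above between Raman and Brillouin scattering.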

The frequency spectrum of the Raman scattered light maps out only a part of 
the excitation spectrum. Other spectroscopic techniques, such as infrared absorp- 
tion, are used to map out the non-“Raman-active” excitations. As will be seen in 
the next section, Raman spectroscopy is a less powerful probe of the phonon dis- 
persion spectra than neutron scattering. The great virtue of neutrons is that the 
dispersion is accessible over the whole Brillouin zone and once the scattered en- 
ergies have been resolved, the highly informative one-phonon scattering pro- 
cesses are clearly identifiable. 


4.5.2 
Inelastic Neutron Scattering 


The energy of thermal neutrons is of the same order of magnitude as that of the
dynamical excitations in solids. So when the neutron is inelastically scattered 
by the creation or the annihilation of a phonon, the change in the energy of the 
neutron is a large part of its initial energy. Measurement of neutron energy thus 
provides accurate information on energies of phonons and hence on interatomic 
forces. Thermal inelastic neutron scattering is one of the major tools to measure 
the phonon dispersion relationship, i.e., the frequency ω_j as a function of q and
polarization index j. 

Consider a neutron, of momentum ħk_i and energy E_i = ħ²k_i²/2m, that is
incident upon a crystal. After the interaction, the neutron emerges with
momentum ħk_f and energy E_f = ħ²k_f²/2m. During the neutron-crystal
interaction, a dynamical excitation of wave vector q and energy ħω_j(q) is
created or annihilated. The conservation laws of energy and momentum imply that
the scattering process, in the case of phonon annihilation, obeys the
conditions E_f − E_i = ħω_j(q) and

$$\mathbf{k}_i - \mathbf{k}_f + \mathbf{q} = \mathbf{G} \qquad (101)$$

where G is a reciprocal lattice vector. These conditions are so restrictive that
for given scattering angles only a few phonons of particular q and ω_j(q) can be
involved in the scattering process. We can make use of this to determine the
phonon dispersion ω_j(q). Generally, a monochromatic beam arrives on the crystal
and the energy of the neutrons scattered through a given angle is measured by
using a time-of-flight apparatus or another crystal as analyzer. When a phonon
is detected, the scattered energy gives access to ω_j(q) and Eq. (101) to the
change in wave vector. As q has to lie in the first Brillouin zone, both q and G
are obtained. One point on the phonon dispersion curve is determined.
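
The bookkeeping that turns one detected neutron into one point of the dispersion curve can be sketched as follows. The example assumes a simple cubic reciprocal lattice of parameter a, so that G = (2π/a)(h, k, l), and uses ħ²/2m = 2.0721 meV Å² for the neutron; the function names and input values are hypothetical, and a real crystal would need its own reciprocal lattice.

```python
import math

HBAR2_OVER_2M_MEV_A2 = 2.0721  # hbar^2 / (2 m_n) in meV * Angstrom^2


def neutron_energy_mev(k):
    """Kinetic energy (meV) of a neutron with wave vector k (1/Angstrom)."""
    return HBAR2_OVER_2M_MEV_A2 * sum(c * c for c in k)


def reduce_to_first_zone(k_i, k_f, lattice_parameter):
    """Split k_f - k_i into q (first zone) and G for a simple cubic lattice.

    Returns (q, hkl) with q = (k_f - k_i) + G and G = (2*pi/a) * hkl,
    i.e. the phonon annihilation condition k_i - k_f + q = G.
    """
    g_unit = 2.0 * math.pi / lattice_parameter
    delta_k = [f - i for i, f in zip(k_i, k_f)]
    hkl = [-round(c / g_unit) for c in delta_k]
    q = [c + h * g_unit for c, h in zip(delta_k, hkl)]
    return q, hkl


def phonon_from_annihilation(k_i, k_f, lattice_parameter):
    """Return (phonon energy in meV, q, hkl) for a phonon annihilation event."""
    energy = neutron_energy_mev(k_f) - neutron_energy_mev(k_i)
    q, hkl = reduce_to_first_zone(k_i, k_f, lattice_parameter)
    return energy, q, hkl


if __name__ == "__main__":
    # Hypothetical wave vectors (1/Angstrom) and lattice parameter (Angstrom).
    energy, q, hkl = phonon_from_annihilation((2.0, 0.0, 0.0),
                                              (2.5, 0.9, 0.0), 4.0)
    print("phonon energy (meV):", round(energy, 3))
    print("q (1/Angstrom):", [round(c, 3) for c in q], " hkl:", hkl)
```

Repeating this for many detector settings fills in the curve ω_j(q) point by point.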

In an inelastic neutron scattering experiment, the measured quantity is the
partial differential cross-section which gives the fraction of neutrons of incident
energy E scattered into an element of solid angle dΩ with an energy between E
and E + dE:








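$$\left(\frac{d^2\sigma}{d\Omega\,dE}\right) = \frac{(2\pi)^3}{2v_0}\,\frac{|\mathbf{k}_f|}{|\mathbf{k}_i|}\sum_{\mathbf{G}}\sum_{j,\mathbf{q}}\frac{|F_j(\mathbf{Q},\mathbf{q})|^2}{\omega_j(\mathbf{q})}\,(A_- + A_+) \qquad (102)$$

where (Squires 1978; Lovesey 1984)

$$A_- = (\bar{n}_{j\mathbf{q}} + 1)\,\delta(\omega - \omega_j(\mathbf{q}))\,\delta(\mathbf{Q} - \mathbf{q} - \mathbf{G}), \qquad (103)$$

$$A_+ = \bar{n}_{j\mathbf{q}}\,\delta(\omega + \omega_j(\mathbf{q}))\,\delta(\mathbf{Q} + \mathbf{q} - \mathbf{G}) \qquad (104)$$

and v₀ is the volume of the unit cell, Q = k_i − k_f the scattering vector, ħω
the energy lost by the neutron, and n̄_jq the Bose-Einstein distribution.
F_j(Q,q) is the dynamical structure factor, given by:

$$F_j(\mathbf{Q},\mathbf{q}) = \sum_k \frac{b_k}{\sqrt{m_k}}\,\exp(-W_k Q^2)\,\exp(i\mathbf{Q}\cdot\mathbf{r}_k)\,\bigl(\mathbf{Q}\cdot\mathbf{e}_k(j,\mathbf{q})\bigr) \qquad (105)$$

b_k is the coherent scattering length of the kth atom in the unit cell, m_k its
mass, and r_k its position. W_k = ⟨u(k)²⟩/2, where ⟨u(k)²⟩ is the mean square
displacement of atom k. The delta functions in Eqs. (103) and (104) express the
conservation laws. Equation (103) describes a scattering process in which a
phonon is created. The neutron energy decreases by an amount equal to the
energy of a phonon of type j with a wave vector q. Similarly Eq. (104)
describes a phonon annihilation process where the energy of the neutron is
increased.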
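
The thermal weights of the two processes, n̄ + 1 for phonon creation and n̄ for phonon annihilation, are easy to evaluate. The sketch below is a minimal illustration with hypothetical function names; energies are in meV and the Boltzmann constant is taken as 0.08617333 meV/K.

```python
import math

K_B_MEV_PER_K = 0.08617333  # Boltzmann constant in meV/K


def bose_occupation(phonon_energy_mev, temperature_k):
    """Mean occupation number of a mode of the given energy at temperature T."""
    if temperature_k <= 0.0:
        return 0.0
    x = phonon_energy_mev / (K_B_MEV_PER_K * temperature_k)
    if x > 700.0:  # exp(x) would overflow; the occupation is negligible anyway
        return 0.0
    return 1.0 / math.expm1(x)


def thermal_factors(phonon_energy_mev, temperature_k):
    """Return the (creation, annihilation) weights (n + 1, n)."""
    n = bose_occupation(phonon_energy_mev, temperature_k)
    return n + 1.0, n


if __name__ == "__main__":
    energy = 5.0  # meV, hypothetical phonon energy
    for T in (10.0, 100.0, 300.0, 1000.0):
        creation, annihilation = thermal_factors(energy, T)
        print(f"T = {T:6.1f} K   n+1 = {creation:8.4f}   n = {annihilation:8.4f}")
```

The ratio of the two weights equals exp(ħω/k_BT), so at low temperature only the creation (neutron energy loss) side carries appreciable intensity.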

The intensity of the phonon observation by neutron scattering is determined by
different factors. The term 1/ω_j(q) in Eq. (102) indicates, independently of
all other terms, that the intensity is inversely proportional to the frequency
of the mode. This attenuation of the intensity results from the quantum
mechanics of the harmonic oscillator. High-energy modes are thus always more
difficult to observe than the low-energy ones. Since neutron experiments suffer
from the low flux of the existing sources, this effect frequently prohibits the
study of the high-frequency part of the phonon dispersion curves. The term n̄_jq
results from the Bose-Einstein statistics of a given mode with frequency
ω_j(q); the occupation number is given by Eq. (77) and tends toward zero for
T → 0. For T → ∞, n̄_jq approaches k_BT/ħω_j(q), i.e., the classical relation.
Equation (103) indicates that the Bose-Einstein function is increased by +1 in
the case of a phonon creation. At low temperature, where n̄_jq is close to zero,
the phonon can be observed only in the creation mode; the cross-section for the
annihilation process vanishes since the phonon states are no longer occupied.
At finite temperature, the Bose-Einstein distribution further facilitates the
observation of phonons with low frequencies. One more factor in expression
(105) is the scalar product Q·e_k(j,q), which means that by measuring the
intensity of the scattered signal, it is possible, in principle, to deduce the
polarization vectors e_k(j,q). In general the polarization vectors
corresponding to a given wave vector q are not related in a simple way to the
direction of q. But in certain cases there is a simple relationship. For
example, if q lies in the (001) plane of a cubic crystal, one of the e_k(j,q)
is along the [001] axis. If the scattering vector Q is arranged so that Q is in
the (001) plane, Q·e_k(j,q) is zero for this mode. This can be used to
extinguish one polarization branch.

Inelastic neutron scattering experiments are usually performed using either a
time-of-flight setup or a three-axis spectrometer. In the time-of-flight setup,
the energy transfer of the inelastically scattered neutrons is deduced from the
time the neutrons spend in flight over the chopper-sample-detector distance and
compared to that of the elastically scattered neutrons. The shift in
time-of-flight can then be converted into a gain or a loss in energy. The
time-of-flight method is a useful technique for determining the phonon density
of states by measuring the incoherent one-phonon scattering as a function of
energy. A basic difficulty of the method is that incoherent scattering from
multiphonon processes also occurs. Both one-phonon and multiphonon processes
give incoherent scattering, and the contribution of the latter is not easy to
estimate, even though the use of an appropriate sample environment, such as
hollow cylinders, may reduce it. Furthermore, the time-of-flight method of
measuring phonon frequencies suffers from the disadvantage that the q value of
the phonon cannot be preselected. The use of three-axis spectrometers,
developed originally by Brockhouse (1960), overcomes this disadvantage. On a
three-axis spectrometer, the energy and wave vector of the incident neutron are
selected from the white beam by a first Bragg reflection on a monochromator
(first axis, angle θ_M; see Fig. 4.12). They will then

Fig. 4.12 Three-axis spectrometer scheme with the different angles.

Fig. 4.13 Measurement of a phonon (q, ω) in a q-constant mode. The
ellipse shows the resolution function of the spectrometer.


interact with the sample and will be scattered in a direction (second axis, angle y,
defining the wave vector direction of the scattered neutrons) along which a
second crystal, called the analyzer, is placed to select the energy by another
Bragg reflection (third axis, angle θ_A). Finally the neutrons are detected by
a detector. The momentum transfer is chosen by rotating the sample and the
analyzer to obtain the required direction and length of the exit wave vector.
One point in the (q, ω) space corresponds to one configuration of the
spectrometer, i.e., the angles θ_M, θ_A, and y of the three axes. The idea is
to scan the (q, ω) space using different configurations of the spectrometer;
when a given point (q, ω_j) satisfies the dispersion relationship ω = ω_j(q) of
the crystal, the number of counted neutrons will substantially increase.
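
How one point in (q, ω) space translates into a spectrometer configuration can be sketched with Bragg's law and the scattering triangle. The example below assumes first-order Bragg reflection, λ = 2d sin θ with λ = 2π/k, on both monochromator and analyzer, and obtains the angle between k_i and k_f from |Q|² = k_i² + k_f² − 2k_ik_f cos(angle). The default d-spacing of 3.355 Å (pyrolytic graphite 002) and all function names are assumptions for illustration; the sample rotation that orients Q in the crystal is not treated.

```python
import math

HBAR2_OVER_2M_MEV_A2 = 2.0721  # hbar^2 / (2 m_n) in meV * Angstrom^2


def wave_number(energy_mev):
    """Neutron wave number (1/Angstrom) for a kinetic energy in meV."""
    return math.sqrt(energy_mev / HBAR2_OVER_2M_MEV_A2)


def bragg_angle_deg(k, d_spacing):
    """First-order Bragg angle from lambda = 2 d sin(theta), lambda = 2 pi / k."""
    s = math.pi / (k * d_spacing)
    if s > 1.0:
        raise ValueError("wavelength too long for this d-spacing")
    return math.degrees(math.asin(s))


def scattering_angle_deg(k_i, k_f, q_total):
    """Angle between k_i and k_f that closes the triangle Q = k_i - k_f."""
    cos_angle = (k_i**2 + k_f**2 - q_total**2) / (2.0 * k_i * k_f)
    if abs(cos_angle) > 1.0 + 1e-12:
        raise ValueError("scattering triangle cannot be closed")
    cos_angle = max(-1.0, min(1.0, cos_angle))
    return math.degrees(math.acos(cos_angle))


def spectrometer_angles(e_i_mev, energy_loss_mev, q_total,
                        d_mono=3.355, d_ana=3.355):
    """Return theta_M, the sample scattering angle, and theta_A in degrees."""
    e_f = e_i_mev - energy_loss_mev
    if e_f <= 0.0:
        raise ValueError("final neutron energy must be positive")
    k_i, k_f = wave_number(e_i_mev), wave_number(e_f)
    return {
        "theta_M": bragg_angle_deg(k_i, d_mono),
        "scattering": scattering_angle_deg(k_i, k_f, q_total),
        "theta_A": bragg_angle_deg(k_f, d_ana),
    }


if __name__ == "__main__":
    # Hypothetical setting: 14.7 meV incident, 5 meV energy loss, |Q| = 2.5 1/A.
    for name, value in spectrometer_angles(14.7, 5.0, 2.5).items():
        print(f"{name:10s} = {value:7.2f} deg")
```

A q-constant scan then amounts to stepping the energy loss while recomputing the three angles so that Q stays fixed.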

Either q-constant or ω-constant scans are made depending on the slope of the
dispersion curve at the measured point (Fig. 4.13). Strictly speaking, because
of the mosaicity spreads of the sample itself, of the monochromator, and of the
analyzer, the scattering will not occur at a point but rather in a volume
centered on (q, ω_j). If a neutron is detected, it has a probability R(dq, dω)
of having in reality a transfer (q + dq, ω_j + dω). R(dq, dω) is called the
resolution function of the spectrometer. The measured signal is the convolution
of the inelastic scattering cross-section with the resolution function of the
spectrometer.
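
The effect of the convolution can be illustrated in a deliberately reduced setting. The sketch below treats only the energy axis of a q-constant scan, takes the intrinsic phonon line as a Lorentzian and the resolution as a Gaussian, and convolves them numerically on a uniform grid; the real resolution function is four-dimensional, and the line shapes, widths, and function names used here are assumptions for illustration.

```python
import math


def gaussian(x, sigma):
    """Normalized Gaussian resolution function of standard deviation sigma."""
    return math.exp(-0.5 * (x / sigma) ** 2) / (sigma * math.sqrt(2.0 * math.pi))


def lorentzian(x, center, gamma):
    """Normalized Lorentzian of half width gamma centered at `center`."""
    return (gamma / math.pi) / ((x - center) ** 2 + gamma ** 2)


def convolve_with_resolution(energies, intensity, sigma):
    """Discrete convolution of a spectrum with a Gaussian on a uniform grid."""
    step = energies[1] - energies[0]
    measured = []
    for e in energies:
        total = 0.0
        for e_prime, s in zip(energies, intensity):
            total += s * gaussian(e - e_prime, sigma)
        measured.append(total * step)
    return measured


# Hypothetical q-constant scan: phonon at 10 meV, intrinsic half width 0.2 meV,
# Gaussian energy resolution with sigma = 0.5 meV.
energies = [0.04 * i for i in range(501)]
intrinsic = [lorentzian(e, 10.0, 0.2) for e in energies]
measured = convolve_with_resolution(energies, intrinsic, 0.5)

if __name__ == "__main__":
    print("intrinsic peak height:", round(max(intrinsic), 3))
    print("measured peak height: ", round(max(measured), 3))
```

The peak stays at the phonon energy but becomes lower and broader, which is why the intrinsic width of a damped phonon can only be extracted after the resolution has been accounted for.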


4.6 
Phonon Spectra and Migration Energy 


Of the two parameters that govern diffusion and ordering kinetics, i.e., E_F the
vacancy formation energy and E_M the vacancy migration energy, E_M is generally
less known and more difficult to measure for ordered intermetallic compounds.
In pure metals and random alloys, it can be deduced, for example, from stage
II of resistivity recovery during annealing after low-temperature irradiation
(Schultz 1991), or from a thorough analysis of residual resistometry along
isothermal and isochronal annealing series (Balanzat and Hillairet 1981;
Schulze and Lücke 1972). Those two methods are, however, very sensitive to the
microstructure of the samples and to any impurities or defects.

An alternative method of determining E_M is an evaluation from
lattice-dynamical properties, i.e., elasticity and phonon dispersion. Flynn
(1968) proposed a model for estimating the migration energy in metals of cubic structures
from elastic constants. Recently, a model relating the migration energy for 
nearest-neighbor jumps in cubic metals to the phonon density of states (DOS) 
has been proposed by Schober et al. (1992). 

Both models assume that the total energy is well approximated by a sinusoidal 
form so that the knowledge of the phonon properties (the bottom of the well) 
gives valuable information on the saddle point (the top of the ridge) of atomic 
jump processes. The validity of this approximation has been verified in various 
fcc metals (Schober et al. 1992) and L1₂ compounds (Kentzinger and Schober
2000) by molecular dynamics. In the different phases, there is a geometrical fac- 
tor that makes it possible to take into account the departure from a sinusoidal 
form. This factor is determined by making the calculation in molecular dynamics 
in different systems within the phase concerned. 

Figure 4.14 shows the result of such a calculation in the ordered phase CoPt
(Montsouka et al. 2006). The well corresponds to a second-order polynomial with
equation V = W₀ + ε′x², with ε′ values of 410, 276, 345, and 281 eV nm⁻² in
a-d, respectively. We clearly see that the departure from the sinusoidal form
is present. All parameters of the sinusoidal function are fixed by the
nearest-neighbor distance d and the saddle point energy E_S:
V = W₀ + (E_S/2)(1 − cos(πx/d)). In the case of a jump of the atoms out of the
(001) plane, we have to add a slope due to the formation energy of the
anti-sites (0.235 eV in both cases here). A comparison between different L1₀
systems will allow generalization to all L1₀ phases. The comparison of the
different figures clearly shows that there is a strong anisotropy of the
vacancy migration in the L1₀ phase. Surprisingly, the migration within the
(001) planes is more difficult for steric reasons.

Flynn’s model has been developed for monoelemental cubic lattices and as- 
sumes that the migration energy is dominated by the contribution of acoustical 
modes at the center of the Brillouin zone. Its obvious drawback is that the other 
low-energy phonons are not directly taken into account. The migration energy
derived by Flynn (1968) in the continuum limit reads:


$$E_M^{F} = c\,V_0\,\delta^2$$

Here,

$$\frac{15}{2c} = \frac{3}{c_{11}} + \frac{2}{c_{11} - c_{12}} + \frac{1}{c_{44}}$$


where c₁₁, c₁₂, and c₄₄ are the three independent elastic constants of the cubic
crystal and V₀ is the atomic volume. δ is a structure-dependent constant, which

Fig. 4.14 Line: variation of the total energy in CoPt when a Pt (a,c) or a
Co (b,d) atom moves toward a nearest-neighbor vacancy within the
same (001) plane (a,b) or not (c,d). ○: Variation of the total energy in
CoPt when a Pt (a,c) or a Co (b,d) moves around its equilibrium
position toward a nearest-neighbor atom within the same (001) plane
(a,b) or not (c,d) (Montsouka et al. 2006).


is to be determined by minimizing the deviations between the experimental
values and E_M^F in different materials of the structure concerned. Schober et
al. (1992) have obtained δ² = 0.081 for the fcc structure. Using the elastic
constants deduced from the phonon dispersion slope at the origin of the
Brillouin zone in the [110] direction of the fcc disordered FePd, the migration
energy in Flynn’s model has been found equal to 0.78 eV, a value which compares
quite well with that obtained by ultrasonic resonance spectrometry (Ichitsubo
2000) (0.67 eV).
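
Flynn's estimate is simple enough to evaluate directly. The sketch below assumes the relations E_M = cV₀δ² and 15/(2c) = 3/c₁₁ + 2/(c₁₁ − c₁₂) + 1/c₄₄ with δ² = 0.081 for the fcc structure, as quoted above; the elastic constants and lattice parameter in the usage example are round hypothetical numbers, not measured values for any particular metal, and the function names are illustrative.

```python
EV_IN_J = 1.602176634e-19
DELTA2_FCC = 0.081  # value quoted in the text for the fcc structure


def flynn_modulus(c11, c12, c44):
    """Effective modulus c defined by 15/(2c) = 3/c11 + 2/(c11 - c12) + 1/c44."""
    inverse_sum = 3.0 / c11 + 2.0 / (c11 - c12) + 1.0 / c44
    return 15.0 / (2.0 * inverse_sum)


def flynn_migration_energy_ev(c11, c12, c44, atomic_volume_m3,
                              delta2=DELTA2_FCC):
    """Migration energy c * V0 * delta^2, converted from joules to eV."""
    return flynn_modulus(c11, c12, c44) * atomic_volume_m3 * delta2 / EV_IN_J


if __name__ == "__main__":
    # Hypothetical fcc metal: elastic constants in Pa, lattice parameter in m.
    c11, c12, c44 = 170e9, 120e9, 75e9
    a = 3.6e-10
    atomic_volume = a**3 / 4.0  # four atoms per cubic fcc cell
    energy = flynn_migration_energy_ev(c11, c12, c44, atomic_volume)
    print(f"Flynn estimate: {energy:.2f} eV")
```

Because only the three elastic constants enter, the estimate reflects the long-wavelength acoustic modes alone, which is the limitation noted for this model.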

Based on earlier work of Flynn, Schober et al. (1992) have developed a model
relating the migration energy to the whole phonon spectrum through the phonon
DOS. In this model, the migration energy is derived from Eq. (106):


$$E_M = a^2\alpha\,G_0^{-1} \quad \text{where} \quad G_0 = \int \frac{g(\omega)}{m\omega^2}\,d\omega. \qquad (106)$$

In Eq. (106), E_M separates into a structural factor (a²α), common to all fcc
metals, and an electronic factor (G₀), which is the static Green’s function and
reflects the electronic particularities. a is the lattice parameter and α a
geometrical constant; α has been determined by computer simulation to be equal
to 0.0135 in fcc metals (Schober et al. 1992).
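
The chain from a phonon DOS to a migration energy can be sketched numerically. The example below assumes the reading E_M = a²α/G₀ with G₀ = ∫ g(ω)/(mω²) dω and α = 0.0135, and further assumes that g(ω) is normalized to unit area. A Debye spectrum stands in for a measured DOS purely for illustration, since it gives the closed form G₀ = 3/(mω_D²) against which the numerical integral can be checked; the mass, cutoff frequency, and lattice parameter are hypothetical.

```python
EV_IN_J = 1.602176634e-19
AMU_IN_KG = 1.66053907e-27
ALPHA_FCC = 0.0135  # geometrical constant quoted in the text for fcc metals


def debye_dos(omega, omega_d):
    """Debye density of states normalized to unit area on [0, omega_d]."""
    if 0.0 <= omega <= omega_d:
        return 3.0 * omega**2 / omega_d**3
    return 0.0


def static_green_function(dos, omega_max, mass_kg, n_steps=2000):
    """Midpoint-rule estimate of G0 = integral of g(omega) / (m omega^2)."""
    step = omega_max / n_steps
    total = 0.0
    for i in range(n_steps):
        omega = (i + 0.5) * step  # midpoints avoid the omega = 0 end point
        total += dos(omega) / (mass_kg * omega**2)
    return total * step


def migration_energy_ev(lattice_parameter_m, g0_m_per_n, alpha=ALPHA_FCC):
    """Migration energy alpha * a^2 / G0, converted from joules to eV."""
    return alpha * lattice_parameter_m**2 / g0_m_per_n / EV_IN_J


if __name__ == "__main__":
    # Hypothetical inputs: 60 u atoms, 4.5e13 rad/s cutoff, a = 3.6 Angstrom.
    mass = 60.0 * AMU_IN_KG
    omega_d = 4.5e13
    g0 = static_green_function(lambda w: debye_dos(w, omega_d), omega_d, mass)
    print(f"G0  = {g0:.3e} m/N")
    print(f"E_M = {migration_energy_ev(3.6e-10, g0):.2f} eV")
```

With a measured DOS the function passed as `dos` would interpolate the tabulated spectrum instead; the 1/ω² weight means that the low-frequency part of the spectrum dominates G₀.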

The underlying assumption of the model is that diffusion takes place via
nearest-neighbor jumps and that the potential has a sinusoidal form along the
atom trajectory. In fcc metals, it is commonly accepted that self-diffusion is
dominated by the simplest vacancy mechanism, namely ½⟨1,1,0⟩ atomic jumps into
a nearest-neighbor vacancy (Peterson 1978; Vogl et al. 1989; Petry et al. 1991b).
In pure fcc metals, excellent agreement between calculated and measured values
of E_M was found, whereas in bcc metals, where the experimental values are less
well known, predictions were obtained that show a pronounced systematic
behavior with chemical group. Strictly speaking, this method of determining E_M
is only valid for elemental crystals but, as the model was not yet developed
for the superstructures, it has been used in Fe₁₋ₓSiₓ (Randl 1994; Randl et al.
1995) and Fe₁₋ₓAlₓ alloys (Kentzinger et al. 1996) to estimate the migration
enthalpies assuming an average mass of the components. Recently, the same model
has been used to deduce the migration energy in L1₀-FePd (Mehaddene et al.
2004). Since the model was not yet extended to L1₀ phases, as a first
approximation, the FePd L1₀ structure was considered as an fcc lattice with an
atomic mass equal to the averaged mass of the two atoms. The migration energy
at room temperature in FePd (0.91 eV) is close to the migration barriers
obtained by Kentzinger and Schober (2000) in FePd₃ (0.97 eV for a Pd atom
migrating within its own lattice and 1.14 eV when it jumps on the Fe lattice;
0.90 eV for a Fe atom leaving its own lattice). With an average atomic mass and
the total DOS, a 0.94 eV value was obtained, very comparable to the value
obtained within the same approximation in FePd. Recently, 0.7 eV was obtained
as the average migration energy in the ordered paramagnetic phase of FePd,
using residual resistivity measurements (Partyka et al. 2000; Kulovits et al.
2005), in good agreement with the values deduced from phonon measurements using
neutron inelastic scattering in the same chemical and magnetic state of order.

In a further step, Schober’s model was extended to A₃B compounds with L1₂
structure using the static lattice Green’s functions of the two constituent
species (Kentzinger and Schober 2000). The Green’s function matrix elements
G^k_αβ (α, β = x, y or z) of a given atom k can be expressed in terms of the
eigenvalues ω_j(q) and eigenvectors (i.e., polarization vectors) e_k(j,q)
(j = 1,...,3p where p is the number of atoms in the unit cell) of the dynamical
matrix at wave vector q which are known from Born-von Karman (BVK) fits of the
measured phonon dispersion curves: G^k_αβ = ∫ (g^k_αβ(ω)/m_kω²) dω, where m_k is
the mass of atom k and g^k_αβ(ω) is the partial density of states:


$$g^k_{\alpha\beta}(\omega) = \frac{v_0}{(2\pi)^3}\sum_j \int e^{\alpha}_k(j,\mathbf{q})\,e^{\beta *}_k(j,\mathbf{q})\,\delta(\omega - \omega_j(\mathbf{q}))\,d\mathbf{q} \qquad (107)$$

with v₀ being the volume of the unit cell; the integration extends over the first
Brillouin zone.

In the case of elemental fcc or bcc metals, the static lattice Green’s function
matrix of an atom k (G^k) is diagonal and reduces to a single number (G₀). In
A₃B compounds with L1₂ structure, the B atoms have cubic point symmetry and
the Green’s function matrix G^BB also reduces to a single number. The point
symmetry of the A atoms is tetragonal. For an A atom at position (½,½,0) in the
unit cell, G^AA is diagonal with G^AA_xx = G^AA_yy ≠ G^AA_zz. In this case,
starting from the fully ordered lattice with one vacancy, three types of
nearest-neighbor jumps can be considered: the jump of an A atom into an A
vacancy (A → V_A), the jump of an A atom into a B vacancy (A → V_B), and the
jump of a B atom into an A vacancy (B → V_A).

The Green’s matrix elements of many L1₂ compounds, which have been calculated
from the phonon spectra via the densities of states using the force constants
obtained in the BVK fits, are given and discussed in Kentzinger and Schober
(2000). Here we report on recent calculations of the Green’s function elements
in CoPt₃ and FePd₃. The Green’s functions and the deduced migration energies
for each kind of atomic jump and at the different temperatures are compiled in
Table 4.1. In order to compare the different systems, the L1₂ phases have also
been treated within Schober’s model for fcc pure metals using an average atomic
mass; the corresponding migration energies are denoted E_M^fcc in Table 4.1.
The average migration energies calculated for these systems show a tendency to
increase with the average atomic mass of the system (Table 4.1). The migration
energies in the L1₂ ordered alloys also show, as expected, an increase with the
size of the atom involved in the jump. Moreover, in each alloy, A → V_B is
associated with the highest E_M and B → V_A with the lowest (Table 4.1). The
migration energy is expected to decrease with increasing temperature as a
consequence of increasing anharmonicity. In fact, Schober’s model assumes that
the diffusion jump, the most anharmonic conceivable event, follows a trajectory
parallel to the direction of low-harmonic-restoring forces. In other words, in
the directions where the harmonic part of the potential is low, we expect the
anharmonic part, i.e., the migration barrier, to be low as well. Using a Monte
Carlo model, based on


Table 4.1 Static lattice Green functions (in 10⁻² m/N) of CoPt₃ and
FePd₃ in the L1₂ phase and migration enthalpies (in eV) deduced from
the model developed by Kentzinger and Schober (2000).*

Alloy   T [K]   G^AA_xx   G^AA_zz   G^BB    E_M(A→V_A)   E_M(A→V_B)   E_M(B→V_A)   E_M^fcc
CoPt₃   300     0.968     0.741     1.154   1.28         1.47         1.20         1.36
CoPt₃   930     1.048     0.734     1.101   1.35         1.43         1.19         1.33
FePd₃   80      1.264     0.929     1.553   0.97         1.14         0.90         0.94

* A refers to the majority atoms (Pt or Pd) and B to the minority ones
(Co or Fe).




a nearest-neighbor vacancy jump mechanism and on an Ising Hamiltonian with
effective pair interaction energies for first- and second-neighbors in L1₂ and
L1₀ (Kerrache et al. 2000) compounds, the contribution of the ordering energy
to the mean migration energy has been found equal to ΔE_M/T_C = 0.28 meV K⁻¹
and 0.19 meV K⁻¹, respectively. ΔE_M is the difference between the migration
energy in the ordered and in the disordered phases and T_C is the
order-disorder transition temperature. The migration energy is larger in the
ordered state because, at least in some directions, the migration of the
vacancies is accompanied by a change of chemical long-range order that costs
some additional energy. This is always the case when the vacancy changes
sublattice, contrary to the case where the vacancy remains within one
sublattice (the sublattice of majority atoms in L1₂ and both planar sublattices
in L1₀) (Oramus et al. 2001; Kozubski et al. 2001). These values are close to
ΔE_M/T_C = 0.23 meV K⁻¹ and 0.18 meV K⁻¹, respectively, obtained experimentally
from the phonon measurements via the Schober model considering the total phonon
DOS and an average atom of the L1₀-FePd and the L1₂-CoPt₃.


4.7 
Outlook 


Elastic and dynamic properties are still studied in many “modern” materials. The 
electron-phonon couplings have appeared as of prime importance in supercon- 
ducting materials, for instance. 

Interesting examples are also the magnetic martensitic materials which exhibit, 
beyond the conventional thermal shape-memory effect, a shape recovery under 
applied magnetic field and large magnetic-field-induced strains. They are known 
as magnetic shape-memory materials. In these magnetic materials, the motion 
of the twin boundaries of the different martensite variants costs less energy than 
the reorientation of magnetic moments within a magnetic domain, inducing a 
change in the shape depending on the direction of the applied magnetic field 
(Suorsa et al. 2004). This suggests the existence of a strong magnetoelastic
coupling in these materials. This coupling has been found to be responsible for
the appearance of a premartensitic phase in Ni₂MnGa (Planes et al. 1997). A
better understanding of this coupling along with the investigation of other
candidate systems is one of the challenging tasks in this field for the coming
years. Lattice dynamics investigations under applied magnetic field and/or
mechanical stresses on these systems are a powerful means of investigating the
interatomic potential and the interplay between the magnetic order and the
normal modes of vibration; they are a necessary step for any extensive applied
research on these systems.

Along with these developments, a new technique, based on time-resolved pho- 
non measurements, has been developed recently and used successfully for the 
study of the kinetics of unmixing in silver alkali halides (Eckold et al. 2004).
It can be extended to investigate the kinetics of the phonon-driven structural 
phase transitions, offering a unique tool for examining the phonon-time depen- 
dence across the transition. 

The study of elastic and dynamic properties of materials has moreover encountered
renewed interest with the development of thin films and self-organized systems.
As a matter of fact, the relaxation of strains is one of the most important
phenomena which has to be taken into account to understand the different epi- 
taxial growth modes and the self-organization. 

When the lattice mismatch between a substrate and an epitaxial film cannot be 
neglected, the elastic energy accumulated in both the strained film and the sub- 
strate is one of the driving forces for the film relaxation, in addition to the surface 
energies and the cohesive energies. If the substrate is flat and perfect enough 
(with a single crystal buffer layer, for example), up to a certain thickness disloca- 
tions are not generated as long as the accumulated elastic energy is not greater 
than the nucleation energy of a dislocation (Pan et al. 1995). This activation en- 
ergy is large because a supplementary atomic plane has to be introduced, starting 
from the surface that is the only possible source. In the case of intermetallic or- 
dered films, the antiphase boundary energy is large and prevents the propagation 
of perfect dislocations in ordered alloys. In this case, other processes appear to 
relax the system: stacking faults and microtwins form preferentially to relax a 
strained layer (Halley et al. 2002, 2004), or exchanges between substrate and layer 
atoms can occur (Goyhenex et al. 1999). 

For the semiconductor optoelectronic industry, the optical properties of the sys- 
tems are strongly deteriorated by any defect. Compliant substrates have therefore 
been developed to grow zero defect — and zero strain — layers. These substrates 
are made of two atomically flat GaAs substrates that are twist-bonded so that a 
bicrystal is formed (Patriarche and Le Bourhis 2000; Le Bourhis and Patriarche 
2005). One of the substrates is then thinned down to a nanometer-sized thick- 
ness. When the angle between the respective lattices of the two crystals is equal 
to 45°, the bonding between the two crystals is very soft. An epitaxial layer on the 
substrate will be pseudomorphic: epitaxial dislocations will be located at the grain 
boundary of the bicrystal only. So far, substrates of this type have been developed
for optoelectronics, but with the increasing miniaturization of the magnetic dots
necessary for high-density magnetic storage they may also be used in intermetallic
systems in the future, as it will then be necessary to avoid defects.

A very surprising result of these studies is that the macroscopic laws of elastic- 
ity (as described in this chapter) are valid down to a few atomic distances (typi- 
cally less than 1 nm) of the interfaces or surfaces. For example, a combination of 
TEM and X-ray diffraction allowed Kegel et al. (2001) to determine shape, strain 
fields, and interdiffusion in semiconductor quantum dots grown in the Stranski- 
Krastanov mode. In metallic systems, Muller et al. (2001) have shown that the os- 
cillations of in-plane lattice spacing observed during two-dimensional (2D) homo- 
and heteroepitaxial growth can be interpreted well by using the classical elasticity 
theory and the bulk elastic constants to formulate the problem of the elastic relax- 
ation of a coherent 2D epitaxial deposit. They have predicted a dependence of 
the amplitude of in-plane lattice spacing oscillation on the nucleation density in 
good agreement with experimental observations in several intermetallic systems: 
V/Fe(0 0 1), Mn/Fe(0 0 1), Ni/Fe(0 0 1), Co/Cu(0 0 1), and V/V(0 0 1). 




Similarly, deformations around a dislocation core (several tens of nanometers) 
are well described by the macroscopic laws of elasticity down to about 3 nm from 
the dislocation line. Until now, these studies have also been made mainly on 
semiconductors, because defects have a direct influence on the band structure 
properties of these systems. Moreover, the high-resolution images necessary to
compare calculations with experiments are easier to obtain in these systems
(Snoeck et al. 1998). Ab initio calculations give a good description of the
dislocation core structure (Pizzagalli et al. 2003), whereas the finite element
method yields the long-range strains (Kret et al. 1999). New static theories of
gradient elasticity, nonlocal elasticity, gradient micropolar elasticity, and
nonlocal micropolar elasticity have been developed to describe the dislocation
as a whole (Lazar et al. 2005).

Elasticity measurements have been performed by Faurie et al. (2004) on W and 
Au thin films. They found the Young’s modulus to be slightly smaller in W and 
larger in Au compared to bulk materials, probably due to the many defects 
and particular texture (isotropic in W and columnar texture in Au), whereas the 
Poisson ratios were in both cases very similar to the bulk values. Another
important property of films is the thermal expansion coefficient, because a large
difference from that of the substrate will induce large strains and defects during
temperature changes in operation.

Recently, elastic strains have been used extensively to grow nanostructured sur- 
faces in a self-organized manner. For example, the surface of new compliant sub- 
strates, with an angle between the respective lattices of the two crystals lower than 
15°, contains a dense network of pure screw dislocations and presents a modula- 
tion of strains on its surface (Patriarche and Le Bourhis 2002). The growth of 
a film on it will favor either the compressive or the extended regions, depending 
on the relative lattice constants of the substrate and the film. A regular mesolat- 
tice (a lattice of dots with a periodicity of several nanometers) will thus be ob- 
tained. 

A similar phenomenon can appear in the case of a reconstructed surface of the 
substrate. For example, the gold (111) surface relaxes by forming a herringbone 
structure with alternating fcc and hcp stacking within the last atomic layers
(Padovani et al. 1999; Fruchart et al. 1999; Bulou et al. 2004). Compressed and
dilated regions appear at the kinks of this herringbone pattern, and Co or Fe
grows preferentially there. Fruchart et al. have grown Co columns by exploiting
the preferential growth of Au on unstrained Au and of Co on strained Au. The
strain propagates through only a few atomic layers, but columns several
nanometers long can be obtained if Au and Co are deposited alternately in the
right proportions. Equivalent phenomena are at the origin of the vertical
alignment of quantum dots in InAs/GaAs systems (Xie et al. 1995). When some
InAs dots are grown and covered by a film of GaAs as thick as 40 monolayers, the 
strain propagates up to the surface and favors the nucleation of the next dot on 
top of the previous one. Samples containing such coupled InGaAs/GaAs columns 
of dots are suitable for normal incidence quantum dot infrared photodetectors 
(Adawi et al. 2003). In other systems such as GaN/InGaN, blue high-brightness 
light-emitting diodes can be developed (Lee et al. 2004). 

The strains and bond changes around the steps on the surfaces are also used to
self-organize subatomic films. On the surface of a Au single crystal cut along a
(7 8 8) plane, a new periodicity appears along the [2 1 1] direction: the terrace
width is 3.9 nm due to the miscut angle. The surface is thus spontaneously
patterned in two dimensions at a nanometer scale with a macroscopic coherence
length. Not all the sites around a terrace edge are equivalent for adatoms, so
self-organized growth can be obtained. In this way, using a Au(7 8 8) vicinal
surface, long-range-ordered cobalt nanodots could be grown on a substrate cooled
down to 130 K (Repain et al. 2002).

On the other hand, phonons are studied extensively at present: to understand 
in detail the behavior of superconducting alloys the coupling of phonons with 
electrons and plasmons must be investigated (see Chapter 14b). The surface re- 
construction can also be sensitive to phonons: Meunier et al. (2002) have shown 
that in the Ag/Cu (111) system there are two competing superstructures, the
Moiré and the triangular structures, with a strong effect of temperature.
Calculating the local phonon density of states by means of the recursion method
gave evidence for a “strong correlation between local pressure and local
vibrational entropy.” The phase transition from a triangular (10 × 10) structure
toward a Moiré (9 × 9) structure when the temperature increases is due to “a
subtle balance between vibrational entropy and anharmonic effects in the internal
energy.”

Phonon excitations within crystals and at their surfaces have also been investi- 
gated by X-ray, neutron, and light diffraction methods, because of possible appli- 
cations of surface acoustic waves as tunable monochromators for X-rays or neu- 
trons (Sauer et al. 1999). 

It can be expected that due to the increasing precision of electronic structure 
calculations, in the future lattice vibrations will be taken into account more and 
more. The development of nanostructures will continue and implies an increas- 
ing importance of the elastic and phonon contributions to the total energy of 
these nanometer systems. 


5
Point Defects, Atom Jumps, and Diffusion 


Wolfgang Püschl, Hiroshi Numakura, and Wolfgang Pfeiler 


5.1 
Point Defects 


All crystalline solids contain structural imperfections (cf. Chapter 3), and those of 
atomic scale are called point defects or atomic defects. Defects in solids play es- 
sential roles in many properties of materials, through their characteristic produc- 
tion of local distortion and mobility. The effects are sometimes undesirable but 
very often beneficial, contrary to what the name “defect” suggests: for example, 
the atomic transport (diffusion and ionic conduction) in solids occurs by move- 
ments of point defects, and mechanical strength (hardness, flow stress, and creep 
resistance) is strongly influenced by their presence. In this section, after outlining 
the fundamentals, we discuss the equilibrium properties of point defects in me- 
tallic crystals: pure metals, dilute alloys, and ordered alloys. We aim to form the 
basis for a quantitative understanding of diffusion processes from the micro- 
scopic viewpoint. Since atomic diffusion in crystalline solids is governed by the 
concentrations and the mobilities of mediating defects, detailed knowledge on 
the properties of point defects is essential. The elementary processes and phe- 
nomenological aspects of diffusion are described in Sections 5.2-5.4. 


5.1.1 
A Brief Overview 


5.1.1.1 Types of Point Defects 
Figure 5.1 shows atomic defects commonly found in elemental crystals. Atomic 
vacancies (vacant lattice sites) and self-interstitial atoms are intrinsic defects, and a 
pair of them is called a Frenkel defect. Foreign atoms, i.e., substitutional and inter- 
stitial solute atoms, constitute extrinsic defects. Complex defects may be formed 
by association of these defects, such as pairs of vacancies (divacancies), pairs of a 
vacancy and a solute atom, triplets of vacancies or larger clusters, and so on. 
Intrinsic atomic defects in an ordered AB alloy are illustrated in Fig. 5.2. Atoms
at regular lattice sites are indicated as $A_\alpha$ and $B_\beta$, i.e., A atoms on α sublattice sites


Alloy Physics: A Comprehensive Reference. Edited by Wolfgang Pfeiler 
Copyright © 2007 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim 
ISBN: 978-3-527-31321-1 



Fig. 5.1 Atomic defects in an elemental crystal. a: atomic vacancy 
(customarily indicated by a square), b: self-interstitial atom, c: 
substitutional solute atom, d: interstitial solute atom, e: vacancy— 
vacancy pair (divacancy), f: vacancy-interstitial solute atom pair, g: 
vacancy-substitutional solute atom pair. If ‘a’ and ‘b’ are viewed as a 
pair, it is called a Frenkel defect. 

Fig. 5.2 Atomic defects in an ordered AB alloy, in which A atoms
(shaded circles) and B atoms (open circles) are arranged on two
sublattices α and β. $A_\alpha$ and $B_\beta$: atoms at regular positions. a: vacancy
$V_\alpha$ (vacant α site), b: vacancy $V_\beta$ (vacant β site), c: self-interstitial atom
$I_A$, d: self-interstitial atom $I_B$, e: antisite atom $B_\alpha$, f: antisite atom $A_\beta$.

and B atoms on β sublattice sites, respectively. Vacancies in each sublattice and
self-interstitial atoms of each atom species may occur, with the relative ease of
formation depending on the material. The intrinsic defect species characteristic
of ordered alloys are antisite atoms (also called antistructure atoms or
substitutional defects), which are atoms that occupy sublattice sites for the
other atom species. An increase in their number brings about disordering of the
atomic arrangement, leading eventually to disruption of the ordered structure,
i.e., an order-disorder transition.

These defects in ordered alloys cannot be present in arbitrary numbers, since 
the numbers of the sublattice sites must be maintained at a ratio specific to the 
structure, such as 1:1 or 3:1. Because of this constraint, vacancies often tend to be
formed at a fixed proportion, e.g., in pairs in AB alloys, or in a set of four
vacancies in A₃B alloys, with three A vacancies, $V_\alpha$, and one B vacancy, $V_\beta$. Such a set of
vacancies is referred to as a Schottky defect. Complexes of these defects are also
possible. Note, however, that a Schottky defect does not necessarily indicate that 
the vacancies are closely associated in space, and neither does a Frenkel defect; 
they are concepts in statistical thermodynamics. 


5.1.1.2 Formation of Equilibrium and Nonequilibrium Defects 

Point defects exist in thermal equilibrium owing to the significant increase in the 
configurational entropy associated with their presence. This makes a strong con- 
trast to such extended defects as dislocations and grain boundaries, which are es- 
sentially unstable but remain in the crystal because of the lack of sufficient mobil- 
ity. Here we shall briefly outline the formation of point defects and introduce 
fundamental parameters characterizing their thermodynamic properties.

Fig. 5.3 Formation of a vacancy in an elemental crystal. An atom in the
interior (a) is removed and attached to the external surface (b).

Figure 5.3 illustrates schematically the formation of a vacancy in an elemental
crystal: an atom is removed from the interior and then attached to the external
surface. In terms of the simple pair-interaction model of cohesion, on removal
of the atom the internal energy E of the crystal is raised by the bond energy times
the coordination number, and roughly one half of this energy is recovered if the
atom is then attached at a favorable position, e.g., at a ledge of a surface step. At
the same time, the volume of the crystal is increased by about one atomic volume,
increasing slightly the enthalpy of the crystal. On the other hand, the
configurational entropy of the vacancies lowers the Gibbs free energy
$G = E + PV - TS$ at any temperature $T > 0$. The configurational entropy accounts
for the number of possible ways of placing the vacancies in the crystal, and for a
crystal consisting of N atoms and n vacancies it is given by


$$S = k_\mathrm{B} \ln \frac{(N+n)!}{N!\,n!} \qquad (1)$$


where $k_\mathrm{B}$ is the Boltzmann constant. Assuming that the vacancy concentration is
low enough for their interactions to be neglected, the change in the Gibbs energy
due to the formation of vacancies is expressed as


$$\Delta G(N, n) = n\,\Delta g_\mathrm{f} - TS \qquad (2)$$


where $\Delta g_\mathrm{f}$ is the change in the free energy per vacancy, excluding the
configurational entropy term. The equilibrium concentration can be derived readily
by substituting Eq. (1) into Eq. (2) and by minimizing it with respect to n:


$$c = \frac{n}{N+n} = \exp\left(-\frac{\Delta g_\mathrm{f}}{k_\mathrm{B} T}\right) \qquad (3)$$


Here, the Stirling approximation $\ln x! \approx x \ln x - x$ ($x \gg 1$) is used for both N
and n. The principle is the same for interstitial atoms and other atomic defects,
and thus their equilibrium concentrations are given by similar formulae.
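
The minimization described above can be checked numerically without the Stirling approximation. The following is a minimal sketch, assuming a crystal of 10^8 atoms, a formation free energy of 0.5 eV, and a temperature of 900 K (all three are illustrative values, not data from this chapter); the function names are hypothetical. It uses the fact that the difference of the configurational entropy between n + 1 and n vacancies reduces exactly to k_B ln[(N + n + 1)/(n + 1)].

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def forward_difference(N, n, dg_f_ev, T):
    """Delta G(N, n+1) - Delta G(N, n) in eV, from Eqs. (1) and (2).

    The factorials cancel exactly, so the entropy difference is
    k_B * ln[(N+n+1)/(n+1)] and no Stirling approximation is needed.
    """
    return dg_f_ev - K_B_EV * T * math.log((N + n + 1) / (n + 1))


def equilibrium_vacancy_number(N, dg_f_ev, T):
    """Smallest n at which adding one more vacancy no longer lowers Delta G.

    The forward difference increases monotonically with n, so bisection works.
    """
    lo, hi = 0, N
    while lo < hi:
        mid = (lo + hi) // 2
        if forward_difference(N, mid, dg_f_ev, T) < 0.0:
            lo = mid + 1
        else:
            hi = mid
    return lo


def closed_form_concentration(dg_f_ev, T):
    """Equilibrium site fraction n/(N+n) = exp(-dg_f / k_B T)."""
    return math.exp(-dg_f_ev / (K_B_EV * T))


if __name__ == "__main__":
    N, dg_f, T = 10**8, 0.5, 900.0  # illustrative assumptions
    n_eq = equilibrium_vacancy_number(N, dg_f, T)
    print(n_eq / (N + n_eq), closed_form_concentration(dg_f, T))
```

Under these assumptions the discrete minimum and the closed-form expression should agree to within the granularity of one vacancy, which is why the sketch compares site fractions rather than absolute numbers.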

The free energy of formation $\Delta g_\mathrm{f}$ can be expressed by the parameters that
describe the characteristics of the defect in more detail:


$$\Delta g_\mathrm{f} = \Delta h_\mathrm{f} - T\Delta s_\mathrm{f} = \Delta e_\mathrm{f} + P\Delta v_\mathrm{f} - T\Delta s_\mathrm{f} \qquad (4)$$


Here, $\Delta h_\mathrm{f}$, $\Delta s_\mathrm{f}$, $\Delta e_\mathrm{f}$, and $\Delta v_\mathrm{f}$ are the enthalpy, entropy, energy, and volume of
formation, respectively, and P is pressure. As noted earlier, $\Delta s_\mathrm{f}$ is the contribution
(per defect) to the change in the entropy of the crystal excluding the
configurational entropy, and is referred to as the nonconfigurational entropy of
formation. Since it is due largely to the influence on the lattice vibration, it is
often called the vibrational entropy of the defect. The formation volume of a point
defect is of the order of the atomic volume, Ω, which is typically $10^{-29}$ m³. Under
atmospheric pressure, the $P\Delta v_\mathrm{f}$ term amounts to only $10^{-24}$ J, or $10^{-5}$ eV, and
contributes little to the formation enthalpy. Hence the formation enthalpy $\Delta h_\mathrm{f}$ and
the formation energy $\Delta e_\mathrm{f}$ are virtually equal except at very high pressures.
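
The order-of-magnitude estimate of the pressure-volume term can be reproduced in a few lines. This is a sketch only: the formation volume of 10^-29 m^3 is the typical atomic volume quoted in the text, standard atmospheric pressure is assumed, and the function names and the 0.01 eV threshold are hypothetical choices made for illustration.

```python
ATMOSPHERE_PA = 1.01325e5          # standard atmospheric pressure in Pa
EV_IN_JOULE = 1.602176634e-19      # 1 eV expressed in J


def pv_work(pressure_pa, formation_volume_m3):
    """Return the P * Delta v_f term as (joule, electronvolt)."""
    joule = pressure_pa * formation_volume_m3
    return joule, joule / EV_IN_JOULE


def pressure_for_pv_work(target_ev, formation_volume_m3):
    """Pressure (Pa) at which P * Delta v_f reaches target_ev."""
    return target_ev * EV_IN_JOULE / formation_volume_m3


if __name__ == "__main__":
    joule, ev = pv_work(ATMOSPHERE_PA, 1e-29)
    print(f"P*dv at 1 atm: {joule:.2e} J = {ev:.2e} eV")
    # Pressure needed for the term to reach 0.01 eV (arbitrary threshold)
    print(f"{pressure_for_pv_work(0.01, 1e-29):.2e} Pa")
```

With these inputs the term only becomes comparable to a hundredth of an electronvolt at pressures of the order of 10^8 Pa, which illustrates why the distinction between formation enthalpy and formation energy matters only at very high pressures.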

Complexes of point defects may form by association of individual defects, and
the associated defects are often energetically more favored. The balance between
the associated and separated defects may be described by a chemical reaction
between the defect α and defect β:


$$\alpha + \beta \rightleftharpoons \alpha\beta; \quad \Delta g_{\mathrm{a},\alpha\beta} \qquad (5)$$

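The free energy of association $\Delta g_{\mathrm{a},\alpha\beta}$ is defined as the difference in the formation
energy between the isolated and associated states:


$$\Delta g_{\mathrm{a},\alpha\beta} = \Delta g_{\mathrm{f},\alpha\beta} - (\Delta g_{\mathrm{f},\alpha} + \Delta g_{\mathrm{f},\beta}) \qquad (6)$$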


Association is favored when $\Delta g_{\mathrm{a},\alpha\beta}$ is negative, and vice versa. The equilibrium
concentration of the complexes is expressed in the form


$$c_{\alpha\beta} = A\,c_\alpha c_\beta \exp\left(-\frac{\Delta g_{\mathrm{a},\alpha\beta}}{k_\mathrm{B} T}\right) \qquad (7)$$


where A is a number of the order of 10 depending on the geometrical configuration
of the associated defect. The free energy of binding, $\Delta g_{\mathrm{b},\alpha\beta} = -\Delta g_{\mathrm{a},\alpha\beta}$, is
often used to characterize the preference for the complex. The cases of
divacancies and vacancy-solute atom pairs are discussed in Section 5.1.2.1 in more
detail.
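
As an illustration of the mass-action expression for complexes, the sketch below evaluates the concentration of divacancies for a given monovacancy concentration. The numbers are assumptions chosen only for illustration: a monovacancy site fraction of 10^-4, a binding free energy of 0.2 eV, a temperature of 900 K, and a geometrical factor A = 6; none of them is taken from this chapter, and the function name is hypothetical.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def complex_concentration(c_alpha, c_beta, dg_assoc_ev, T, geometry_factor):
    """c_ab = A * c_a * c_b * exp(-dg_assoc / k_B T).

    dg_assoc_ev is the free energy of association (negative when the
    complex is favored); the binding free energy is its negative.
    """
    boltzmann = math.exp(-dg_assoc_ev / (K_B_EV * T))
    return geometry_factor * c_alpha * c_beta * boltzmann


if __name__ == "__main__":
    c_v = 1.0e-4          # assumed monovacancy site fraction
    binding_ev = 0.2      # assumed binding free energy, eV
    c_2v = complex_concentration(c_v, c_v, -binding_ev, 900.0, 6.0)
    c_random = complex_concentration(c_v, c_v, 0.0, 900.0, 6.0)
    print(c_2v, c_random, c_2v / c_random)
```

Comparing the result with the value for zero association energy shows directly how much an attractive interaction enhances the population of complexes over a purely random arrangement.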

As the reader might be aware, the process of creating a vacancy illustrated in 
Fig. 5.3 is unrealistic; indeed, it is only to explain the change in the internal en- 
ergy on introducing a vacancy in a crystal. In reality, intrinsic point defects — 
vacancies, self-interstitial atoms, and their clusters — are produced at dislocations, 
grain boundaries, and surfaces, which serve as sources and at the same time as 
sinks, and the defects move around in the crystal to establish a desired spatial 
distribution. Intrinsic defects are created and annihilated incessantly at various 
places; the equilibrium concentrations are not quiescent but are maintained dy- 
namically. 

There are artificial means of introducing intrinsic point defects into the crystal, 
other than the thermal, or spontaneous, formation described above. Plastic defor- 
mation and particle irradiation are the most common among the others. Jogs in 
edge dislocations act as sources and sinks of vacancies and self-interstitial atoms 
(Hirth and Lothe 1982; see also Chapter 6) and thus severe plastic deformation 
produces these point defects. Irradiation with energetic particles, such as elec- 
trons, neutrons, and various ions, causes “knock-on” of atoms, i.e., bombarded 
atoms are displaced out of the regular positions, generating vacancies and self- 
interstitial atoms. In metallic crystals, knock-on events occur when the trans- 
ferred kinetic energy exceeds a certain value, ca. 20-30 eV, which is referred to 
as the displacement threshold energy (Gibson et al. 1960; Lucasson and Walker 
1962; Erginsoy et al. 1964; Vajda 1977). Accelerated electrons having an energy 
of 100 keV to 1 MeV deliver roughly this amount of kinetic energy to the atoms 
of the target material and introduce pairs of vacancies and self-interstitial atoms. 
This is the technique that has widely been used in experimental studies of Fren- 
kel defects in metals (Meechan and Brinkman 1954; Corbett et al. 1957; Lucasson 
and Walker 1962). In contrast, heavier particles with similar kinetic energies
produce much more significant disorder in the crystal, because the energy
transferred to the primary target atoms is much higher than in the case of electrons;
the atoms displaced in the first event (the primary knock-on atoms) bring about 
multiple knock-on events (Marx et al. 1952; Blewitt et al. 1957; Thomas et al. 
1969). This leads to serious deterioration of the material, and the effect, known 
as radiation damage, has been one of the key issues in nuclear reactor technology 
(Thompson 1969; Odette and Lucas 2001). 
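
The statement that electrons of 100 keV to 1 MeV transfer energies comparable to the displacement threshold can be made concrete with the standard relativistic expression for the maximum energy transferred in a head-on elastic collision between an electron of kinetic energy E and a nucleus of mass M much heavier than the electron, T_max = 2E(E + 2 m_e c²)/(M c²). This formula is not given in the text above and is introduced here as an assumption of the sketch; the choice of Al, Cu, and Au as examples and the function name are likewise illustrative.

```python
ME_C2_EV = 0.51099895e6      # electron rest energy in eV
AMU_C2_EV = 931.49410242e6   # atomic mass unit times c^2 in eV


def max_transferred_energy_ev(electron_kinetic_ev, target_mass_u):
    """Maximum energy (eV) transferred to a nucleus in a head-on collision.

    Valid when the target mass is much larger than the electron mass.
    """
    numerator = 2.0 * electron_kinetic_ev * (electron_kinetic_ev + 2.0 * ME_C2_EV)
    return numerator / (target_mass_u * AMU_C2_EV)


if __name__ == "__main__":
    # (element, atomic mass in u, electron energy in eV): illustrative choices
    for name, mass_u, energy_ev in [("Al", 26.98, 1.0e5),
                                    ("Cu", 63.546, 5.0e5),
                                    ("Au", 196.97, 1.0e6)]:
        t_max = max_transferred_energy_ev(energy_ev, mass_u)
        print(f"{name}: E = {energy_ev:.0e} eV -> T_max = {t_max:.1f} eV")
```

Because the transferred energy scales inversely with the target mass, heavier elements require higher electron energies to reach the same threshold, which is consistent with the energy range quoted in the text.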


5.1.1.3 Mobility 

Point defects migrate in the crystal by successive translational movements be- 
tween equivalent positions. The elementary process is a jump to a neighboring 
site, passing through an intermediate high-energy state, which is accompanied 
by local distortion. Figure 5.4 illustrates the atomic configurations in the course 
of a jump of an interstitial solute atom and the variations of the free energy. As 
will be discussed in detail in Section 5.2.1, the jump over the energy barrier usu- 
ally occurs by thermal activation, and the rate, or frequency, of the jump obeys the 
Arrhenius law, 


$$w = \nu \exp\left(-\frac{\Delta g_\mathrm{m}}{k_\mathrm{B} T}\right) \qquad (8)$$


Here, ν is the attempt frequency, which is of the order of the Debye frequency
($10^{12}$-$10^{13}$ s⁻¹). The free energy of migration, $\Delta g_\mathrm{m}$, can be expressed using the
enthalpy, entropy, energy, and volume of migration, similarly to the case of the 
free energy of formation, Eq. (4). 
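
The strong temperature dependence implied by the Arrhenius law is easy to see numerically. The sketch below is only an illustration: the attempt frequency of 10^13 s^-1 and the migration free energy of 0.6 eV are assumed values, not data from this chapter, and the function names are hypothetical.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def jump_frequency(attempt_frequency_hz, dg_m_ev, T):
    """Thermally activated jump rate w = nu * exp(-dg_m / k_B T), in 1/s."""
    return attempt_frequency_hz * math.exp(-dg_m_ev / (K_B_EV * T))


def mean_residence_time(attempt_frequency_hz, dg_m_ev, T):
    """Average time a defect spends on a site between jumps, in s."""
    return 1.0 / jump_frequency(attempt_frequency_hz, dg_m_ev, T)


if __name__ == "__main__":
    nu, dg_m = 1.0e13, 0.6  # assumed attempt frequency (1/s) and barrier (eV)
    for T in (300.0, 450.0, 600.0):
        w = jump_frequency(nu, dg_m, T)
        print(f"T = {T:5.0f} K: w = {w:.3e} 1/s, "
              f"residence time = {mean_residence_time(nu, dg_m, T):.3e} s")
```

Doubling the absolute temperature changes the jump rate by many orders of magnitude for a barrier of this size, which is why annealing experiments at different temperatures resolve the migration enthalpy so sensitively.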

Fig. 5.4 (a) Motion of an interstitial solute atom to a neighboring site.
(b) The associated variation of the free energy of the crystal.

The jump frequency w and the free energy of migration $\Delta g_\mathrm{m}$ of a particular
defect species can be determined by measuring its diffusion coefficient D as a
function of temperature, as D is proportional to w. One of the common techniques is
to analyze the kinetics of annihilation or precipitation of the defects. The decay 
of excess defects during annealing may be monitored by, e.g., measuring the elec- 
trical resistivity, and from the temperature dependence of the kinetics the activa- 
tion energy of the rate-controlling process can be evaluated (Bauerle and Koehler 
1957; Koehler et al. 1957; Corbett et al. 1959a,b). The next section describes the 
method in more detail. 


5.1.1.4 Experimental Techniques 


Differential dilatometry The absolute concentration of vacancies can be deter- 
mined by measuring simultaneously macroscopic and microscopic thermal ex- 
pansion (Feder and Nowick 1958). The macroscopic volume of an elemental crys- 
tal consisting of N atoms and n vacancies at temperature T is expressed as 


$$V(T) = N\,\Omega^0(T) + n\,\Delta v_{\mathrm{f,V}}(T) \qquad (9)$$


where $\Omega^0(T)$ is the atomic volume in a hypothetical defect-free crystal, and $\Delta v_{\mathrm{f,V}}$
is the formation volume. The actual atomic volume, $\Omega(T)$, which can be
determined by measuring the lattice parameter, reflects the relaxation around each
vacancy and is given by the fractional average (Eshelby 1956):


$$\Omega(T) = \frac{N}{N+n}\,\Omega^0(T) + \frac{n}{N+n}\,\Delta v_{\mathrm{f,V}}(T) \qquad (10)$$
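
Owing to this formula, the volume of the crystal can be written simply as

$$V(T) = (N + n)\,\Omega(T) \qquad (11)$$

Let us consider the difference in volume between temperatures $T_0$ and T. Denoting
$V(T_0)$, $\Omega(T_0)$, and $n(T_0)$ as $V_0$, $\Omega_0$, and $n_0$, respectively, the relative difference
is expressed as


$$\frac{V(T) - V_0}{V_0} = \frac{(N+n)\,\Omega(T) - (N+n_0)\,\Omega_0}{(N+n_0)\,\Omega_0} \approx \frac{\Omega(T) - \Omega_0}{\Omega_0} + \frac{n}{N} - \frac{n_0}{N} \qquad (12)$$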

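Here, all three terms on the right-hand side are assumed to be small, and
second-order terms are omitted. If, moreover, $n_0/N$ can be neglected, we arrive at


$$\frac{\Delta V}{V_0} - \frac{\Delta\Omega}{\Omega_0} = \frac{n}{N} = \frac{c_\mathrm{V}}{1 - c_\mathrm{V}} \approx c_\mathrm{V} \qquad (13)$$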


Fig. 5.5 Macroscopic and microscopic thermal expansion of pure
aluminum (a), and an Arrhenius plot of the vacancy concentration,
$c_\mathrm{V} = 3(\Delta L/L_0 - \Delta a/a_0)$ (b). After Simmons and Balluffi (1960).


where the notations $\Delta V = V(T) - V_0$ and $\Delta\Omega = \Omega(T) - \Omega_0$ are introduced. For
crystals of cubic structure, the relative increase in the macroscopic volume $\Delta V/V_0$
and in the atomic volume $\Delta\Omega/\Omega_0$ can be determined by measuring simultaneously
the linear dimension of the sample, L, and the lattice parameter, a, respectively.
From the two sets of data, the absolute concentration of vacancies is obtained
from Eq. (14).


$$c_\mathrm{V} = 3\left(\frac{\Delta L}{L_0} - \frac{\Delta a}{a_0}\right) \qquad (14)$$


Note that the magnitude of the formation volume $\Delta v_{\mathrm{f,V}}$ need not be known.
Figure 5.5 shows the experimental results for pure aluminum by Simmons and
Balluffi (1960). The deviation of $\Delta L/L_0$ from $\Delta a/a_0$ at the temperatures close to the
melting temperature (660 °C) demonstrates the production of an excess volume
due to vacancies. Figure 5.5(b) is an Arrhenius plot of the vacancy concentration
calculated by Eq. (14), from which the formation enthalpy $\Delta h_{\mathrm{f,V}}$ and the
formation entropy $\Delta s_{\mathrm{f,V}}$ were evaluated (0.76 eV and 2.4 $k_\mathrm{B}$, respectively).
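
The evaluation of the formation enthalpy and entropy from an Arrhenius plot can be sketched as a straight-line fit of ln c_V against 1/T, assuming that the concentration has the form c_V = exp(Δs/k_B) exp(−Δh/k_B T). The example below generates synthetic concentrations from the values quoted above for aluminum (0.76 eV and 2.4 k_B) and recovers them; the temperatures and the dilatometric input numbers are hypothetical, and the code is not the procedure used by the original authors.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def vacancy_concentration(dl_over_l0, da_over_a0):
    """Eq. (14): c_V = 3 (dL/L0 - da/a0) for a cubic crystal."""
    return 3.0 * (dl_over_l0 - da_over_a0)


def arrhenius_fit(temperatures_k, concentrations):
    """Least-squares line through (1/T, ln c).

    Returns (formation enthalpy in eV, formation entropy in units of k_B).
    """
    xs = [1.0 / t for t in temperatures_k]
    ys = [math.log(c) for c in concentrations]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return -slope * K_B_EV, intercept


if __name__ == "__main__":
    # Hypothetical dilatometric readings, for illustration only
    print(vacancy_concentration(1.80e-2, 1.77e-2))
    temps = [700.0, 750.0, 800.0, 850.0, 900.0]  # K, assumed
    synthetic = [math.exp(2.4 - 0.76 / (K_B_EV * t)) for t in temps]
    print(arrhenius_fit(temps, synthetic))
```

With real data the scatter of the points would determine the uncertainty of both parameters; the intercept, and hence the entropy, is usually the less well-defined of the two because it requires extrapolation to 1/T = 0.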

When self-interstitial atoms are considered together with vacancies, Eq. (14) is
modified to Eq. (15),


$$\frac{\Delta V}{V_0} - \frac{\Delta\Omega}{\Omega_0} = 3\left(\frac{\Delta L}{L_0} - \frac{\Delta a}{a_0}\right) = c_\mathrm{V} - c_\mathrm{I} \qquad (15)$$


where $c_\mathrm{I}$ is the concentration of self-interstitial atoms. If self-interstitial atoms
dominated over vacancies, the macroscopic expansion would negatively deviate from
the microscopic expansion. Experimental results on pure metals indicate that
this is not the case; the principal defect species is always vacancies (Siegel 1978).
As mentioned in Section 5.1.2, the formation enthalpy of a self-interstitial atom
is much higher than that of a vacancy; the equilibrium concentration of
self-interstitial atoms in metals is never high enough to be detectable.

Fig. 5.6 Transient increase in volume due to vacancy production (lower
curve) at temperature $T_2$, after an abrupt rise in temperature from $T_1$ to
$T_2$ (upper curve).

The fact that point defects increase or decrease the volume of the material in 
proportion to their number allows us to determine the migration energy by
measuring the transient volume change after a rapid change in temperature. As
illustrated in Fig. 5.6, if a sample is first equilibrated at temperature $T_1$ and is then
heated abruptly to $T_2$ and kept there, the volume of the sample changes first
without significant delay by thermal expansion of the lattice, and then it gradually
increases with time. This evolution originates from equilibration of the
concentration(s) of intrinsic point defects at the new temperature. On the basis of
Eq. (9), the variation of the volume with the holding time t is expressed as


$$\Delta V(T_1, T_2; t) = [n(T_2, t) - n(T_1)]\,\Delta v_\mathrm{f}(T_2) \qquad (16)$$


where n is the number of the defect species in question and $\Delta v_\mathrm{f}$ is its formation
volume. The number of the defects at t = 0 is $n(T_1)$, which is supposed to be the
equilibrium value at $T_1$. While being kept at temperature $T_2$, point defects are
either generated from sources or annihilated at sinks and eventually attain an
equilibrium spatial distribution through diffusion. The approach to equilibrium is
controlled by the migration of the defects, and thus the kinetics of the volume
change, i.e., the rate constant, is characterized by the migration enthalpy $\Delta h_\mathrm{m}$.
This method was first attempted for vacancies in CoGa by van Ommen and de
Miranda (1981), who measured the length change by mechanical probing. Schaefer
and coworkers improved the method by utilizing a laser interferometer and
evaluated the migration enthalpy of vacancies in FeAl and NiAl (Schaefer et al.
1999). The volume change due to the difference in the defect concentration over
a limited temperature range is generally small and is difficult to detect.
Nevertheless, these experiments turned out successfully owing to the unusually
high vacancy concentrations in the materials studied. (See Section 5.1.3.4.)


Electrical resistometry The presence of point defects can be detected sensitively 
by measuring electrical resistivity at low temperatures, as they contribute to the 
residual resistivity in proportion to their concentration (Lucasson and Walker 
1962). Relative changes in the concentration can be monitored accurately, where- 
as the absolute value is difficult to determine unless the magnitude of the specific 
contribution is known by other means. The measurements are to be made at low 
temperatures to eliminate the effect of thermal vibration, for the changes in resis- 
tivity due to the defects are usually small. 

The formation enthalpy of vacancies in various metals has been determined by 
measuring the electrical resistivity of samples quenched from high temperatures 
(Bauerle and Koehler 1957). A sample equilibrated at temperature $T_\mathrm{q}$ is rapidly
cooled to a temperature that is low enough to prevent any motion of vacancies,
and the resistivity ρ is measured at a cryogenic temperature $T_\mathrm{r}$. The procedure is
repeated for several quenching temperatures, and the values of the excess
resistivity $\Delta\rho(T_\mathrm{q}; T_\mathrm{r}) = \rho(T_\mathrm{q}; T_\mathrm{r}) - \rho_0(T_\mathrm{r})$ due to retained vacancies are plotted
on a logarithmic scale against the reciprocal of $T_\mathrm{q}$. Here, $\rho_0(T_\mathrm{r})$ is the resistivity
of a reference sample that is free from excess vacancies. The plot would fall on a
straight line, similarly to Fig. 5.5(b), whose slope gives the enthalpy of formation,
$\Delta h_{\mathrm{f,V}}$. Many of the experimental data for $\Delta h_{\mathrm{f,V}}$ in metals are due to this method
(Siegel 1978; Ullmaier 1991; Wollenberger 1996).

The migration enthalpy of point defects can also be determined by resistivity 
measurements (Bauerle and Koehler 1957; Koehler et al. 1957; Balluffi 1978; 
Young 1978). After freezing-in the defects formed at a high temperature by 
quenching, the resistivity of the sample is measured as a function of time during 
the course of isothermal or isochronal annealing at an intermediate temperature 
where the defects move at a measurable rate. The supersaturated defects tend to
annihilate at sinks or precipitate to form clusters, and the resulting reduction in
the number of defects is monitored through the decay of the resistivity. Since the
kinetics of the decay is controlled by the migration of the defects, the enthalpy of
migration can be determined from the values of the rate constant at different
temperatures. For example, if the defects annihilate at unsaturable sinks and the
rate is controlled by diffusion of individual defects, the kinetics of annihilation is
described by Eq. (17),


$$\frac{\mathrm{d}\,\Delta c}{\mathrm{d}t} = -K\,\Delta c \qquad (17)$$

where Δc is the excess concentration of the defects. The rate constant K is
proportional to the diffusion coefficient of the defect and the density of the sinks.
The decay of the excess concentration thus follows a simple exponential time law
[Eq. (18)],


$$\Delta c(t) \propto \exp(-Kt) \qquad (18)$$


which can be traced by measuring the resistivity. As K is proportional to the dif- 
fusion coefficient and therefore to the jump frequency, one can determine the mi- 
gration enthalpy from the temperature dependence of the decay rate (Bauerle and 
Koehler 1957). 
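
The two-step analysis described above (a decay constant K from each isothermal anneal, then the migration enthalpy from the temperature dependence of K) can be sketched as follows. This is only an illustration on synthetic data: the function names, the prefactor and the value of 0.70 eV are assumptions made for the example, not values taken from the cited experiments.

```python
import math

K_B_EV = 8.617333e-5  # Boltzmann constant in eV/K


def linear_slope(xs, ys):
    """Least-squares slope of ys against xs."""
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    return sxy / sxx


def decay_rate(times, excess_resistivity):
    """Rate constant K of Eq. (18): slope of ln(excess resistivity) versus time."""
    return -linear_slope(times, [math.log(r) for r in excess_resistivity])


def migration_enthalpy(temperatures_k, rate_constants):
    """Migration enthalpy in eV from the Arrhenius slope of ln K versus 1/T."""
    inverse_t = [1.0 / t for t in temperatures_k]
    return -linear_slope(inverse_t, [math.log(k) for k in rate_constants]) * K_B_EV


# Synthetic isothermal anneals (all numbers are assumed, for illustration only).
DH_M_ASSUMED = 0.70      # eV, hypothetical migration enthalpy
PREFACTOR = 1.0e10       # 1/s, hypothetical pre-exponential factor of K
temperatures = [300.0, 320.0, 340.0]
rates = []
for temperature in temperatures:
    k_true = PREFACTOR * math.exp(-DH_M_ASSUMED / (K_B_EV * temperature))
    times = [i / (5.0 * k_true) for i in range(10)]
    signal = [math.exp(-k_true * t) for t in times]   # excess resistivity, Eq. (18)
    rates.append(decay_rate(times, signal))

dh_m_fit = migration_enthalpy(temperatures, rates)
print(f"fitted migration enthalpy: {dh_m_fit:.3f} eV")
```

With real data the logarithmic plot is linear only while a single first-order process dominates, so the fit range would have to be chosen accordingly.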

As mentioned earlier, excess point defects can be introduced not only by 
quenching but also by plastic deformation and by particle irradiation. While the 
formation and migration of vacancies in metals can be studied most conveniently 
by quenching experiments, the properties of self-interstitial atoms can be investi- 
gated almost solely by electron irradiation, in which pairs of a vacancy and a self- 
interstitial atom (Frenkel pairs) can be introduced in a controllable manner, by 
adjusting the energy and the fluence of the incident electrons. The mobility of 
the self-interstitial atoms is then studied by measuring the resistivity (Lucasson 
and Walker 1962; Balluffi 1978; Young 1978). In annealing experiments, characteristic
“stages” appear in the resistivity versus annealing time or temperature
curve, which arise from various processes such as annihilation of close vacancy- 
interstitial pairs, clustering of vacancies and of interstitial atoms, annihilation to 
sinks, etc. With proper interpretations and analyses, information on the mobility 
of the relevant defects can be acquired (Koehler et al. 1957; Damask and Dienes 
1963). 


Positron annihilation Positrons injected into a solid are annihilated with electrons
and emit two characteristic γ rays of 511 keV. The lifetime of positrons,
which is of the order of 100 ps in solids, can readily be measured as the time interval
between the injection^1) and the detection of the γ rays. When the solid contains
defects at which electron density is reduced, the lifetime is much extended.
Positrons that encounter such defects are localized there, i.e., trapped, and have
statistically long lives: for the case of monovacancies in metals, the lifetime is
increased from a typical value of 100 ps to 200 ps. By measuring the extended lifetime,
variations of the density of vacancy-type defects as low as 1 ppm can be
monitored. Alternatively one may analyze the energy of the emitted γ rays to obtain
information on how, or where, the positron terminated its life. Since the momentum
of the positron-electron pair is conserved before and after the annihilation,
the energy of the emitted γ ray is shifted by the Doppler effect, which
reflects the energy of the electron. Positrons trapped at a vacancy-type defect suffer
less from the Doppler effect because they are less likely to encounter the high-energy,
inner-shell electrons of an atom. This leads to characteristic sharpening of
the overall photon-energy spectrum, from which the defect concentration may be
evaluated. Measurements of lifetime and of Doppler shift are basic techniques of
positron annihilation spectroscopy (PAS), and have been applied successfully to
investigations of vacancy-type defects in a variety of solid materials: pure metals
(Siegel 1978, 1982; West 1979; Schaefer 1982, 1987), intermetallic compounds
(Schaefer et al. 1999; Sprengel et al. 2002), semiconductors and ceramics (Rempel
et al. 2002). Since experiments under high pressure are not too difficult, PAS has
been conveniently applied to determination of the formation volume of vacancies
(Dickman et al. 1977, 1978; Wolff et al. 1997; Müller et al. 2001). Migration of
vacancies can also be studied by PAS, through the kinetics of equilibration on
abrupt changes in temperature similar to that in Fig. 5.6 (Schaefer and Schmid
1989; Würschum et al. 1995).

1) The most convenient positron source, $^{22}$Na, emits a γ ray of 1.28 MeV when it
undergoes $\beta^+$ decay, signaling the injection.

The lifetime changes according to how many of the injected positrons are annihilated
at defects. If we consider only one defect species, e.g., monovacancies in
an elemental crystal, a simple two-state trapping model shows that the trapping
rate at the defects, $\kappa_1$, is given by Eq. (19) (see, e.g., Siegel 1978):

$$\kappa_1 = \sigma_1 c_1 = I_1\left(\frac{1}{\tau_0} - \frac{1}{\tau_1}\right) \qquad (19)$$

where $\sigma_1$ is the specific trapping rate and $c_1$ is the defect concentration. On the
right-hand side, $I_1$ is the relative intensity of the extended-lifetime component in
the spectrum, and $\tau_0$ and $\tau_1$ are lifetimes of free and trapped positrons, respectively,
all of which can be evaluated from an observed lifetime spectrum. The specific
trapping rate $\sigma_1$ is generally unknown, but if it is only assumed to be independent
of temperature one can determine the formation enthalpy of the defect
from an Arrhenius plot of $\kappa_1$. For materials for which data of absolute defect
concentrations are available from other experiments (most probably by differential
dilatometry), one may compare them with the measured values of $\kappa_1$ to evaluate $\sigma_1$,
which may then be adopted for other materials of the same class, i.e., of similar
electronic structures, to estimate the concentration of the particular defect species
from PAS experiments.
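
As a rough numerical sketch of this procedure, the snippet below evaluates the trapping rate from lifetime-spectrum parameters and extracts a formation enthalpy from an Arrhenius plot. It assumes that Eq. (19) has the two-state form $\kappa_1 = I_1(1/\tau_0 - 1/\tau_1)$ and that the specific trapping rate is independent of temperature; the spectrum parameters and the 1.28 eV used to generate the synthetic data are illustrative assumptions only.

```python
import math

K_B_EV = 8.617333e-5  # Boltzmann constant in eV/K


def trapping_rate(intensity_1, tau_free_ps, tau_trapped_ps):
    """Trapping rate in 1/ps, assuming kappa_1 = I_1 * (1/tau_0 - 1/tau_1)."""
    return intensity_1 * (1.0 / tau_free_ps - 1.0 / tau_trapped_ps)


def arrhenius_enthalpy(temperatures_k, kappas):
    """Enthalpy in eV from the least-squares slope of ln(kappa) versus 1/T."""
    xs = [1.0 / t for t in temperatures_k]
    ys = [math.log(k) for k in kappas]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    slope = (sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
             / sum((x - x_mean) ** 2 for x in xs))
    return -slope * K_B_EV


# Hypothetical spectrum: 30 % long component, lifetimes of 100 ps and 200 ps.
kappa_example = trapping_rate(0.30, 100.0, 200.0)

# Synthetic kappa(T), proportional to an equilibrium vacancy concentration.
DH_F_ASSUMED = 1.28  # eV, assumed formation enthalpy used to generate the data
temperatures = [1000.0, 1100.0, 1200.0, 1300.0]
kappas = [math.exp(-DH_F_ASSUMED / (K_B_EV * t)) for t in temperatures]
dh_f_fit = arrhenius_enthalpy(temperatures, kappas)
print(f"kappa = {kappa_example:.4f} 1/ps, fitted enthalpy = {dh_f_fit:.2f} eV")
```

Because only the slope of the Arrhenius plot is used, the unknown constant $\sigma_1$ drops out of the enthalpy, which is the point made in the text.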


Anelastic relaxation It is easy to envisage from the illustration of Fig. 5.1 that
vacancies and substitutional solute atoms induce isotropic displacements of neighboring
atoms^2) and thus a volume change (shrinkage or dilatation) of the material.
Other defects such as pairs of point defects, some interstitial solute atoms (Figs.
5.13c,d below) and also self-interstitial atoms (Figs. 5.11b,c below) produce, in
addition, anisotropic distortion because of their low-symmetry structure. Such
low-symmetry defects can be in several orientations in the crystal which are
crystallographically equivalent but orientationally distinguishable: for example, a
self-interstitial atom in the “split” configuration shown in Fig. 5.12(b) is of tetragonal
symmetry and thus can be in any one of the three orientations, [100], [010], and
[001]. Usually the defects are equally distributed over all the possible orientations,
but when external stress is applied some of the orientations may become more
favored over others because the external stresses interact differently with defects
in different orientations. This results in an increase in the population of the
favored orientation(s), which in turn produces a macroscopic, anisotropic deformation
of the sample material. This is the phenomenon called “anelastic relaxation,”
which manifests itself as a creep deformation under constant stress, a
stress relaxation under constant strain, delay of the strain from the stress under
dynamic loading, and so on. All of these pertain to dissipation of the elastic energy
due to stress-induced redistribution of the defects, and provide a useful
means of studying the defect concentration and mobility through the magnitude
of the energy dissipation and the rate of the redistribution, respectively (Nowick
and Berry 1972; Weller 2001).

2) The displacements around a vacancy or a substitutional solute atom are not always
isotropic in ordered structures, such as L1$_2$ (Numakura et al. 1999).

The anisotropic distortion of the host crystal is determined by the characteristic
strain, called the “$\lambda$ tensor”, defined as the strain produced by a unit
concentration of the defect in orientation $p$ [Eq. (20)].

$$\lambda_{ij}^{(p)} = \frac{d\varepsilon_{ij}}{dc_p} \qquad (20)$$

While the volume change due to a defect is given by the trace of the $\lambda$ tensor, the
magnitude of the relaxation is determined by the dispersion with respect to the
stress axis. For example, the $\lambda$ tensor of a defect of tetragonal symmetry in a cubic
crystal is expressed as Eq. (21),

$$(\lambda_{ij}) = \begin{pmatrix} \lambda_1 & 0 & 0 \\ 0 & \lambda_2 & 0 \\ 0 & 0 & \lambda_2 \end{pmatrix} \qquad (21)$$

and the magnitude of the relaxation in the elastic compliance is shown to be

$$\delta S' = \frac{2}{3}\,\frac{c\,\Omega}{k_B T}\,|\lambda_1 - \lambda_2|^2 \qquad (22)$$

where $c$ is the total defect concentration ($c = \sum_p c_p$) and $\Omega$ is the atomic volume
of the host crystal. If either the concentration $c$ or the so-called “shape factor”
$|\lambda_1 - \lambda_2|$ is known from another experiment, the other can be evaluated by
measuring the relaxation magnitude.
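
The conversion between relaxation magnitude and defect concentration can be sketched numerically. The snippet assumes that Eq. (22) reads $\delta S' = (2/3)(c\,\Omega/k_B T)\,|\lambda_1 - \lambda_2|^2$ in SI units; the atomic volume, temperature and shape factor below are hypothetical inputs chosen only to exercise the formula.

```python
K_B = 1.380649e-23  # Boltzmann constant in J/K


def relaxation_magnitude(concentration, atomic_volume_m3, temperature_k, shape_factor):
    """delta S' in 1/Pa, assuming the form (2/3) c Omega |l1 - l2|^2 / (k_B T)."""
    return (2.0 / 3.0) * concentration * atomic_volume_m3 * shape_factor ** 2 / (
        K_B * temperature_k)


def concentration_from_relaxation(delta_s_prime, atomic_volume_m3, temperature_k,
                                  shape_factor):
    """Invert the same relation for the defect concentration (atomic fraction)."""
    return 1.5 * delta_s_prime * K_B * temperature_k / (
        atomic_volume_m3 * shape_factor ** 2)


# Hypothetical inputs: 100 ppm of defects, Omega = 1.2e-29 m^3, T = 300 K, |l1 - l2| = 1.
delta_s = relaxation_magnitude(1.0e-4, 1.2e-29, 300.0, 1.0)
c_back = concentration_from_relaxation(delta_s, 1.2e-29, 300.0, 1.0)
print(f"delta S' = {delta_s:.3e} 1/Pa, recovered c = {c_back:.3e}")
```

The inverse temperature dependence means that the same defect population gives a stronger relaxation when it is measured at lower temperature.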

A useful feature is that the intensity of the relaxation depends on the orientation
of the external stress. For example, one can readily see from Fig. 5.12(b) (below)
that a tetragonal defect in a cubic crystal would respond to a tensile stress if
the stress is applied in any of the <100> directions, but no reorientation, and
thus no redistribution, would occur if the stress axis is parallel to the <111> directions,
for the stress interacts identically with the defects of the three orientations.
In general, how a point defect responds to external stress is determined by its
symmetry, and is an important property governing the relaxation behavior. By nature,
defects of tetragonal symmetry in cubic crystals respond only to {110}<110>
shear stress, while those of trigonal symmetry (having a principal axis in
<111>) only to {001}<100> shear stress, producing relaxation in the compliance
$S'$ (= $2(S_{11} - S_{12})$) and $S$ (= $S_{44}$), respectively. Such rules, called “selection
rules,” have been derived from group theory and summarized by Nowick and
Heller (1965). For cubic crystals, the relaxation of the compliance under tensile
stress of an arbitrary orientation is given as Eq. (23).

$$\delta J = \left(\frac{1}{3} - \Gamma\right)\delta S' + \Gamma\,\delta S \qquad (23)$$

Here, $\Gamma$ is defined as $\Gamma = \gamma_1^2\gamma_2^2 + \gamma_2^2\gamma_3^2 + \gamma_3^2\gamma_1^2$ with $\gamma_i$ being the directional cosines
between the stress axis and the crystal axes $i$, and ranging from 0 for the stress
axis in <100> to 1/3 for <111>. The dependence of the relaxation intensity on the
stress direction for tetragonal defects is therefore the opposite of that for trigonal
defects, as illustrated in Fig. 5.7. Examination of the relaxation strength in single
crystals thus allows identification of the defect symmetry, which can be an important
clue to determining the atomic configuration of the defect.
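
The orientation dependence can be checked with a few lines of code. The sketch assumes that the orientation parameter is $\Gamma = \gamma_1^2\gamma_2^2 + \gamma_2^2\gamma_3^2 + \gamma_3^2\gamma_1^2$ and that Eq. (23) has the form $\delta J = (1/3 - \Gamma)\,\delta S' + \Gamma\,\delta S$; the function names and the unit relaxation strengths are illustrative choices, not part of the original treatment.

```python
import math


def orientation_parameter(direction):
    """Gamma for a stress axis given as crystal direction indices [u, v, w]."""
    norm = math.sqrt(sum(x * x for x in direction))
    g1, g2, g3 = (x / norm for x in direction)
    return g1 * g1 * g2 * g2 + g2 * g2 * g3 * g3 + g3 * g3 * g1 * g1


def tensile_relaxation(direction, delta_s_prime, delta_s):
    """Relaxation of the tensile compliance, assuming the form of Eq. (23) above."""
    gamma = orientation_parameter(direction)
    return (1.0 / 3.0 - gamma) * delta_s_prime + gamma * delta_s


# Tetragonal defects relax only S' (delta S = 0); trigonal defects relax only S.
for axis in ([1, 0, 0], [1, 1, 0], [1, 1, 1]):
    tetragonal = tensile_relaxation(axis, 1.0, 0.0)
    trigonal = tensile_relaxation(axis, 0.0, 1.0)
    print(axis, round(orientation_parameter(axis), 4),
          round(tetragonal, 4), round(trigonal, 4))
```

In this form the tetragonal response is largest along <100> and vanishes along <111>, while the trigonal response behaves in the opposite way, which is the contrast used to identify the defect symmetry.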

Fig. 5.7 Dependencies of the magnitude of anelastic relaxation in
compliance $J$ on the orientation of the tensile stress for the case of
tetragonal and trigonal defects in cubic crystals. $\Gamma$ indicates the
orientation of the stress (see text); $\Gamma = 0$ and 1/3 correspond to a
stress parallel to <100> and to <111>, respectively.

Measurement of the relaxation rate is also useful, as it gives us information on
the mobility of the defect. If an appropriate stress is applied abruptly and is kept
constant just as in the temperature change experiment illustrated in Fig. 5.6, an
extra strain (the anelastic strain) appears gradually with time, similarly to the volume
change shown in Fig. 5.6. The evolution of the anelastic strain in such a
“quasi-static” experiment is most often described by the simple exponential time
law, $1 - \exp(-t/\tau)$, with the amplitude given by $\sigma_0\,\delta J$, i.e., proportional to the
stress amplitude $\sigma_0$. On the other hand, if a sinusoidal stress is applied the strain
lags behind the stress, and the phase lag $\phi$, or its tangent, which is a measure of
the loss of mechanical energy, is expressed by the Debye equation (24)

$$\tan\phi = \frac{\delta J}{J}\,\frac{\omega\tau}{1 + \omega^2\tau^2} \qquad (24)$$

with $\omega$ being the angular frequency of the applied stress. As this function shows a
maximum at $\omega\tau = 1$, one can readily determine the relaxation time by measuring
the phase lag as a function of vibration frequency. From the relaxation time $\tau$ obtained
by any such experiment, the frequency of the reorientation jump of the
defect can be evaluated. Since the jump is most often the elementary process of
diffusion, the relaxation rate $\tau^{-1}$ is, in such cases, directly proportional to the diffusion
coefficient $D$. If the mechanism of the reorientation and that of the diffusion
are both known, an exact relationship of the two quantities may be found.
In that case, the measurement of the relaxation rate provides a unique tool for determining
the diffusion coefficient outside the ranges usually accessible by conventional
techniques such as interdiffusion and radiotracer experiments. For more
details of anelastic relaxation and its practice, “mechanical spectroscopy,” see
Nowick and Berry (1972), Weller (2001), and Numakura (2003, 2006).
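
How a relaxation time is read off a frequency scan can be illustrated with a synthetic Debye peak. The snippet assumes the standard Debye form, in which the loss tangent is proportional to $\omega\tau/(1 + \omega^2\tau^2)$; the relaxation strength, the relaxation time and the frequency grid are assumed values for the illustration.

```python
import math


def loss_tangent(omega, tau, strength):
    """Debye peak: strength * (omega * tau) / (1 + (omega * tau)**2)."""
    x = omega * tau
    return strength * x / (1.0 + x * x)


def relaxation_time_from_peak(frequencies_hz, losses):
    """Estimate tau from the peak position, using omega * tau = 1 at the maximum."""
    peak_index = max(range(len(losses)), key=lambda i: losses[i])
    return 1.0 / (2.0 * math.pi * frequencies_hz[peak_index])


TAU_ASSUMED = 1.0e-2        # s, hypothetical relaxation time
STRENGTH_ASSUMED = 1.0e-3   # hypothetical relaxation strength (delta J / J)

# Logarithmic frequency scan from 0.1 Hz to 100 Hz, 100 points per decade.
frequencies = [10.0 ** (-1.0 + 0.01 * i) for i in range(301)]
losses = [loss_tangent(2.0 * math.pi * f, TAU_ASSUMED, STRENGTH_ASSUMED)
          for f in frequencies]
tau_estimate = relaxation_time_from_peak(frequencies, losses)
print(f"estimated relaxation time: {tau_estimate:.4e} s")
```

The accuracy of the estimate is limited by the spacing of the frequency grid; in practice a fit of the whole peak would be preferred to picking the single largest point.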


5.1.2 
Point Defects in Pure Metals and Dilute Alloys 


5.1.2.1 Vacancies 


Single vacancies The atomic structure of a vacancy in pure metal crystals is a
simple vacant lattice site, with the neighboring atoms being displaced, or “relaxed,”
to some extent toward the vacant site (Johnson and Brown 1962; Johnson
1964). This simple structure has been found to be stable by a number of computer
simulation studies (see, e.g., Finnis and Sinclair 1984; Daw et al. 1993),
and is supported by the agreement between theory and experiment of positron
annihilation (Schaefer 1987). The formation energy $\Delta e_{f,v}$ is the net increase in
the internal energy on introducing a vacancy. If one assumes the nearest-neighbor
pair interaction model, it equals $z/2$ times the bond energy $\varepsilon_1$, as
seen in Fig. 5.3: $z\varepsilon_1$ is lost around a vacancy, and $(z/2)\varepsilon_1$ is recovered around the
atom at the surface site. Since the internal energy of the perfect crystal consisting
of $N$ atoms is given by $-(Nz/2)\varepsilon_1$ in this model, the vacancy formation energy
is expected to be equal to the binding energy per atom, i.e., the cohesive energy,
$E_c$, neglecting possible effects of the atomic relaxation around a vacancy. In real
metals, $E_c$ and $\Delta h_{f,v}$ (= $\Delta e_{f,v}$) are both known to be roughly proportional to the
melting temperature $T_m$, but the latter is only about one third of the former. Figure 5.8
displays the cohesive energy and the vacancy formation enthalpy of fcc
and bcc metals with $k_B T_m$ on the abscissa. These relationships are expressed
respectively as Eqs. (25) and (26).

$$E_c \approx (29 \pm 1)\,k_B T_m \qquad (25)$$

$$\Delta h_{f,v} \approx (11 \pm 1)\,k_B T_m \qquad (26)$$

The disparity between the prediction of the pair interaction model and experiment is due
partly to the neglect of the atomic relaxation but to a greater extent to the simplistic
model. The experimental data of the vacancy formation enthalpy are listed
in Table 5.1, together with the migration enthalpy $\Delta h_{m,v}$ and those of a
self-interstitial atom.

Fig. 5.8 Correlations of the cohesive energy $E_c$, activation energy for
self-diffusion $Q_{SD}$, and vacancy formation enthalpy $\Delta h_{f,v}$ with the
melting temperature $T_m$ for pure metals. $k_B$ is the Boltzmann constant.

The formation volume of a vacancy, $\Delta v_{f,v}$, must be smaller to some extent than
one atomic volume, $\Omega$, because the atoms around the vacancy relax their positions
inward on the average. There are not many experimental data for the formation
volume, however. The limited data for fcc metals fall in the range between
0.5 $\Omega$ and 0.95 $\Omega$ (Ullmaier 1991; Wollenberger 1996), but the uncertainties are
not small and no systematic trend has been recognized. On the other hand, theoretical
values have been accumulating recently, by computer simulation with
fairly reliable many-body interatomic potentials and first-principles calculations
(Finnis and Sinclair 1984; Daw et al. 1993; Mishin et al. 2001a,b). They reveal
that the formation volume is correlated with Poisson’s ratio of the host crystal
(Kurita and Numakura 2004). Figure 5.9 shows the linear relation between the
relaxation volume $\Delta v_r$ and Poisson’s ratio $\nu$. According to this relationship, the
relaxation is large in crystals whose Poisson’s ratio is high and, therefore, the formation
volume, given by $\Omega + \Delta v_r$, is significantly smaller than $\Omega$. Since knowledge
of the formation volume is important in studies of diffusion under high
pressure, this relationship must be useful as a rough estimate when no experimental
value is available.

Table 5.1 The melting temperature $T_m$, cohesive energy $E_c$ (Kittel 1996),
formation enthalpy $\Delta h_f$ and migration enthalpy $\Delta h_m$ of a vacancy
(v) and a self-interstitial atom (i) in pure metals (Ullmaier 1991;
Wollenberger 1996).

Material  T_m [K]  E_c [eV]  Vacancy                        Self-interstitial
                             Δh_f,v [eV]     Δh_m,v [eV]    Δh_f,i [eV]  Δh_m,i [eV]

fcc
Al        934      3.39      0.67            0.61           3.0-3.6      0.112-0.115
γ-Fe      1811     4.28      1.40            1.26
Ni        1728     4.44      1.78            1.04                        0.15
Cu        1358     3.49      1.28            0.70           1.6-4.2      0.117
Pd        1828     3.89      1.85            1.03
Ag        1235     2.95      1.11            0.66                        0.088
Pt        2041     5.84      1.35            1.43           1.1-1.5      0.063
Au        1337     3.81      0.93            0.71
Pb        601      2.03      0.58            0.43                        0.01

bcc
V         2183     5.31      2.2             0.5-0.7
Cr        2180     4.1       2.0             0.95
α-Fe      1811     4.28      1.59-1.73 [a]   1.11           4.7-5.0      0.3
                             1.79-1.85 [b]
Nb        2750     7.57      3.07            0.55
Mo        2896     6.82      3.0             1.35                        0.083
Ta        3269     8.1       3.1             0.7
W         3695     8.9       3.6             1.70                        0.054

[a] Ferromagnetic state. [b] Paramagnetic state.

The nonconfigurational entropy associated with a vacancy, $\Delta s_{f,v}$, is expected to
be positive, since the introduction of a vacancy would bring about low-frequency
modes in the lattice vibration. It has been evaluated from experimental data for
the absolute concentration for some metals; for example (in units of $k_B$), it is 0.7 for
aluminum, 2.8 for copper, 1.5 for silver, 0.72 for gold, 1.6 for molybdenum, and 3.2
for tungsten (Wollenberger 1996).

Fig. 5.9 Relaxation volume of a vacancy $\Delta v_r$ in pure metals evaluated by
computer simulation and first-principles calculations as a function of
Poisson’s ratio $\nu$ of the host crystal.

With the formation parameters discussed above, one can estimate the equilibrium
vacancy concentration in typical metals. At the melting temperature $T_m$, for
example,

$$c_v(T_m) = \exp\left(\frac{\Delta s_{f,v}}{k_B}\right)\exp\left(-\frac{\Delta h_{f,v}}{k_B T_m}\right) \approx \exp(2 \pm 2)\,\exp(-11 \pm 1) \sim 10^{-4 \pm 1} \qquad (27)$$

It decreases with decreasing temperature, and is reduced to $10^{-8 \pm 1}$ at $T_m/2$.
Vacancy concentrations in solid-solution alloys may also be estimated in the same
manner by taking the solidus temperature as $T_m$.
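
The order-of-magnitude estimate can be reproduced directly. The sketch below uses the central values quoted above, a formation entropy of about 2 $k_B$ and a formation enthalpy of about 11 $k_B T_m$; the function name and the use of a reduced temperature $T/T_m$ are choices made for this illustration.

```python
import math


def vacancy_concentration(entropy_in_kb, enthalpy_over_kb_tm, reduced_temperature=1.0):
    """c_v = exp(ds/k_B) * exp(-dh / (k_B T)), with T given as a fraction of T_m."""
    return math.exp(entropy_in_kb) * math.exp(-enthalpy_over_kb_tm / reduced_temperature)


c_melt = vacancy_concentration(2.0, 11.0)        # at T = T_m
c_half = vacancy_concentration(2.0, 11.0, 0.5)   # at T = T_m / 2
print(f"c_v(T_m)   = {c_melt:.1e}")
print(f"c_v(T_m/2) = {c_half:.1e}")
```

Only the enthalpy term changes with temperature, so halving the temperature doubles the exponent of that term while the entropy factor stays fixed.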

The translational motion of a vacancy to a neighboring site is in fact a jump of
the neighboring atom to the vacant site, and is associated with the energy barrier
at the intermediate state (Johnson and Brown 1962; Johnson 1964), similarly to
the case of the interstitial atom depicted in Fig. 5.4. The migration enthalpy of a
vacancy can be determined from the decay of quenched-in excess vacancies as explained
in Section 5.1.1.3. It can also be estimated from the activation energy of
tracer self-diffusion. The self-diffusion in metals occurs by the vacancy mechanism,
and the tracer self-diffusion coefficient, $D^*$, is given by the product of the
jump frequency of a vacancy $\omega_v$, its concentration $c_v$ and the tracer correlation
factor $f$, which is a constant (cf. Section 5.2). Therefore, if both the activation energy
of the tracer self-diffusion $Q_{SD}$ and the vacancy formation enthalpy $\Delta h_{f,v}$ are
known, their difference must give a good estimate of the migration enthalpy
$\Delta h_{m,v}$. The activation energy of self-diffusion in metals is also known to be well
correlated with the melting temperature (Brown and Ashby 1980), expressed as
$Q_{SD} \approx (18 \pm 1)\,k_B T_m$; this correlation is also shown in Fig. 5.8. Together with
Eq. (26), therefore, we have $\Delta h_{m,v} \approx (7 \pm 2)\,k_B T_m$. In fact, $Q_{SD}$ and $\Delta h_{f,v}$ are directly
correlated with each other and are expressed as $\Delta h_{f,v} = (0.60 \pm 0.01)\,Q_{SD}$.
The migration enthalpy is thus related to $Q_{SD}$ as $\Delta h_{m,v} = (0.40 \pm 0.01)\,Q_{SD}$.
Here we have yet another empirical formula, $\Delta h_{m,v}/\Delta h_{f,v} \approx 2/3$, for pure metals.
A similar relationship is expected for disordered solid-solution alloys, while the
behavior can be different in cases where, for example, strong interaction between
vacancies and solute atoms exists, or solute atoms do not migrate by the ordinary
vacancy mechanism (Hood 1993).
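
These empirical correlations lend themselves to quick estimates from the melting temperature alone. The following sketch applies the central coefficients quoted in the text (29, 11 and 18 times $k_B T_m$, and the 0.40 fraction of $Q_{SD}$) to copper, using the melting temperature from Table 5.1; the function and key names are illustrative.

```python
K_B_EV = 8.617333e-5  # Boltzmann constant in eV/K


def empirical_estimates(melting_temperature_k):
    """Rough estimates (in eV) from the correlations with k_B * T_m."""
    kt_m = K_B_EV * melting_temperature_k
    q_sd = 18.0 * kt_m
    return {
        "cohesive_energy": 29.0 * kt_m,
        "vacancy_formation_enthalpy": 11.0 * kt_m,
        "self_diffusion_activation": q_sd,
        "vacancy_migration_enthalpy": 0.40 * q_sd,
    }


copper = empirical_estimates(1358.0)
for name, value in copper.items():
    print(f"{name}: {value:.2f} eV")
```

For copper the estimated formation enthalpy and cohesive energy come out close to the tabulated 1.28 eV and 3.49 eV, whereas the migration enthalpy is reproduced less well, which reflects the scatter of the correlations.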





Divacancies At elevated temperatures where the equilibrium concentration of 
vacancies is fairly high, close pairs of vacancies at neighboring sites, i.e., divacan- 
cies, may occur. By considering the reduction in the number of missing bonds 
and also in lattice strain energy (see Section 5.1.2.3), association of two vacancies 
is expected to reduce the enthalpy of the crystal. The free energy of association is 
defined from the free energies of formation as 


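$$\Delta g_{a,vv} = \Delta g_{f,vv} - 2\,\Delta g_{f,v} \qquad (28)$$

and the concentration of divacancies is given by Eq. (29).

$$c_{vv} = \frac{z}{2}\exp\left(-\frac{2\,\Delta g_{f,v}}{k_B T}\right)\exp\left(-\frac{\Delta g_{a,vv}}{k_B T}\right) = \frac{z}{2}\exp\left(-\frac{\Delta g_{f,vv}}{k_B T}\right) \qquad (29)$$

The total concentration of vacant lattice sites is then

$$c_{v,total} = c_v + 2c_{vv} = \exp\left(-\frac{\Delta g_{f,v}}{k_B T}\right)\left[1 + z\exp\left(-\frac{\Delta g_{f,v} + \Delta g_{a,vv}}{k_B T}\right)\right] \qquad (30)$$

The second term in the bracket can be of appreciable magnitude if there is a
strong binding between two vacancies, i.e., the association energy $\Delta g_{a,vv}$ is negative
and its magnitude is not negligible in comparison to $\Delta g_{f,v}$.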
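
The weight of the divacancy term can be gauged with a short calculation. The sketch assumes the forms $c_v = \exp(-\Delta g_{f,v}/k_B T)$ and $c_{vv} = (z/2)\exp(-2\Delta g_{f,v}/k_B T)\exp(-\Delta g_{a,vv}/k_B T)$, with all energies expressed in units of $k_B T$; the values 9 and -2 are hypothetical inputs, not data for a particular metal.

```python
import math


def vacant_site_concentrations(g_f_over_kt, g_a_over_kt, z=12):
    """Return (c_v, c_vv, c_total) for single vacancies and divacancies."""
    c_v = math.exp(-g_f_over_kt)
    c_vv = 0.5 * z * math.exp(-2.0 * g_f_over_kt) * math.exp(-g_a_over_kt)
    return c_v, c_vv, c_v + 2.0 * c_vv


# Hypothetical case: formation free energy 9 k_B T, association free energy -2 k_B T.
c_v, c_vv, c_total = vacant_site_concentrations(9.0, -2.0)
bracket_form = math.exp(-9.0) * (1.0 + 12 * math.exp(-(9.0 - 2.0)))
print(f"c_v = {c_v:.3e}, c_vv = {c_vv:.3e}, divacancy share = {2.0 * c_vv / c_total:.2%}")
```

The share of vacant sites held in divacancies grows with the strength of the binding and, for a fixed binding energy, with temperature, since the term is quadratic in the single-vacancy concentration.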

Divacancies are known to become noticeable in fcc metals at high temperatures 
and play a role in diffusion (Mehrer 1978). Calculations for the enthalpy of asso- 
ciation in the noble metals have reported a range of values from —0.1 to —0.3 eV 
(Balluffi 1978). Computer simulation studies suggest that divacancies in fcc met- 
als are more mobile than single vacancies (Johnson 1965, and references therein). 
The Arrhenius plot of the tracer self-diffusion coefficient is often found to be 
curved upward at high temperatures, and this trend is attributed to the contribu- 
tion of divacancies to the diffusion (Mehrer 1978; Peterson 1978). 

Much less is known about divacancies in bcc metals. The stable structure was 
suggested to be a pair of vacancies at the second-neighbor positions, and the mi- 
gration energy was similar to the case of the single vacancy (Johnson 1964). The 
Arrhenius plots of the tracer self-diffusion coefficient in some bcc metals were 
known to be appreciably curved upward at high temperatures, and were assumed 
to be due to the contribution of divacancies (Peterson 1978). However, those
curved Arrhenius plots are now considered to be due largely to the dynamical 
properties of the crystal (Gilder and Lazarus 1975; Herzig 1993). In this context, 
the role of divacancies in the self-diffusion in fcc metals has been questioned and 
critically re-analyzed (Mundy 1987). 


Vacancy-Solute Complexes In dilute solid-solution alloys, complexes of vacan- 
cies and solute atoms may be formed, owing to electronic and/or elastic interac- 
tion between the two defect species. Formation of pairs of a vacancy and a solute 
atom in metals has been studied theoretically and experimentally (Le Claire 1978; 
March 1978), and the diffusion behavior of the solute atoms, as well as the sol- 
vent atoms, has been analyzed (Le Claire 1978, 1993). As the concentration of 
the solute atoms can be (and usually is) much higher than that of vacancies and 
may have significant effects on the properties of the material, we discuss the for- 
mation of vacancy-solute atom pairs in slightly more detail. 

Let us consider a dilute binary A-B alloy consisting of $N_A$ and $N_B$ of A and B
atoms, respectively ($N_A \gg N_B$). The number of lattice sites next to a B atom is
$zN_B$ and the number of other sites is $N_A - zN_B$. The probabilities of finding a
vacancy at these sites are given respectively by
$\exp(-\Delta g_{f,v}/k_B T)\exp(-\Delta g_{a,vs}/k_B T)$ and $\exp(-\Delta g_{f,v}/k_B T)$. The numbers of isolated
vacancies and those associated with a solute atom are thus written as

$$n_v = (N_A - zN_B)\exp\left(-\frac{\Delta g_{f,v}}{k_B T}\right) \qquad (31)$$

$$n_{vs} = zN_B\exp\left(-\frac{\Delta g_{f,v} + \Delta g_{a,vs}}{k_B T}\right) \qquad (32)$$

Their concentrations are obtained by dividing by the total number of lattice sites,
$N_A + N_B + n_v + n_{vs}$.

Figure 5.10 shows Arrhenius plots of calculated concentrations of isolated vacancies,
$c_v$ (dotted line), vacancy-solute atom pairs, $c_{vs}$ (broken line), and their
sum, $c_{v,total}$ (solid line), for an exemplary case of $z = 12$, $x_B = N_B/(N_A + N_B) =
1 \times 10^{-2}$ and $\Delta g_{a,vs} = -0.15\,\Delta g_{f,v}$. Since the solute concentration is low in this
example, $c_v$ is virtually equal to the vacancy concentration in an unalloyed crystal.
The attractive interaction between a vacancy and the solute atom gives rise to vacancies
bound to the solute atoms at low temperatures. With decreasing temperature
the number of vacancy-solute atom pairs is progressively increased and becomes
significantly larger than the number of isolated vacancies, leading to a
higher total concentration of vacancies than without the solute atoms. If we take
aluminum, for which $\Delta h_{f,v} \approx 0.7$ eV and $\Delta s_{f,v}/k_B = 0.7$, the melting temperature
(934 K) and room temperature correspond respectively to 8 and 26 on the abscissa,
$\Delta g_{f,v}/(k_B T)$. The association energy $\Delta g_{a,vs} = -0.15\,\Delta g_{f,v}$ ($\approx -0.1$ eV) applies
to such solute atoms as magnesium. In this case, below 520 K, which corresponds
to $\Delta g_{f,v}/(k_B T) = 15$ (where $c_{vs}$ equals $c_v$), the majority of vacancies are
bound to the solute atoms.
Fig. 5.10 Arrhenius plots of calculated concentrations of isolated
vacancies $c_v$, vacancy-solute atom pairs $c_{vs}$, and the total concentration
$c_{v,total}$ in a dilute A-B solid-solution alloy. The co-ordination number $z = 12$,
the solute concentration (mole fraction) $x_B = 1 \times 10^{-2}$, and
$\Delta g_{a,vs}/\Delta g_{f,v} = -0.15$.
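
The competition between isolated and solute-bound vacancies can be explored numerically. The sketch below assumes the site-counting expressions $n_v = (N_A - zN_B)\exp(-\Delta g_{f,v}/k_B T)$ and $n_{vs} = zN_B\exp(-(\Delta g_{f,v} + \Delta g_{a,vs})/k_B T)$, normalized per atom, with $z = 12$, a solute fraction of $10^{-2}$ and a binding ratio of -0.15 as in the example discussed in the text; the function name and the per-atom normalization are choices made for this illustration.

```python
import math


def vacancy_solute_concentrations(g_f_over_kt, x_b=1.0e-2, z=12, binding_ratio=-0.15):
    """Return (c_v, c_vs): isolated and solute-bound vacancy concentrations."""
    n_a = 1.0 - x_b                       # solvent atoms per atom
    n_b = x_b                             # solute atoms per atom
    g_a_over_kt = binding_ratio * g_f_over_kt
    n_v = (n_a - z * n_b) * math.exp(-g_f_over_kt)
    n_vs = z * n_b * math.exp(-(g_f_over_kt + g_a_over_kt))
    sites = n_a + n_b + n_v + n_vs        # total number of lattice sites per atom
    return n_v / sites, n_vs / sites


# Abscissa values quoted for aluminum: about 8 near melting, about 26 at room temperature.
for abscissa in (8.0, 26.0):
    c_v, c_vs = vacancy_solute_concentrations(abscissa)
    print(f"g_f/kT = {abscissa:4.1f}: c_v = {c_v:.2e}, c_vs = {c_vs:.2e}")
```

Near the melting point the isolated vacancies dominate, while at room temperature most vacancies are bound to solute atoms; where exactly the two curves cross depends on the assumed solute fraction and binding ratio.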


An unusually large number of vacancies, amounting to 10 at.%, has been dis- 
covered in metals that contain hydrogen as interstitial solutes (Fukai and Okuma 
1993; Fukai 2003). These vacancies, called “super-abundant vacancies,” are a 
somewhat extreme example of the enhancement of the vacancy concentration
by the attractive solute-vacancy interaction described above. There is ample evi- 
dence that interstitial hydrogen atoms attract vacancies to form clusters, with a 
binding energy of 0.3 to 0.4 eV, and considerably reduce the effective formation 
energy of vacancies (Fukai 2003). The immediate impact of a high concentration 
of vacancies on diffusion is particularly interesting (Hayashi et al. 1998; Iida et al.
2005). 


5.1.2.2 Self-Interstitial Atoms 

The atomic configuration of a self-interstitial atom in fcc metals was first envisaged
as an extra atom located at the body-centered position of the unit cell (Fig.
5.11a) (Huntington and Seitz 1942), which is referred to as the octahedral configuration.
The configuration shown in Fig. 5.11(b), called a <100> split, or “dumbbell”,
interstitial, was originally believed to be an intermediate state in the translational
motion of the octahedral interstitial. However, this configuration was
suggested to be stable by later theoretical studies (Huntington 1953; Johnson
and Brown 1962; Seeger et al. 1962), and was proven to exist in irradiated metals
by anelastic relaxation experiments (Nowick 1978; Robrock 1990). Self-interstitial
atoms in bcc metals are expected to be of a similar form: the <110> split configuration
of Fig. 5.11(c) (Johnson 1964, 1973).
Fig. 5.11 Atomic structure of a self-interstitial atom in metals. (a) The
octahedral configuration in fcc crystals. (b) The <100> split (or
“dumbbell”) configuration in fcc crystals. (c) The <110> split
configuration in bcc crystals.


The formation enthalpy of a self-interstitial atom, $\Delta h_{f,i}$, in metals has been
evaluated as the difference between the formation enthalpy of a Frenkel pair,
$\Delta h_{f,F}$, and that of a vacancy, $\Delta h_{f,v}$ (Schilling 1978; Ullmaier 1991; Wollenberger
1996). The former can be determined by calorimetric measurements, from the release
of the stored energy of a sample irradiated by electrons. The results for aluminum,
copper, and iron show that the formation enthalpy of the self-interstitial
atom is considerably higher than that of the vacancy, by a factor of 2 to 4.

The formation volume $\Delta v_{f,i}$ is expected to be $-\Omega$ if no relaxation occurs, but
positive values ranging from 0.1 $\Omega$ to 1.1 $\Omega$ have been reported by experiment
and simulation (Schilling 1978; Wollenberger 1996). A positive formation volume
means that, when an extra atom is inserted into a crystal, the crystal dilates more
than the specific volume of the inserted atom, even though the atom is somehow
accommodated within. This occurs because the distortion around the self-interstitial
atom is very large; it is in fact the origin of the high energy of formation.

For the entropy of formation, $\Delta s_{f,i}$, virtually no experimental evaluations have
been made, whereas data are available for vacancies in some metals which were 
derived from measurements of the absolute concentration (Wollenberger 1996). 
Theoretical calculations of the formation entropy are not straightforward (de Kon- 
ing et al. 2002) and numerical evaluation is demanding. Nevertheless, a novel 
scheme for computing the formation entropy of point defects has been developed 
recently (Mishin et al. 2001a), which is expected to promote detailed studies and 
produce reliable estimations. 

The equilibrium concentration of self-interstitial atoms in metals is negligibly
low because of the high formation enthalpy. Assuming $\Delta h_{f,i} \approx 33\,k_B T_m$
(= $3\,\Delta h_{f,v}$) and tentatively neglecting the effect of the formation entropy, the
concentration at the melting temperature is estimated to be below $10^{-14}$, which is
hardly detectable by any experimental method. At the temperature $T_m/2$, the
concentration would be further reduced to below $10^{-28}$, which means that not a
single self-interstitial atom would be found in a macroscopic piece of metal.
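
The same arithmetic as for vacancies makes the point explicit. The sketch takes the assumed formation enthalpy of 33 $k_B T_m$ and neglects the formation entropy, as in the text; the figure of $10^{23}$ atoms for a macroscopic sample is an illustrative assumption.

```python
import math


def interstitial_concentration(enthalpy_over_kb_tm=33.0, reduced_temperature=1.0):
    """Equilibrium concentration exp(-dh / (k_B T)), formation entropy neglected."""
    return math.exp(-enthalpy_over_kb_tm / reduced_temperature)


ATOMS_IN_SAMPLE = 1.0e23  # assumed number of atoms in a macroscopic piece of metal

c_melt = interstitial_concentration()
c_half = interstitial_concentration(reduced_temperature=0.5)
expected_count_half = c_half * ATOMS_IN_SAMPLE
print(f"c_i(T_m)   = {c_melt:.1e}")
print(f"c_i(T_m/2) = {c_half:.1e}, expected number in sample = {expected_count_half:.1e}")
```

An expected number far below one at $T_m/2$ is what is meant by not a single self-interstitial atom being present in equilibrium.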

The mobility of a self-interstitial atom is known to be much higher than that of
a vacancy (Johnson 1973; Young 1978). Figure 5.12 illustrates an elementary step
of the motion of a <100> split interstitial in an fcc crystal. As it consists of two
identical atoms only partly displaced from the regular position, the motion of
the interstitial is achieved by the slight movement of one of the members to the
next displaced position (displacing a regular atom ahead) and the accompanying
movement of the other member to the regular atomic site. This process occurs
with a low activation enthalpy in the range between 0.05 and 0.2 eV in aluminum,
copper, nickel, and iron (Ullmaier 1991; Wollenberger 1996).

Fig. 5.12 Motion of a <100> split self-interstitial in an fcc crystal.


5.1.2.3 Solute Atoms 

Solute or impurity atoms constitute atomic defects by themselves. Moreover, they 
may influence the properties of intrinsic point defects by forming close pairs as 
discussed in Section 5.1.2.1, or globally changing the electronic properties of the 
host material. 

Light elements with small atoms dissolve in metals as interstitial solute atoms,
producing significant lattice distortion. For instance, hydrogen and carbon in
nickel and other fcc metals occupy the octahedral or tetrahedral interstitial sites
shown in Figs. 5.13(a,b), and give rise to isotropic dilatation. Substitutional solute
atoms also produce isotropic distortion, but either dilatation or contraction, depending
on the relative atomic size in comparison to the host atoms. The volume
change is given by the trace of the $\lambda$ tensor and can be evaluated, for cubic crystals,
by the relative change in the lattice parameter caused by the addition of the
solute [Eq. (33)].

$$\frac{\Delta V}{V} = \mathrm{tr}(\lambda_{ij}) = \frac{3}{a}\,\frac{da}{dc} \qquad (33)$$

Fig. 5.13 An interstitial solute atom (shaded circle) occupying
octahedral (a) and tetrahedral (b) interstitial sites in an fcc crystal, and
octahedral (c) and tetrahedral (d) interstitial sites in a bcc crystal.

According to anisotropic elasticity theory, two centers of isotropic distortion inter- 
act with each other through the elastic anisotropy of the host crystal (Eshelby 
1955). The interaction energy in cubic crystals is expressed as 


15 C-C  AVIAV 
4 Baa 7 





(34) 


E; int 


where $C'$ $[= (C_{11} - C_{12})/2]$ and $C_{44}$ are elastic stiffness constants corresponding
respectively to {110}⟨110⟩ and {001}⟨100⟩ shear, $\nu$ is Poisson's ratio, $\Delta V_1$ and
$\Delta V_2$ are the changes in volume associated with defect 1 and defect 2, respectively,
and $r$ is the distance between them. The last factor $\Gamma$ is defined by the orientation
of the pair as

$$\Gamma = \alpha_1^4 + \alpha_2^4 + \alpha_3^4 - \frac{3}{5} \tag{35}$$

with the $\alpha_i$ being the direction cosines between the pair axis and the three principal
crystal axes. According to this formula, in crystals whose elastic anisotropy
$A = C_{44}/C'$ is greater than 1, for example, two point defects producing volume
changes of the same sense attract each other if aligned along ⟨100⟩ but repel along
⟨111⟩. A defect pair thus formed may produce, in turn, uniaxial distortion along
the axis of the pair, and cause an anelastic relaxation effect, which is called Zener
relaxation for the case of substitutional atom pairs (Nowick and Berry 1972;
Weller 2001). A similar relaxation effect is anticipated for divacancies and interstitial
solute atom pairs. The latter is in fact observed for carbon in fcc metals
(Numakura et al. 2000).
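
The orientation dependence described above can be checked numerically. The following Python sketch evaluates the orientation factor for a few pair axes, assuming the form $\Gamma = \alpha_1^4 + \alpha_2^4 + \alpha_3^4 - 3/5$ for Eq. (35); the function name `orientation_factor` is purely illustrative.

```python
import math

def orientation_factor(direction):
    """Gamma = sum(alpha_i^4) - 3/5 for a pair axis given as a crystal direction [u, v, w]."""
    norm = math.sqrt(sum(x * x for x in direction))
    cosines = [x / norm for x in direction]          # direction cosines alpha_i
    return sum(a ** 4 for a in cosines) - 3.0 / 5.0

for uvw in ([1, 0, 0], [1, 1, 0], [1, 1, 1]):
    print(uvw, round(orientation_factor(uvw), 4))
```

Under this assumption the factor is positive along ⟨100⟩ and negative along ⟨110⟩ and ⟨111⟩, so the interaction changes sign between these orientations, in line with the attraction along ⟨100⟩ and repulsion along ⟨111⟩ stated in the text.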

Interstitial solute atoms in bcc metals bring about anisotropic distortion in ad- 
dition to the dilatation, since the octahedral and tetrahedral sites that they occupy, 
shown in Figs. 5.13(c,d), are of tetragonal symmetry, which is lower than the sym- 
metry of the host lattice. The uniaxial distortion is again the origin of the anelas- 
tic relaxation effect, known as the Snoek relaxation, which is caused by reorienta- 
tion of the anisotropic defect under external stress. By measuring the magnitude 
and the rate of the relaxation effect, the concentration and the mobility of the in- 
terstitial solute atoms can be determined (Nowick and Berry 1972; Numakura 
2003). 
5.1.3 Point Defects in Ordered Alloys


5.1.3.1 Point Defects and Properties of the Material 

There has been growing interest in ordered alloys and intermetallic compounds 
as novel structural and functional materials. In particular, transition-metal alumi- 
nides, gallides, and silicides are attractive as high-strength lightweight alloys for 
high-temperature use. The interesting and sometimes puzzling properties of 
these materials often originate from point defects in the ordered crystal structure 
(de Novion 1995). Ordered alloys and intermetallic compounds exhibit a certain 
range of stability in composition, in which the deviation from the stoichiometry 
must be realized by intrinsic point defects, i.e., antisite atoms and vacancies. 
Quite often these defects strongly influence material properties. For instance, in
the aluminides of iron and nickel of the B2 structure, the hardness and the chemical
diffusion coefficient increase strongly with deviation of the composition from
the stoichiometry (Pike et al. 1997; Kim and Chang 2000), both of which originate
from the increasing point-defect density.

The chemical diffusion in stoichiometric NiAl is slower than in off- 
stoichiometric alloys, which is natural in view of restrictions on atom movements 
in the ordered structure (Bakker 1984; Koiwa 1992; see also Section 5.2.3). How- 
ever, it is not as slow as it might be; the tracer diffusion coefficient of Ni in NiAl, 
shown in Fig. 5.14, is similar in magnitude to the tracer self-diffusion coefficient 
in pure nickel, when comparison is made with temperature scaled to melting 
temperatures $T_m$. The diffusion of Ni in Ni$_3$Al of the fcc-based L1$_2$ structure is
slower than in B2 NiAl even though the Ni atom sites in the L1$_2$ structure are
mutually connected, unlike the case of the B2 structure. In strong contrast, the
diffusivity of Ni in Ni$_3$Sb of the bcc-based D0$_3$ structure is higher by several
orders of magnitude; Ni atoms diffuse in this compound as fast as interstitial carbon
atoms in nickel. This variety of diffusion behavior can only be understood
through quantitative knowledge of diffusion-mediating defects. In fact, Ni$_3$Sb is
known to have a large number of vacancies in the Ni sublattice (Heumann and
Stüer 1966). Experiments have shown that the defect and diffusion properties
are "normal" in Ni$_3$Al (Numakura et al. 2001); experimental data (Wang et al.
1984; Frank et al. 1995; Badura-Gergen and Schaefer 1997) show that the empirical
laws for the vacancy formation enthalpy ($\Delta h_V^{\mathrm{f}} \approx 11 k_B T_m$), the diffusion
activation energy ($Q \approx 18 k_B T_m$), and the proportions of the formation and migration
enthalpies of vacancies roughly hold. In contrast, compounds based on the bcc
structure tend to exhibit anomalous behavior. In particular, the formation energy 
of vacancies in transition-metal aluminides and gallides is markedly low and, on 
the other hand, the migration energy is somewhat high (Sprengel et al. 2002). 
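
To give a feeling for the magnitudes implied by these empirical rules, the short Python sketch below evaluates $11 k_B T_m$ and $18 k_B T_m$ for pure nickel. The melting temperature of about 1728 K and the function name `empirical_vacancy_rules` are inputs chosen for this illustration, not values taken from the text.

```python
K_B_EV = 8.617333e-5   # Boltzmann constant in eV/K

def empirical_vacancy_rules(t_melt_kelvin):
    """Return (vacancy formation enthalpy, diffusion activation energy) in eV
    from the rules of thumb 11 k_B T_m and 18 k_B T_m."""
    return 11.0 * K_B_EV * t_melt_kelvin, 18.0 * K_B_EV * t_melt_kelvin

h_f, q = empirical_vacancy_rules(1728.0)   # pure Ni, T_m of about 1728 K
print(f"formation enthalpy ~ {h_f:.2f} eV, activation energy ~ {q:.2f} eV")
```

The difference between the two estimates, about $7 k_B T_m$, is then the rough scale of the vacancy migration enthalpy implied by the same rules.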

In order-disorder alloys, the degree of order is gradually lowered with increasing
temperature and finally falls to zero at the transition temperature. This lowering
of the long-range order occurs by progressive enhancement of antistructural
disorder, i.e., increase in the concentrations of antisite atoms. Deviations
Fig. 5.14 Tracer diffusion coefficient of Ni in B2 ordered NiAl (Divinski
et al. 2001), L1$_2$ ordered Ni$_3$Al (Frank et al. 1995), D0$_3$ ordered Ni$_3$Sb
(Heumann and Stüer 1966) as a function of inverse temperature
normalized to the melting temperature of each material. The broken 
line is the tracer self-diffusion coefficient in nickel (Bakker 1968; Maier 
et al. 1976), and the dotted line is the tracer diffusion coefficient of 
carbon in nickel (Smith 1966; Cermak and Mehrer 1994). 


from stoichiometry within the range of stable compositions are usually accommodated
by introduction of antisite atoms, similarly to the case of thermal disordering.
In some intermetallic compounds, on the other hand, this is not always the
case. Among others, FeAl, NiAl, CoAl, NiGa, and CoGa are known to bear an unusually
high concentration of vacancies, particularly at Al-rich or Ga-rich compositions
(Chang and Neumann 1983). Figure 5.15 shows the variations of the lattice
parameter of Ni$_3$Al and NiAl with composition (Taylor and Doyle 1972; Aoki
and Izumi 1975). In Ni$_3$Al, the lattice parameter increases steadily with Al concentration
as it passes through the stoichiometric composition, $x_{Al}$ (mole
fraction) = 0.25, reflecting the larger specific volume of Al than Ni. The predominant
defect species was found to be antisite Ni atoms at $x_{Al} < 0.25$ and antisite
Al atoms at $x_{Al} > 0.25$ (Aoki and Izumi 1975). In NiAl, the lattice parameter also
increases linearly with Al content in the Ni-rich composition range, but at the
stoichiometric composition it turns sharply and begins to decrease. The negative slope at
the Al-rich compositions indicates that vacancies are introduced in the Ni sub- 
lattice instead of antisite Al atoms. This behavior suggests that the formation en- 
thalpy of vacancies is extremely low in this compound, and thus their concentra- 
tion could be notably high. The not-too-slow self-diffusion in NiAl in spite of the 
ordered structure can be explained by the high concentration of vacancies, which 
are certainly the defects mediating diffusion. 
Fig. 5.15 Variation of the lattice parameter of Ni$_3$Al (Aoki and Izumi
1975) and of NiAl (Taylor and Doyle 1972) with composition.


As exemplified above, detailed knowledge of point defects is important for a 
good understanding of a great variety of diffusion behavior and other properties 
of ordered alloys and intermetallic compounds. For this purpose, in what follows 
we first describe the statistical thermodynamics of point defects in binary ordered 
alloys and then present some interesting examples of equilibrium concentrations 
of vacancies and antisite atoms. We restrict our discussion to thermodynamic 
principles, focusing on relationships between interatomic interactions and trends 
in the defect formation. For experimental facts, we refer the reader to recent re- 
views on the subject by de Novion (1995) and Sprengel et al. (2002). 


5.1.3.2 Statistical Thermodynamics 

Theoretical analyses of point defects in ordered alloys date back to the pioneering 
work of Wagner and Schottky (1931), who established the statistical thermody- 
namic treatment of vacancies, self-interstitial atoms, and antistructure defects in 
binary compounds. In their model, each defect is assumed to be in a defect-free 
local environment and is characterized by a specific energy (the defect energy). 
The equilibrium concentrations are then calculated in the grand canonical for- 
malism with the approximation that the defect concentrations are so low that 
each defect finds no other defects around it. The theory of Bragg and Williams 
(1934, 1935) allowed the presence of other defects in the vicinity with probabil- 
ities proportional to their concentrations, which is known as the mean-field 
approximation. The theory was able to describe the co-operative nature of the 
order-disorder transition reasonably well. Bragg and Williams considered only 
antistructural defects, but it is straightforward to include vacancies in their 
theory; the studies by Cheng et al. (1967) and Schapink (1969) are examples of 
Table 5.2 Constituent species in a binary A-B alloy with Schottky
defects (vacancies) and substitutional defects (antisite atoms).

Label   Symbol        Species
1       $V_\alpha$    vacancies in the α sublattice
2       $V_\beta$     vacancies in the β sublattice
3       $B_\alpha$    B atoms in the α sublattice
4       $A_\beta$     A atoms in the β sublattice
5       $A_\alpha$    A atoms in the α sublattice
6       $B_\beta$     B atoms in the β sublattice





such an extension. Although a variety of more sophisticated statistical methods
are available today (to name a few, the cluster variation method, Calphad
approach, and Monte Carlo simulation), the Bragg-Williams (B-W) theory is
simple and thus useful to see how the structure and the atomic interaction affect
the defect formation.³⁾


Definition of the problem We consider formation of vacancies and antisite atoms
in a binary ordered alloy A$_m$B$_n$. The sublattices for the species A and B are referred
to as α and β, respectively. The system consists of six species (four point
defects and two regular atoms), as listed in Table 5.2. We will refer to these species
by the labels 1 to 6 as given in the table, and denote their numbers by $N_i$
($i = 1, 2, \ldots, 6$). Let us consider a system consisting of $N$ unit cells. Here, a unit
cell is not necessarily a crystallographic unit cell but is defined as a structural unit
containing $m$ α sites and $n$ β sites. Letting the numbers of A and B atoms be $N_A$
and $N_B$, respectively, the composition (i.e., the mole fractions) is defined as














$$x_A = \frac{N_A}{N_A + N_B} = \frac{c_A}{c_A + c_B} \tag{36}$$

$$x_B = \frac{N_B}{N_A + N_B} = \frac{c_B}{c_A + c_B} \tag{37}$$

where $c_A = N_A/N$ and $c_B = N_B/N$.

3) The B-W model is known to predict a much too high order-disorder transition
temperature, but it is not too bad at describing the thermodynamics of ordered
phases as long as the degree of order is fairly high. The overestimation of the
transition temperature comes primarily from inaccuracy in the free energy of the
disordered phase; short-range order must be important in the disordered phase but
is not taken into account in the theory.
The problem to be solved is to determine the equilibrium numbers of the four
defect species at a given composition and temperature. The number of components
in the system is two, but is formally three with vacancies as the third
component. The problem is one in the statistical thermodynamics of a ternary
system, where we are to find equilibrium distributions of the three components
in a crystal comprising two sublattices. It should be noted here that not all of the
$N_i$ are independent. First, the numbers of atoms are conserved:

$$N_4 + N_5 = N_A \tag{38}$$

$$N_3 + N_6 = N_B \tag{39}$$


On the other hand, the numbers of vacancies are not conserved, i.e., the size of 
the system, N, is not constant. Nevertheless, for the ordered structure to be main- 
tained, the ratio of the numbers of the sublattice sites must satisfy Eqs. (40) and 
(41). 


$$N_1 + N_3 + N_5 = mN \tag{40}$$

$$N_2 + N_4 + N_6 = nN \tag{41}$$


There are formally seven variables, $N_i$ and $N$, but only three are independent because
of these four constraints. If the last two conditions, Eqs. (40) and (41), are
combined into a single equation,

$$\frac{N_1 + N_3 + N_5}{m} = \frac{N_2 + N_4 + N_6}{n} \tag{42}$$

we have six formal variables ($N_i$) with three constraints, again leaving three independent
variables.
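
The bookkeeping of Eqs. (38)-(42) can be made concrete with a small Python sketch. The function `derived_quantities` and the sample numbers are hypothetical; the labels 1 to 6 follow Table 5.2.

```python
from fractions import Fraction

def derived_quantities(n_i, m, n):
    """n_i = (N1, ..., N6) in the labeling of Table 5.2; returns (N, N_A, N_B, x_B)."""
    n1, n2, n3, n4, n5, n6 = n_i
    alpha_sites, beta_sites = n1 + n3 + n5, n2 + n4 + n6
    # Eq. (42): both sublattices must correspond to the same number of unit cells.
    if alpha_sites * n != beta_sites * m:
        raise ValueError("site numbers violate the sublattice ratio m:n")
    cells = Fraction(alpha_sites, m)            # N from Eq. (40)
    n_a, n_b = n4 + n5, n3 + n6                 # Eqs. (38) and (39)
    return cells, n_a, n_b, Fraction(n_b, n_a + n_b)

# Hypothetical A3B crystal (m = 3, n = 1) with a few vacancies and antisite atoms.
print(derived_quantities((2, 1, 3, 4, 295, 95), m=3, n=1))
```

Once $N_A$ and $N_B$ are fixed, only three of the six $N_i$ can be chosen freely, for example $N_1$, $N_3$ and $N_4$; the remaining numbers then follow from the constraints.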


Governing equations There are a few different ways of determining the defect 
equilibria, all of which indeed give identical results (Schott and Fähnle 1997; 
Hagen and Finnis 1998; Mishin and Herzig 2000). The methods of the canonical 
ensemble and the grand canonical ensemble are most commonly employed, in 
which the numbers of atoms and the chemical potentials, respectively, are given 
as inputs. Here we outline the first formalism to derive the governing equations, 
along the same lines as Allnatt and Lidiard (1993) and Hagen and Finnis (1998). 

First we write the free energy of a solid that consists of N unit cells and con- 
tains vacancies and antisite atoms at temperature T: 


$$G(N, N_i; T) = G_0(N; T) + \sum_{i=1}^{4} N_i g_i^{\mathrm{f}} - T S_c \tag{43}$$


Fig. 5.16 Changes in atomic configuration for which the free energy of
formation $g_i^{\mathrm{f}}$ of a point defect species $i$ is defined in the grand
canonical ensemble formalism.

The first term on the right-hand side is the free energy of a defect-free crystal, the
second term is the change in the free energy due to the defects, and the last term
is the contribution of the configurational entropy. The free energy of formation of
a point defect, $g_i^{\mathrm{f}}$, is defined, as illustrated in Fig. 5.16, as the difference in the
free energy between a defect-free crystal and a crystal consisting of the same number
of cells with a single defect $i$. On the assumption that the defects are distributed
randomly in each of the sublattices, the configurational entropy $S_c$ is given
by Eq. (44).


$$S_c = k_B \ln\left[\frac{(mN)!}{N_1!\,N_3!\,N_5!}\cdot\frac{(nN)!}{N_2!\,N_4!\,N_6!}\right] \tag{44}$$





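After substituting this expression into the free energy with the Stirling approximation,
we incorporate the three constraints, Eqs. (42), (38) and (39), with Lagrange
multipliers $\lambda_1$, $\lambda_2$, $\lambda_3$:

$$\begin{aligned}
G ={}& G_0 + \sum_{i=1}^{4} N_i g_i^{\mathrm{f}} - k_B T\left(mN\ln mN + nN\ln nN - \sum_{i=1}^{6} N_i \ln N_i\right) \\
&+ \lambda_1\left(\frac{N_1+N_3+N_5}{m} - \frac{N_2+N_4+N_6}{n}\right) \\
&+ \lambda_2 (N_4 + N_5 - N_A) + \lambda_3 (N_3 + N_6 - N_B)
\end{aligned} \tag{45}$$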
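
The quality of the Stirling approximation used in this step can be checked directly. The Python sketch below compares $S_c/k_B$ from Eq. (44), with the factorials evaluated through the log-gamma function, against the Stirling form that enters Eq. (45); the helper names and the sample numbers $N_1, \ldots, N_6$ are hypothetical.

```python
import math

def _ln_factorial(x):
    return math.lgamma(x + 1.0)

def _x_ln_x(x):
    return x * math.log(x) if x > 0 else 0.0

def entropy_exact(n_i, m, n):
    """S_c / k_B from Eq. (44), with factorials evaluated through log-gamma."""
    cells = (n_i[0] + n_i[2] + n_i[4]) / m        # N from Eq. (40)
    return (_ln_factorial(m * cells) + _ln_factorial(n * cells)
            - sum(_ln_factorial(x) for x in n_i))

def entropy_stirling(n_i, m, n):
    """Same quantity with ln x! ~ x ln x - x; the linear terms cancel."""
    cells = (n_i[0] + n_i[2] + n_i[4]) / m
    return (_x_ln_x(m * cells) + _x_ln_x(n * cells)
            - sum(_x_ln_x(x) for x in n_i))

numbers = (1_000, 2_000, 50_000, 40_000, 949_000, 958_000)   # N1..N6, hypothetical
exact = entropy_exact(numbers, 1, 1)
approx = entropy_stirling(numbers, 1, 1)
print(exact, approx, abs(exact - approx) / exact)
```

The linear terms of the Stirling formula cancel because the $N_i$ sum to $(m+n)N$, which is why only the $x \ln x$ terms appear in the free energy.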





The condition for thermal equilibrium, $(\partial G/\partial N_i)_{N_j,\lambda_j} = 0$, gives a set of six
simultaneous equations for the six numbers $N_i$. By eliminating $\lambda_1$, $\lambda_2$, and $\lambda_3$ from
those equations, one obtains, after some mathematical manipulations,
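$$\left(\frac{N_1}{mN}\right)^{m}\left(\frac{N_2}{nN}\right)^{n} = \exp\left(-\frac{g_S}{k_B T}\right) \tag{46}$$

$$\frac{N_2 N_3}{N_1 N_6} = \exp\left(-\frac{g_{XB}}{k_B T}\right) \tag{47}$$

$$\frac{N_1 N_4}{N_2 N_5} = \exp\left(-\frac{g_{XA}}{k_B T}\right) \tag{48}$$

where

$$g_S = \mu_0 + m g_1^{\mathrm{f}} + n g_2^{\mathrm{f}} \tag{49}$$

$$g_{XB} = g_2^{\mathrm{f}} + g_3^{\mathrm{f}} - g_1^{\mathrm{f}} \tag{50}$$

$$g_{XA} = g_1^{\mathrm{f}} + g_4^{\mathrm{f}} - g_2^{\mathrm{f}} \tag{51}$$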
Meg tu — 8 (51) 





Equations (46), (47), and (48) are the basic formulae determining the equilibrium
defect concentrations, called respectively the Schottky product equation, B-antisite
disorder equation, and A-antisite disorder equation. They describe the equilibria of the
reactions

$$(\text{null}) \rightarrow mV_\alpha + nV_\beta \tag{52}$$

$$V_\alpha + B_\beta \rightarrow B_\alpha + V_\beta \tag{53}$$

$$A_\alpha + V_\beta \rightarrow V_\alpha + A_\beta \tag{54}$$

We may note that combining Eqs. (47) and (48) yields

$$\frac{N_3 N_4}{N_5 N_6} = \exp\left(-\frac{g_{XB} + g_{XA}}{k_B T}\right) \tag{55}$$

which pertains to production of a pair of antisite atoms,

$$A_\alpha + B_\beta \rightarrow B_\alpha + A_\beta \tag{56}$$


This reaction is apparently the same as the problem discussed in the original
work of Bragg and Williams (1934, 1935) on the order-disorder transition. However,
we must also consider vacancies in calculating $g_{XB}$ and $g_{XA}$. The energies $g_S$, $g_{XB}$, and
$g_{XA}$ govern the defect equilibria and are functions of interatomic interaction energies.
They are at the same time functions of the defect concentrations, since the
energy to create a point defect at a particular site depends on the atomic configuration
around it.


4) Note that a Schottky defect in a compound A$_m$B$_n$ consists of a set of $m$ $V_\alpha$ and $n$ $V_\beta$.
The Bragg-Williams approximation Now we shall derive expressions for the governing
equations and the energy parameters $g_S$, $g_{XB}$, and $g_{XA}$ using the B-W approximation.⁵⁾
For simplicity we take into account the atomic interactions between
nearest neighbors only. Let us denote the interaction energy between a
neighboring pair of constituent species p and q by $\varepsilon_{pq}$, where p and q are each
an atom A, an atom B, or a vacancy V. An analytic expression for the free energy can
readily be derived by summing the pair interaction energies $\varepsilon_{pq}$ for the energy
term and adopting Eq. (44) for the configurational entropy. For example, for an
AB alloy of the L1$_0$ structure, it is found to be





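$$\begin{aligned}
G ={}& -\frac{2}{N}\Big\{(N_A N_B + N_3 N_4 + N_5 N_6)\,v_{AB} + [N_A(N_1 + N_2) + N_1 N_4 + N_2 N_5]\,v_{AV} \\
&\qquad + [N_B(N_1 + N_2) + N_2 N_3 + N_1 N_6]\,v_{BV}\Big\} \\
&+ 6\,[N_A \varepsilon_{AA} + N_B \varepsilon_{BB} + (N_1 + N_2)\,\varepsilon_{VV}] \\
&- k_B T\left(2N \ln N - \sum_{i=1}^{6} N_i \ln N_i\right)
\end{aligned} \tag{57}$$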


Here, we have introduced the effective pair interaction (EPI) energies 





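$$v_{AB} = \varepsilon_{AA} + \varepsilon_{BB} - 2\varepsilon_{AB} \tag{58}$$

$$v_{AV} = \varepsilon_{AA} + \varepsilon_{VV} - 2\varepsilon_{AV} \tag{59}$$

$$v_{BV} = \varepsilon_{BB} + \varepsilon_{VV} - 2\varepsilon_{BV} \tag{60}$$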


Each of them is a parameter that dictates the thermodynamic properties of the
p-q binary system. Figure 5.17(a) explains schematically the energy $v_{AB}$ in an ordered
alloy AB (cf. Chapter 7). When a pair of antisite atoms $B_\alpha$ and $A_\beta$ is introduced
into an ideally ordered crystal, the internal energy changes by $zv_{AB}$. Since
$v_{AB}$ is positive for a system of ordering tendency, this change in energy is positive.
Figure 5.17(b) shows the corresponding process for a hypothetical "binary compound"
AV, in which the atoms B in Fig. 5.17(a) have been replaced by vacancies;
the energy change is accordingly $zv_{AV}$. The EPI energy between an atom and a
vacancy is assumed to be negative in most cases, but this is not guaranteed.
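
As an illustration of Eqs. (58)-(60), the Python sketch below computes the three EPI energies from a set of nearest-neighbor bond energies. All numerical values are hypothetical, and setting every bond energy that involves a vacancy to zero is an assumption made only for this example.

```python
def epi(e_pp, e_qq, e_pq):
    """Effective pair interaction v_pq = e_pp + e_qq - 2 e_pq, Eqs. (58)-(60)."""
    return e_pp + e_qq - 2.0 * e_pq

# Hypothetical bond energies in arbitrary units (attractive bonds are negative).
e_aa, e_bb, e_ab = -1.0, -0.8, -1.3
e_vv = e_av = e_bv = 0.0        # assumption: bonds to a vacancy carry no energy

v_ab = epi(e_aa, e_bb, e_ab)    # about +0.8: unlike bonds are favored (ordering)
v_av = epi(e_aa, e_vv, e_av)    # equals e_aa, hence negative
v_bv = epi(e_bb, e_vv, e_bv)    # equals e_bb, hence negative

z = 8                           # co-ordination number of the sketch in Fig. 5.17
print(v_ab, v_av, v_bv, z * v_ab)
```

With this choice the atom-vacancy EPI energies simply equal the like-atom bond energies, which shows why they are expected to be negative when the bonds are attractive.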

5) The mathematical treatment given here is formally identical to the "smeared
displacement model" of Cheng et al. (1967).

Fig. 5.17 Schematic illustration of the effective pair interaction (EPI)
energy $v_{AB}$. (a) In a compound AB of co-ordination number $z$, creating
a pair of antisite atoms increases the energy of the crystal by
$z\varepsilon_{AA} + z\varepsilon_{BB} - 2z\varepsilon_{AB} = zv_{AB}$. (b) The change in energy by an exchange
of an atom A and a vacancy V in a hypothetical binary compound AV is
similarly given by $zv_{AV}$.

To determine the equilibrium defect concentrations, we apply the standard condition
that partial derivatives of $G$ with respect to pertinent variables must be simultaneously
zero. As three independent variables, it is convenient to choose the
number of unit cells, $N$, the number of $B_\alpha$, $N_3$, and the number of $A_\beta$, $N_4$. The
conditions $(\partial G/\partial N)_{N_3,N_4} = 0$, $(\partial G/\partial N_3)_{N,N_4} = 0$ and $(\partial G/\partial N_4)_{N,N_3} = 0$ give, in
fact, equations corresponding to (46), (47), and (48), respectively. The expressions
for the energy parameters $g_S$, $g_{XB}$, and $g_{XA}$ are found from these three conditions.
For AB alloys of the L1$_0$ structure, they are given as





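$$\begin{aligned}
g_S ={}& 2\big\{(c_A c_B + c_3 c_4 + c_5 c_6)\,v_{AB} + [c_A(c_1 + c_2 - 3) + c_1 c_4 + c_2 c_5]\,v_{AV} \\
&\quad + [c_B(c_1 + c_2 - 3) + c_2 c_3 + c_1 c_6]\,v_{BV}\big\} + 12\,\varepsilon_{VV}
\end{aligned} \tag{61}$$

$$g_{XB} = 2\,[(c_5 - c_4)(v_{AB} - v_{AV}) + (c_6 - c_3 + c_1 - c_2)\,v_{BV}] \tag{62}$$

$$g_{XA} = 2\,[(c_6 - c_3)(v_{AB} - v_{BV}) + (c_5 - c_4 - c_1 + c_2)\,v_{AV}] \tag{63}$$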


where $c_i$ are site fractions defined as $c_i = N_i/N$. In Eq. (61), there is a term containing
$\varepsilon_{VV}$, but $\varepsilon_{VV}$ can be taken to be zero, which is equivalent to defining the
vacuum as the reference energy level. We now see that the equilibrium concentrations
of the defects are determined completely by $v_{AB}$, $v_{AV}$, and $v_{BV}$. For a given
set of the three EPI energies, composition, and temperature, numerical solutions
of $c_i$ that satisfy the governing equations can be found by iteration or by trial and
error. If desired, one may empirically take into account possible effects of vibrational
entropy, by adding to $g_S$ a term $-T(m+n)s_V$, where $s_V$ is the average
contribution per vacancy.
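
One possible way of carrying out such an iteration for the L1$_0$ structure is sketched below in Python. It is a minimal illustration under stated assumptions, not the procedure used for the figures: the energies are taken as $g_S$, $g_{XB}$, $g_{XA}$ of Eqs. (61)-(63) in units of $v_{AB}$ with $\varepsilon_{VV} = 0$, the temperature is $t = k_BT/v_{AB}$, the mass-action equations (46)-(48) with $m = n = 1$ are solved by bisection on $\ln(c_1/c_2)$ for fixed energies, and the energies are then updated with simple damping. The names `energies`, `mass_action` and `solve_l10` are hypothetical, and the scheme is only expected to converge for well-ordered states at temperatures clearly below the B-W transition.

```python
import math

def energies(c, v_av, v_bv):
    """g_S, g_XB, g_XA of Eqs. (61)-(63) in units of v_AB (epsilon_VV = 0)."""
    c1, c2, c3, c4, c5, c6 = c
    c_a, c_b = c4 + c5, c3 + c6
    g_s = 2.0 * ((c_a * c_b + c3 * c4 + c5 * c6)
                 + (c_a * (c1 + c2 - 3.0) + c1 * c4 + c2 * c5) * v_av
                 + (c_b * (c1 + c2 - 3.0) + c2 * c3 + c1 * c6) * v_bv)
    g_xb = 2.0 * ((c5 - c4) * (1.0 - v_av) + (c6 - c3 + c1 - c2) * v_bv)
    g_xa = 2.0 * ((c6 - c3) * (1.0 - v_bv) + (c5 - c4 - c1 + c2) * v_av)
    return g_s, g_xb, g_xa

def mass_action(k_s, k_b, k_a, x_b):
    """Solve Eqs. (46)-(48) for fixed right-hand sides by bisection on ln(c1/c2)."""
    lo, hi = -60.0, 60.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        r = math.exp(mid)
        c1, c2 = math.sqrt(k_s * r), math.sqrt(k_s / r)   # c1 * c2 = k_s
        a, b = k_b * r, k_a / r                           # a = c3/c6, b = c4/c5
        denom = 1.0 - k_a * k_b
        c5 = (1.0 - c1 - a * (1.0 - c2)) / denom          # from the site balances
        c6 = (1.0 - c2 - b * (1.0 - c1)) / denom
        if c1 >= 1.0 or c5 <= 0.0:
            hi = mid                                      # too B-rich
        elif c2 >= 1.0 or c6 <= 0.0:
            lo = mid                                      # too A-rich
        elif (a * c6 + c6) / (a * c6 + b * c5 + c5 + c6) > x_b:
            hi = mid
        else:
            lo = mid
    return [c1, c2, a * c6, b * c5, c5, c6]

def solve_l10(x_b, t, v_av, v_bv, mix=0.5, tol=1e-12, max_iter=1000):
    """Damped self-consistent iteration; returns the site fractions [c1, ..., c6]."""
    c = [0.0, 0.0, 0.0, 0.0, 1.0, 1.0]                    # start from perfect order
    for _ in range(max_iter):
        g_s, g_xb, g_xa = energies(c, v_av, v_bv)
        new = mass_action(math.exp(-g_s / t), math.exp(-g_xb / t),
                          math.exp(-g_xa / t), x_b)
        step = max(abs(p - q) for p, q in zip(new, c))
        c = [(1.0 - mix) * q + mix * p for p, q in zip(new, c)]
        if step < tol:
            return c
    raise RuntimeError("self-consistent iteration did not converge")

# EPI ratios v_AV/v_AB = -1 and v_BV/v_AB = -0.5, stoichiometric composition.
print(solve_l10(x_b=0.5, t=0.5, v_av=-1.0, v_bv=-0.5))
```

Bisection is used for the inner step because, for fixed energies, the composition varies monotonically with $c_1/c_2$; a Newton solver on all unknowns would be faster but needs a good starting point.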

From Eqs. (62) and (63), we find that the energy controlling the production of 
antisite defects, reaction (56), is given as 





$$g_{XB} + g_{XA} = 2\,\{[c_5 + c_6 - (c_3 + c_4)]\,v_{AB} + (c_1 - c_2)(v_{BV} - v_{AV})\} \tag{64}$$
In the B-W theory of order-disorder transition, the long-range order (LRO) parameter
is used to describe the antisite disorder. For a binary alloy containing vacancies,
the LRO parameter may be defined as (Cheng et al. 1967)

$$\eta = \frac{1}{2}\,\frac{m+n}{c_A + c_B}\left(\frac{c_5 - c_3}{m} + \frac{c_6 - c_4}{n}\right) \tag{65}$$

which describes the regularity in the distribution of atoms. With this definition,
the site fractions of the antisite atoms are written as Eqs. (66) and (67).

$$c_3 = \frac{mn}{m+n}\left[\frac{c_B}{n} - \frac{c_A + c_B}{m+n}\,\eta - \frac{1}{2}\left(\frac{c_1}{m} - \frac{c_2}{n}\right)\right] \tag{66}$$

$$c_4 = \frac{mn}{m+n}\left[\frac{c_A}{m} - \frac{c_A + c_B}{m+n}\,\eta + \frac{1}{2}\left(\frac{c_1}{m} - \frac{c_2}{n}\right)\right] \tag{67}$$

Similar expressions for $c_5$ and $c_6$ can readily be derived. Using $\eta$, Eq. (64) is rewritten as

$$g_{XB} + g_{XA} = 4\left(1 - \frac{c_1 + c_2}{2}\right)\eta\,v_{AB} - 2(c_1 - c_2)(v_{AV} - v_{BV}) \tag{68}$$

If $c_1$ and $c_2$ are neglected, it indeed reduces to the familiar B-W expression for
the energy governing the L1$_0$-A1 transition. The formation energy for the
Schottky defect can also be expressed using $\eta$, as


2 2 
cG+c C—C 
(: t 2) ( : 8) [oa 
a —¢ G+ 
(el 
Cy — (2 C1 + C2 Cy — C2 : 
| (za 7 n\(1 7 ) ( 7 Ne 


As mentioned earlier, $v_{AB}$ is positive in ordering systems, and $v_{AV}$ and $v_{BV}$ are
usually negative and of similar magnitude to $v_{AB}$. Hence we see from these expressions
that, at low defect concentrations ($c_1, c_2 \to 0$, $\eta \to 1$), the formation energy
of a Schottky defect ($\approx 4v_{AB} - 6v_{AV} - 6v_{BV}$) is several times as high as that
of a pair of antisite atoms ($\approx 4v_{AB}$) in L1$_0$ ordered alloys.
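
The relation between the LRO parameter and the antisite site fractions can be verified numerically. The Python sketch below assumes the definition $\eta = \tfrac{1}{2}\,\frac{m+n}{c_A+c_B}\left[(c_5-c_3)/m + (c_6-c_4)/n\right]$ for Eq. (65) and the corresponding expressions for $c_3$ and $c_4$ in Eqs. (66) and (67); the function names and the sample site fractions are hypothetical.

```python
def lro_parameter(c, m, n):
    """Long-range order parameter eta from the site fractions c = (c1, ..., c6)."""
    c1, c2, c3, c4, c5, c6 = c
    c_a, c_b = c4 + c5, c3 + c6
    return 0.5 * (m + n) / (c_a + c_b) * ((c5 - c3) / m + (c6 - c4) / n)

def antisite_fractions(eta, c1, c2, c_a, c_b, m, n):
    """Invert the definition of eta: returns (c3, c4)."""
    pref = m * n / (m + n)
    order = (c_a + c_b) / (m + n) * eta
    vac = 0.5 * (c1 / m - c2 / n)
    return pref * (c_b / n - order - vac), pref * (c_a / m - order + vac)

# Hypothetical A3B state (m = 3, n = 1): c1 + c3 + c5 = 3 and c2 + c4 + c6 = 1.
c = (0.01, 0.02, 0.05, 0.10, 2.94, 0.88)
eta = lro_parameter(c, 3, 1)
print(eta, antisite_fractions(eta, c[0], c[1], c[3] + c[4], c[2] + c[5], 3, 1))
```

For a perfectly ordered, vacancy-free crystal the function returns $\eta = 1$, and for a random distribution of atoms over both sublattices it returns $\eta = 0$, for any $m$ and $n$.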

The exact expressions for the energies of the governing equations depend
on the crystal structure, namely the chemical formula, $m$ and $n$, and the co-ordination
numbers. Figure 5.18 shows some common ordered structures, and
Table 5.3 summarizes the crystal-geometry parameters $m$, $n$ and nearest-neighbor
co-ordination numbers $z_{pq}$. In the D0$_3$ structure, the β sublattice sites in the B2





gs = [aac 























12evv (69) 
Fig. 5.18 Common structures of binary ordered alloys: (a) B2 (CsCl-type,
cubic), (b) D0$_3$ (Fe$_3$Al-type, cubic), (c) L1$_0$ (CuAu-type,
tetragonal), (d) L1$_2$ (Cu$_3$Au-type, cubic), (e) B19 (MgCd-type,
orthorhombic), (f) D0$_{19}$ (Ni$_3$Sn-type, hexagonal). a, b, and c are the
lattice parameters.
Table 5.3 Crystal-geometry parameters of common ordered structures.[a]

Structure               m   n   $z_{AA}$        $z_{AB}$        $z_{BA}$   $z_{BB}$
L1$_0$ (CuAu)           1   1   4               8               8          4
B19 (AuCd)              1   1   4               8               8          4
B2 (CsCl)               1   1   0               8               8          0
D0$_3$ (Fe$_3$Al)       3   1   4 (α), 8 (γ)    4 (α), 0 (γ)    8          0
D0$_{19}$ (Ni$_3$Sn)    3   1   8               4               12         0
D0$_{22}$ (Al$_3$Ti)    3   1   8               4               12         0
L1$_2$ (Cu$_3$Au)       3   1   8               4               12         0

[a] $z_{pq}$ are the numbers of atoms q around an atom p.
Table 5.4 Effective pair interaction energies used for example
calculations of the defect concentrations.

Model   $v_{AV}/v_{AB}$   $v_{BV}/v_{AB}$   Note
I       $-1$              $-0.5$            Model for TiAl, Ni$_3$Al
II      $-1$              $-0.05$           NiAl, NiGa, CoAl, CoGa





structure are subdivided into β sites for B atoms and γ sites for A atoms, and $z_{AA}$
and $z_{AB}$ are different for A atoms in the α sites and in the γ sites. In quite a few
structures we find $z_{BB} = 0$, which, as we will see below, gives rise to unusual
trends in point defect formation. It is also seen in the table that these geometrical
parameters are identical for the L1$_0$ and B19 structures, and this is also the case for
the L1$_2$, D0$_{19}$, and D0$_{22}$ structures. Therefore, the equations determining the statistical
distribution of point defects are the same in each of these groups of structures
in the nearest-neighbor pair interaction model.⁶⁾
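
The grouping of structures noted above follows mechanically from the tabulated parameters. The Python sketch below stores the rows of Table 5.3 in a dictionary and collects structures with identical entries; the D0$_3$ row is left out because its A atoms occupy two inequivalent site types, and the names `GEOMETRY` and `degenerate_groups` are illustrative.

```python
from collections import defaultdict

# (m, n, z_AA, z_AB, z_BA, z_BB) as listed in Table 5.3.
GEOMETRY = {
    "L1_0": (1, 1, 4, 8, 8, 4),
    "B19": (1, 1, 4, 8, 8, 4),
    "B2": (1, 1, 0, 8, 8, 0),
    "D0_19": (3, 1, 8, 4, 12, 0),
    "D0_22": (3, 1, 8, 4, 12, 0),
    "L1_2": (3, 1, 8, 4, 12, 0),
}

def degenerate_groups(geometry):
    """Group structures whose nearest-neighbor geometry parameters coincide."""
    groups = defaultdict(list)
    for name, params in geometry.items():
        groups[params].append(name)
    return sorted(sorted(names) for names in groups.values())

print(degenerate_groups(GEOMETRY))
```

Structures that fall into the same group share the same governing equations in the nearest-neighbor model, so a single calculation covers all members of the group.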


5.1.3.3 Equilibrium Concentrations — Examples 

In this section we present the equilibrium defect concentrations in L1$_0$ and B2
ordered AB alloys calculated by the extended B-W theory described above. The
L1$_0$ and B2 structures are chosen as typical examples of the distinctive behavior
of defect formation in ordered alloys. The purpose is to see the effects of atomic
arrangements (i.e., coordinations) and the EPI energies on defect formation. As
an example, we choose the two sets of EPI energies listed in Table 5.4. Model I
would represent transition-metal aluminides of close-packed structures such as
TiAl and Ni$_3$Al. The atom-atom interaction energy $v_{AB}$ can be evaluated from
the thermodynamic properties of the ordered alloy, such as the formation enthalpy
and thermodynamic activity (Chang and Neumann 1983; Ikeda et al.
1998). The vacancy-atom interaction energy, on the other hand, may be conveniently
estimated from the vacancy formation energy of the pure solid (Kim 1991).
By these procedures the parameters for Ni$_3$Al were evaluated as $v_{AB} = 0.232$ eV,
$v_{AV} = -0.293$ eV and $v_{BV} = -0.122$ eV, which give $v_{AV}/v_{AB} = -1.264$ and
$v_{BV}/v_{AB} = -0.524$ (Numakura et al. 1998). The choice of the much smaller
magnitude of $v_{BV}/v_{AB}$ for Model II, i.e., $-0.05$, is somewhat arbitrary; it is intended
to reproduce the anomalously high concentration of vacancies in B-rich B2
ordered alloys.


6) The D0$_3$ structure involves atomic ordering at the second-neighbor positions and
thus requires introduction of interactions beyond the nearest-neighbor distance in
the free energy. Such analyses of the D0$_3$ structure and the closely related L2$_1$
structure (A$_2$BC Heusler structure) are found in the literature (Schapink 1968;
Bakker and van Winkel 1980; Hasaka et al. 1973; Ipser et al. 2002; Murakami and
Kachi 1982). Also, the L1$_2$ and D0$_{22}$ structures are degenerate in the nearest-neighbor
interaction model; it is necessary to consider at least the second-neighbor
interaction to distinguish these two (Richards and Cahn 1971).

Fig. 5.19 Site fractions of vacancies and antisite atoms in L1$_0$ ordered
AB alloys, with EPI energies of Model I, at composition $x_B$ of (a) 0.45,
(b) 0.5, (c) 0.55.


L1$_0$ ordered alloys Figures 5.19 and 5.20 present the concentrations of vacancies
and antisite atoms in L1$_0$ ordered AB alloys as calculated with the EPI energies of
Model I. The three diagrams of Fig. 5.19 display the site fractions $c_i$ at compositions
$x_B$ of 0.45 (a), 0.5 (b), and 0.55 (c) as a function of the reciprocal of the reduced
temperature $k_BT/v_{AB}$.⁷⁾ The slope of each curve in Fig. 5.19 gives the effective
formation energy of the defect. It is seen that the slope depends sensitively on composition,
particularly for the antisite defects. The defect species with very low effective formation
energy may be recognized as "constitutional" defects. In this example, obviously
it is $A_\beta$ at A-rich compositions and $B_\alpha$ at B-rich compositions.


7) In the B-W model with the nearest-neighbor interaction, the order-disorder
transition from L1$_0$ to A1 (fcc solid solution) occurs at $k_BT/v_{AB} = 1$ for the stoichiometric
composition, but other advanced methods show that the transition must take place at a
temperature as low as $k_BT/v_{AB} \approx 0.5$. It is not very useful to discuss the results of the
B-W theory concerning temperatures above this "correct" transition temperature.

Fig. 5.20 Site fractions of vacancies and antisite atoms in L1$_0$ ordered
AB alloys, with EPI energies of Model I, at temperature $k_BT/v_{AB}$ of
(a) 0.3, (b) 0.5, (c) 0.7.


In turn, Fig. 5.20 shows the site fractions at temperatures $k_BT/v_{AB}$ of 0.3 (a),
0.5 (b), and 0.7 (c) as a function of composition. We clearly see that the concentrations
of $V_\alpha$ and $V_\beta$ vary with composition asymmetrically, while the curves of
the concentrations of $A_\beta$ and $B_\alpha$ are almost symmetrical. As already presented in
Fig. 5.19(b), vacancies are more abundant in the α sublattice at the stoichiometric
composition. In terms of the B-W theory, the ratio of the concentrations of the
two vacancy species is described by Eq. (70),

$$\left(\frac{c_1}{c_2}\right)^{2}\frac{c_4 c_6}{c_3 c_5} = \exp\left(\frac{g_{XB} - g_{XA}}{k_B T}\right) \tag{70}$$

which is obtained from Eqs. (47) and (48). At low defect concentrations the energy
$g_{XB} - g_{XA}$ approximately equals $4(v_{BV} - v_{AV})$ for the L1$_0$ structure. This
quantity is positive in the present case and thus biases vacancies to occupy α sublattice
sites.
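
The size of this bias is easily estimated. The Python sketch below evaluates $c_1/c_2$ in the low-defect limit, assuming in addition that $c_3 c_5 \approx c_4 c_6$, which holds near the stoichiometric composition when vacancies are few; the ratio then follows from $(c_1/c_2)^2 = \exp[4(v_{BV} - v_{AV})/k_BT]$. The function name is illustrative, and energies are expressed in units of $v_{AB}$.

```python
import math

def vacancy_ratio_low_defect(v_av, v_bv, t):
    """c1/c2 for L1_0 with c3*c5 = c4*c6 and g_XB - g_XA = 4 (v_BV - v_AV).

    v_av and v_bv are in units of v_AB, and t = k_B T / v_AB.
    """
    return math.exp(2.0 * (v_bv - v_av) / t)

for t in (0.3, 0.5, 0.7):
    print(t, round(vacancy_ratio_low_defect(-1.0, -0.5, t), 2))   # EPI ratios -1 and -0.5
```

With these EPI ratios the α-sublattice vacancies outnumber those on the β sublattice by a factor of about 7 at $k_BT/v_{AB} = 0.5$, and the imbalance grows rapidly on cooling.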





B2 ordered alloys Figures 5.21 and 5.22 show the site fractions of the four defect
species in B2 ordered AB alloys calculated using the same set of EPI parameters
(Model I) as in the previous example of L1$_0$ ordered alloys. A remarkable feature
is that the concentrations of vacancies are of the same order of magnitude
as those of antisite atoms. (The B2-A2 transition occurs at $k_BT/v_{AB} = 2$ in the
nearest-neighbor B-W theory without vacancies, but here the transition temperature
is lowered to about 1.6 by the high concentration of vacancies.) The concentrations
of vacancies that exist in thermal equilibrium are much higher than in
the L1$_0$ structure, even though the EPI energies are identical. In the present
framework, this striking difference is due solely to the differences in the coordination
numbers (Table 5.3). The energies controlling the formation of a pair of
antisite atoms and a Schottky defect in the B2 ordered alloy are given as

$$g_{XB} + g_{XA} = 8\left(1 - \frac{c_1 + c_2}{2}\right)\eta\,v_{AB} - 4(c_1 - c_2)(v_{AV} - v_{BV}) \tag{71}$$

Fig. 5.21 Site fractions of vacancies and antisite atoms in B2 ordered
AB alloys, with EPI energies of Model I, at composition $x_B$ of (a) 0.45,
(b) 0.5, (c) 0.55.
Fig. 5.23 Site fractions of vacancies and antisite atoms in B2 ordered
AB alloys, with EPI energies of Model II, at composition $x_B$ of (a) 0.45,
(b) 0.5, (c) 0.55.


Fig. 5.24 Site fractions of vacancies and antisite atoms in B2 ordered
AB alloys, with EPI energies of Model II, at temperature $k_BT/v_{AB}$ of
(a) 0.6, (b) 0.9, (c) 1.2.


They are approximately $8v_{AB}$ and $4v_{AB} - 4v_{AV} - 4v_{BV}$, respectively, at low defect
concentrations. With the present set of parameters the latter equals $10v_{AB}$, so that
the formation energy of a Schottky defect is only 25% higher than that of an
antisite-atom pair. It is also to be noted in Fig. 5.22 that the concentrations of vacancies
are not very low at A-rich compositions. In particular, the concentration of
$V_\beta$, which is often neglected in simplified models, is no lower than those of $V_\alpha$
and $B_\alpha$.
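
The contrast between the two structures can be summarized with the low-defect expressions quoted in the text: $4v_{AB}$ and $4v_{AB} - 6v_{AV} - 6v_{BV}$ for L1$_0$, and $8v_{AB}$ and $4v_{AB} - 4v_{AV} - 4v_{BV}$ for B2. The Python sketch below evaluates them for the EPI ratios $v_{AV}/v_{AB} = -1$ and $v_{BV}/v_{AB} = -0.5$; the function name is illustrative and all energies are in units of $v_{AB}$.

```python
def low_defect_energies(structure, v_av, v_bv):
    """Return (antisite-pair energy, Schottky-defect energy) in units of v_AB."""
    if structure == "L1_0":
        return 4.0, 4.0 - 6.0 * v_av - 6.0 * v_bv
    if structure == "B2":
        return 8.0, 4.0 - 4.0 * v_av - 4.0 * v_bv
    raise ValueError(f"no low-defect expressions stored for {structure!r}")

for structure in ("L1_0", "B2"):
    pair, schottky = low_defect_energies(structure, v_av=-1.0, v_bv=-0.5)
    print(structure, pair, schottky, schottky / pair)
```

The ratio of the Schottky energy to the antisite-pair energy drops from 3.25 in L1$_0$ to 1.25 in B2, which is the origin of the much higher vacancy concentrations in the B2 structure for identical EPI energies.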

Figures 5.23 and 5.24 show the equilibrium defect concentrations for the EPI
energies of Model II, where $v_{BV}/v_{AB}$ is still negative but is one tenth of that for
Model I. The "constitutional" defect species at B-rich compositions is now $V_\alpha$,
and it is the predominant defect species even at the stoichiometric composition.
Physical origins of such anomalously high vacancy concentrations are discussed
in the next section.


5.1.3.4 Abundant Vacancies in some Intermetallic Compounds 


Fig. 5.25 Schematic illustration of the formation of a triple defect in an
AB compound. A new unit cell is created at the surface, leaving a $V_\alpha$
and a $V_\beta$ in the interior, but the latter is then filled by an A atom to
convert it to $A_\beta$, leaving another $V_\alpha$.


Preference of defect species Intermetallic compounds of the B2 structure used to
be classified into two types according to the trends in point defect formation:
those of substitutional disorder and those of "triple defects" (Chang and Neumann 1983). In
the former class of materials, antisite atoms are always the predominant defect
species. In the latter, on the other hand, while the major defect species is $A_\beta$ at
A-rich compositions, it is $V_\alpha$ at B-rich compositions. These two trends are related to
the relative ease of the reaction

$$A_\alpha + B_\beta \rightarrow B_\alpha + A_\beta \tag{73}$$

and an alternative,

$$A_\alpha + B_\beta \rightarrow 2V_\alpha + A_\beta + B_\beta \tag{74}$$

In the latter, three defects, i.e., "triple defects", are created, avoiding $V_\beta$ and $B_\alpha$, as
illustrated in Fig. 5.25. By comparing the right-hand sides of the two reactions,
we see that the trends in defect formation are essentially determined by the preference
between $B_\alpha$ and $V_\alpha$, pertaining to the balance of the reaction
$B_\alpha \rightleftharpoons 2V_\alpha + B_\beta$. This balance is formally represented by the following reaction, which is
derived by combining reaction (52) and the inverse reaction of (53):





nBy + nVg = (m+ n)Va + nVg + nBg (75) 
The energy governing this reaction is given by 
AE, = gs — ngxg (76) 


In the nearest-neighbor pair interaction model, the right-hand side is approximately
given by $-zv_{BV}$, or $-8v_{BV}$ for the B2 structure, when defect concentrations
are low. Antisite atoms $B_\alpha$ are dominant as long as $\Delta E_\alpha$ is
positive, but when it is small in magnitude, vacancies $V_\alpha$ are formed as much
as $B_\alpha$. If $\Delta E_\alpha$ turns negative, vacancies $V_\alpha$ become the
major defect species.

NiAl, NiGa, CoAl, and CoGa, which are said to be typical triple-defect compounds,
are chemically stable intermetallic phases which remain ordered right up to the
melting temperature. These materials have large formation enthalpy, and thus large
positive EPI energy $v_{AB}$. The atom-vacancy EPI energy $v_{AV}$ is expected, from
their thermodynamic properties (Chang and Neumann 1983), to be negative and its
magnitude is as large as $v_{AB}$. On the other hand, $v_{BV}$ could be much smaller
in magnitude, or even slightly positive in these materials (Chang and Neumann 1983;
Kim 1992); the bonding between B atoms at the nearest-neighbor positions is suggested
to be very weak, or the interaction could possibly be repulsive, because an $\alpha$
site (surrounded by B atoms) might be too small to accommodate a B atom (Cottrell
1995; Meyer and Fähnle 1999; Korzhavyi et al. 2000). This leads to preference for
$V_\alpha$ over $B_\alpha$ according to the criterion $\Delta E_\alpha = -8v_{BV}$,
and explains the high concentration of $\alpha$ vacancies observed in these materials.

Table 5.5 The energy governing the preference of point defect species in
$\alpha$ and $\beta$ sublattices in an ordered alloy A$_m$B$_n$. Antisite atoms or
vacancies predominate if $\Delta E$ is positive or negative, respectively.

Structure                $\Delta E_\alpha = g_S - n g_{XB}$      $\Delta E_\beta = g_S - m g_{XA}$
B2 (m = n = 1)           $-8v_{BV}$                              $-8v_{AV}$
L1$_0$ (m = n = 1)       $2(v_{AB} - 2v_{AV} - 4v_{BV})$         $2(v_{AB} - 4v_{AV} - 2v_{BV})$
L1$_2$ (m = 3, n = 1)    $4(v_{AB} - 4v_{AV} - 2v_{BV})$         $-24v_{AV}$
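The entries of Table 5.5 are easy to evaluate numerically. The following Python sketch does so for a given set of EPI energies; the function name `delta_e` is hypothetical, and the parameter sets are assumptions inferred from the text ($v_{AV}/v_{AB} = -1$ in both models, $v_{BV}/v_{AB} = -0.5$ for Model I so that the Schottky energy equals $10v_{AB}$, and one tenth of that $v_{BV}$ for Model II).

```python
# Approximate defect-preference energies of Table 5.5 (nearest-neighbor
# pair interaction model, low defect concentrations), in units of v_AB.

def delta_e(structure, v_ab, v_av, v_bv):
    """Return (dE_alpha, dE_beta) for one of the structures in Table 5.5."""
    table = {
        "B2": (-8.0 * v_bv, -8.0 * v_av),
        "L10": (2.0 * (v_ab - 2.0 * v_av - 4.0 * v_bv),
                2.0 * (v_ab - 4.0 * v_av - 2.0 * v_bv)),
        "L12": (4.0 * (v_ab - 4.0 * v_av - 2.0 * v_bv),
                -24.0 * v_av),
    }
    return table[structure]

# Assumed parameter sets (see lead-in): energies in units of v_AB.
model_1 = delta_e("B2", 1.0, -1.0, -0.5)
model_2 = delta_e("B2", 1.0, -1.0, -0.05)
print("Model I  (dE_alpha, dE_beta):", model_1)
print("Model II (dE_alpha, dE_beta):", model_2)
```

With these assumed values $\Delta E_\alpha$ stays positive in both models but drops by a factor of ten in Model II, which is in line with the growing weight of $V_\alpha$ relative to $B_\alpha$ seen in Figs. 5.23 and 5.24.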

Preference of defect species in the $\beta$ sublattice can be discussed similarly on
the basis of a reaction analogous to (75). The energy that controls the preference
between $A_\beta$ and $V_\beta$ is given as

$$\Delta E_\beta = g_S - m g_{XA} \tag{77}$$

Approximate expressions for $\Delta E_\alpha$ and $\Delta E_\beta$ at low defect
concentrations are listed in Table 5.5 for the B2, L1$_0$, and L1$_2$ structures. The
expression for $\Delta E_\beta$ for the L1$_2$ structure is similar to the case of the
B2 structure discussed above: it is determined solely by $v_{AV}$. This predicts that,
in an L1$_2$ ordered A$_3$B alloy composed of elements A and B of low and high vacancy
formation energies such as Al$_3$Ti,$^{8)}$ high concentrations of vacancies on $\beta$
sublattice sites may occur. From these two examples, it is seen that abundant vacancies
arise from the characteristic atomic configurations: when a particular site is
surrounded by a single species of atoms, an antisite atom of the same species might be
unfavored, and at high temperatures, where gaining configurational entropy is
important, the site could rather be made vacant. By referring to Table 5.5, such a
configuration is found also in the D0$_3$ structure, in which $z_{\beta\beta} = 0$ as in
the B2 and L1$_2$ structures. By experiment, it is known that Ni$_3$Sb exists at high
temperatures over a certain range of Sb-rich compositions, in which the defects
responsible for the deviation from stoichiometry are almost entirely vacancies on Ni
sites (Heumann and Stüer 1966). This is the reason for the anomalously fast diffusion
in this compound shown in Fig. 5.14.

8) The structure of Al$_3$Ti is D0$_{22}$, which is described by stacking two L1$_2$
unit cells with an anti-phase boundary, but the nearest-neighbor coordination numbers
are identical with L1$_2$.


“Structural vacancies”: a remark on terminology  The abundant vacancies observed in
the “triple-defect” B2 compounds, NiAl, NiGa, CoAl, and CoGa, are essential in
realizing the wide range of compositional stability at high temperatures, and are
often referred to as “structural,” “constitutional,” or “compositional” vacancies,
contrasting with “thermal” vacancies (de Novion 1995; Chang and Neumann 1983). When a
compound phase exists over a certain composition range, the defects responsible for
the deviation from the stoichiometry are generally called “constitutional defects” or
“structural defects.” In the transition-metal (TM) aluminides and gallides mentioned
above, $A_\beta$ is always the primary defect species (i.e., the constitutional
defects) at TM-rich compositions. On the other hand, at TM-deficient compositions,
vacancies $V_\alpha$ are often found to exceed antisite atoms $B_\alpha$. There have
been discussions on which of these defects is more favored in a particular material,
or whether those constitutional defects are stable at all, in the framework of
statistical thermodynamics (Kim 1988; Ren and Otsuka 2000; Kim 1992; Breuer et al.
2002). However, it should be noted that statistically distributed point defects cannot
exist at absolute zero temperature according to the third law of thermodynamics;
statistical thermodynamics is not very useful for discussing “stability” of point
defects.

The composition of a real material is never exactly stoichiometric. A single-phase
compound would eventually separate into two phases on lowering the temperature, most
probably to the original compound phase with a more ordered structure and a
neighboring phase in the phase diagram. The latter can be another compound phase, or a
terminal elementary phase of one of the two components. In the Ni-Al system, there
exists a compound phase Ni$_2$Al$_3$ next to the B2-ordered NiAl. The structure of
Ni$_2$Al$_3$ is described as a slightly distorted B2 structure in which every third
plane of nickel atoms perpendicular to the $\langle 111 \rangle$ direction is
systematically absent (Taylor and Doyle 1972). Similar vacancy-ordered phases are
known to occur in the Ni-Ga system: Ni$_3$Ga$_4$ (Ellner et al. 1969) and Ni$_2$Ga$_3$
(Hellner 1950). The Ni vacancies in these phases deserve the name of “structural”
vacancies, as they are elements of specific crystal structures.

The presence of these vacancy-ordered phases, in which vacancies continue to exist, is
consistent with the predominance of vacancy defects over antisite atoms in these B2
compounds at high temperatures, but those vacancies in the B2 phase are in principle
not allowed to remain at low temperatures. In practice, however, the compound may
remain single-phase with supersaturated point defects, since the defects present at
high temperatures may not be annihilated completely because of the loss of mobility.
Often those defects are regarded as “structural” or “constitutional” defects, but they
are actually nothing more than “frozen-in” defects. After all, the defects present in
thermal equilibrium at high temperatures are formed to realize a thermodynamic state
whose free energy attains the minimum for the given composition and temperature. Since
it is impossible to determine if a particular point defect has been formed thermally
or constitutionally, it does not seem adequate to employ such a distinction as
“thermal defects” and “constitutional defects”.


5.2 
Defect Migration: Microscopic Diffusion 


5.2.1
The Single Atom Jump 


In the previous section the presence of point defects in a crystal at finite
temperature was seen to be a consequence of fundamental thermodynamic principles.
In their turn, point defects are the prerequisite for diffusion and all changes in
atom configuration, for whenever an atom leaves its position a defect of some kind is
created, displaced, or annihilated. By far the most important mechanism, natural for
substitutional solid solutions, is the interchange of an atom with a neighboring
vacancy, and it is the vacancy mechanism which we will mainly consider in the
following. Diffusion may also take place by hopping of atoms from one interstitial
lattice site to the next, a diffusion mode important for small impurity atoms such as
hydrogen or carbon. In a third mechanism of some relevance, the interstitialcy
mechanism, an atom on an interstitial site displaces a neighboring atom from a regular
lattice site, which it afterwards occupies (cf. Section 5.2.2.2 on correlation). All
these mechanisms lend themselves to the same treatment in the framework of statistical
mechanics which will be expounded below.


5.2.1.1 Transition State Theory 

Transport of atoms by single atom jumps is a thermally activated process, which 
means that some energy barrier has to be overcome by thermal energy fluctua- 
tion. Such an event is, of course, the elementary step of a multitude of kinetic 
rate processes considered in physics and chemistry. Time-honored though it is, 
the transition state theory remains in many cases a viable description of the 
single activation event. Originally it was formulated for chemical reaction rates 
(Glasstone et al. 1941) and refined on the basis of statistical mechanics by Wert 
(1950), Zener (1952), and Vineyard (1957). We give a short account of this method 
below. 

Let us consider a solid with N atoms. The Born-Oppenheimer approximation is invoked as
usual (cf. Chapters 4, 10, and 11), the electrons as the component with much lower
mass adapting almost instantaneously to the position of the lattice ions. The ions
(henceforth also called atoms, for simplicity) can then be regarded as subject to
forces created by the interaction with the electrons and dependent only on ion
coordinates. Then we have to deal with 3N degrees of freedom with ion positions and
momenta as coordinates. As in a classical system positions and momenta are distributed
independently, we can restrict our treatment to the 3N-dimensional configuration space
subtended by the position coordinates. The total internal energy of the system can be
represented as a potential function $\Phi$ of the position coordinates. Let us now
regard an elementary event of diffusion, say the jump of an atom into a neighboring
vacancy.

Fig. 5.26 Potential $\Phi$ as a function of 3N atom coordinates. A is the
original equilibrium state of the system, B the new equilibrium state
after the atom has jumped, C the saddle point, s the dividing hypersurface.
Adapted from Vineyard (1957).

The original equilibrium position A (Fig. 5.26) of the atom corresponds to a potential
minimum, as does the new equilibrium position B after the jump. In between, there has
to be at least one transition state, corresponding to a saddle point C in the
potential surface where the curvature is negative along a certain direction which the
transition path follows. The system, excited by thermal fluctuations, will move about
in the phase space region around A and eventually, aided by a favorable fluctuation,
pass over C into the region around B. Instead of following its path in time, an
ensemble of systems distributed in the vicinity of A is considered, and we calculate
ensemble averages instead of time averages. A hypersurface of dimension $3N - 1$ which
we call s, dividing the A and B basins, passes through C (the broken line in Fig. 5.26)
and is perpendicular to the contours of constant potential $\Phi$ everywhere else. The
simplifying assumption is made that the ensemble of representative system points is in
thermal equilibrium right up to the saddle point, as well as along the relevant part
of s, corresponding to a canonical ensemble under isothermal conditions. The second
important assumption of transition state theory is that any system which has reached s
and has a positive velocity component normal to s will definitely cross over into the
B basin. This means that the system point does not follow a random, reversible motion
at the saddle point, and that any friction forces are sufficiently small. The
transition rate will be given by the average number of system points crossing over s
in unit time, divided by the total number of systems in the A basin. An orthogonal
rotation of the Cartesian coordinates can be performed everywhere along the dividing
hypersurface s so that a given coordinate axis, which we may call $x_1$, is
perpendicular to it. Only the positive part of the corresponding velocity $u$ is then
contributing to the flux across s, and from the Maxwellian velocity distribution
Eq. (78) follows for the average velocity $\bar{u}$ across s,

$$\bar{u} = \frac{\int_0^{\infty} u \exp\left(-\frac{m_1 u^2}{2k_BT}\right) du}{\int_{-\infty}^{\infty} \exp\left(-\frac{m_1 u^2}{2k_BT}\right) du} = \sqrt{\frac{k_BT}{2\pi m_1}} \tag{78}$$

$m_1$ being the mass associated with degree of freedom number 1. Now we introduce a
transition region by adding a small strip of width $\delta$ on both sides of the
hypersurface s. If we write $N_{s,\delta}$ for the number of systems of the ensemble in
this region, and $N_A$ for the number of systems in the A basin, the transition rate
can be written as (Schoeck 1980):$^{9)}$

$$\omega = \frac{\bar{u} N_{s,\delta}}{\delta N_A} = \frac{\bar{u}}{\delta}\, \frac{\int_{s,\delta} \exp\left(-\frac{\Phi}{k_BT}\right) dV}{\int_{A} \exp\left(-\frac{\Phi}{k_BT}\right) dV} \tag{79}$$





It remains to calculate the partition functions $N_{s,\delta}$ and $N_A$. Provided the
potential barrier $\Delta\Phi = \Phi(C) - \Phi(A) \gg k_BT$ is high enough and the
curvature of the saddle surface normal to the transition path is strong enough, then
most of the representative system points will be found near A, and the contribution to
the flux across s will be largest at C. We may then expand the potential around A and C
into Taylor series, and after suitable orthogonal transformations write Eqs. (80) in
terms of the normal frequencies $\nu_i$ at A and $\nu_i'$ at the saddle point C:

$$\Phi_A = \Phi(A) + \frac{1}{2}\sum_{i=1}^{3N} m_i (2\pi\nu_i)^2 q_i^2, \qquad \Phi_C = \Phi(C) + \frac{1}{2}\sum_{i=2}^{3N} m_i (2\pi\nu_i')^2 q_i'^2 \tag{80}$$

where at the saddle point the dependence on the transition coordinate, associated with
a negative curvature, may safely be omitted as it is constrained to an infinitesimal
strip of width $\delta$, a quantity which finally cancels out. With Eq. (80) the
integration in Eq. (79) can be performed, easily yielding the simple result

$$\omega = \left(\frac{\prod_{i=1}^{3N}\nu_i}{\prod_{i=2}^{3N}\nu_i'}\right) \exp\left(-\frac{\Phi(C) - \Phi(A)}{k_BT}\right) = \nu_\mathrm{eff} \exp\left(-\frac{\Delta\Phi}{k_BT}\right) \tag{81}$$

9) In contrast to some previous studies, we use $\omega$ here instead of $\Gamma$,
because $\Gamma$ will be reserved, in the context of this chapter, for an effective
atom jump frequency that contains a factor (e.g., vacancy concentration) accounting for
the availability of the defect mediating diffusion. The transition rate considered here
has the role of an exchange frequency with a defect already lying adjacent to the
jumping atom.
The effective frequency $\nu_\mathrm{eff}$, in rate theory usually referred to as an
“attempt frequency” and often identified liberally with the Debye frequency, here turns
out as the product of all 3N normal frequencies in the equilibrium state, divided by
the product of the $(3N - 1)$ normal frequencies in the constrained saddle point state.
Note that, at least in principle, all the degrees of freedom of the system enter in
this expression. Although $\nu_\mathrm{eff}$ is very difficult to estimate in the first
place, some significant progress has recently been made on the basis of quantum
mechanical first-principles calculations where phonon spectra can now be obtained in
the activated, constrained state (Wimmer et al. 2006; cf. also Chapter 11).
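The frequency-product form of the prefactor can be illustrated with a few lines of Python. This is only a sketch under stated assumptions: the function names are hypothetical, the frequency lists are invented toy values rather than data for any material, and the barrier is taken in eV. Sums of logarithms are used because a direct product of 3N frequencies of order $10^{12}$ s$^{-1}$ would overflow for realistic N.

```python
import math

K_B = 8.617333262e-5  # Boltzmann constant in eV/K

def effective_frequency(nu_min, nu_saddle):
    """Product of the 3N normal frequencies at the minimum divided by the
    product of the 3N-1 real normal frequencies at the saddle point."""
    if len(nu_min) != len(nu_saddle) + 1:
        raise ValueError("need exactly one more frequency at the minimum")
    log_nu = (sum(math.log(v) for v in nu_min)
              - sum(math.log(v) for v in nu_saddle))
    return math.exp(log_nu)

def jump_rate(nu_min, nu_saddle, barrier_ev, temperature):
    """Transition rate: effective frequency times the Boltzmann factor."""
    nu_eff = effective_frequency(nu_min, nu_saddle)
    return nu_eff * math.exp(-barrier_ev / (K_B * temperature))

# Hypothetical toy spectra in 1/s (assumption, not measured data).
nu_a = [5.0e12, 6.0e12, 7.0e12]   # all modes at the minimum A
nu_c = [5.5e12, 6.5e12]           # stable modes at the saddle point C
nu_eff = effective_frequency(nu_a, nu_c)
print(f"nu_eff = {nu_eff:.3e} 1/s")
print(f"rate at 1000 K, 1 eV barrier = {jump_rate(nu_a, nu_c, 1.0, 1000.0):.3e} 1/s")
```

Even in this toy case $\nu_\mathrm{eff}$ comes out close to, but not equal to, any single mode frequency, which is why identifying it with the Debye frequency is only a rough estimate.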

For a system kept at constant temperature and pressure P, work has to be done against
the external pressure when changing the configuration of the system. The potential
$\Phi$ then contains an additional PV term and Eq. (81) can be rewritten in a more
compact form familiar from Arrhenius analysis as

$$\omega = \nu_\mathrm{eff} \exp\left(-\frac{\Delta H_m}{k_BT}\right) \tag{82}$$

Equation (82) can be cast into yet another useful form. Following Wert and Zener (1949)
we introduce at the original equilibrium state a hypersurface similar to the one at the
saddle point. This hypersurface, called $\sigma$ (Fig. 5.26), is normal at A to the
line of force leading to C, perpendicular to the contours of constant $\Phi$ everywhere
else, and should also be fitted with a strip of width $\delta$ on both sides. Then
Eq. (79) can be modified according to Eq. (83):

$$\omega = \frac{\bar{u} N_{s,\delta}}{\delta N_A} = \frac{\bar{u}}{\delta}\, \frac{N_{\sigma,\delta}}{N_A}\, \frac{N_{s,\delta}}{N_{\sigma,\delta}} = \tilde{\nu} \exp\left(-\frac{\Delta G_m}{k_BT}\right) \tag{83}$$

for a system at constant temperature and pressure, where now

$$\tilde{\nu} = \nu_\mathrm{eff} \exp\left(-\frac{\Delta S_m}{k_B}\right) \tag{84}$$

and

$$\Delta S_m = k_B \ln\left(\frac{\prod_{i=2}^{3N}\nu_i^0}{\prod_{i=2}^{3N}\nu_i'}\right) \tag{85}$$

The $\nu_i^0$ and the $\nu_i'$ denote the constrained frequencies in the equilibrium
state A and the saddle point state C, respectively. Equation (84) follows from
comparison of Eq. (83) with Eq. (82). As $\exp[-\Delta G_m/(k_BT)]$ is the quotient of
the two constrained partition functions at C and A, we see that $\Delta G_m$ is
identified as the Gibbs free energy needed to carry a system constrained to the
(moving) hypersurface reversibly from A to C. Specifically, $\Delta S_m$ reflects the
change in the vibration spectrum connected with this transfer.
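The comparison that leads to Eq. (84) can be spelled out in one line. The sketch below assumes that the exponent of Eq. (82) is a migration enthalpy, written here as $\Delta H_m$, and that $\Delta G_m = \Delta H_m - T\Delta S_m$; the symbol $\Delta H_m$ is an assumption of this note. The last equality uses the frequency-product form of $\nu_\mathrm{eff}$ together with Eq. (85).

```latex
\begin{align*}
\nu_\mathrm{eff}\exp\left(-\frac{\Delta H_m}{k_BT}\right)
  &= \tilde{\nu}\exp\left(-\frac{\Delta H_m - T\Delta S_m}{k_BT}\right)
   = \tilde{\nu}\exp\left(\frac{\Delta S_m}{k_B}\right)
     \exp\left(-\frac{\Delta H_m}{k_BT}\right) \\
\Rightarrow\quad \tilde{\nu}
  &= \nu_\mathrm{eff}\exp\left(-\frac{\Delta S_m}{k_B}\right)
   = \frac{\prod_{i=1}^{3N}\nu_i}{\prod_{i=2}^{3N}\nu_i^{0}}
\end{align*}
```

In this form $\tilde{\nu}$ depends only on frequencies of the equilibrium state, so all information about the saddle point is collected in $\Delta G_m$.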
5.2.1.2 Alternative Methods

Transition state theory is associated with two critical assumptions which contra- 
dict each other to a certain extent. On the one hand, the activated systems should 
interact with the heat bath strongly enough to acquire thermal equilibrium; on 
the other hand, the interaction must be weak enough not to reverse a positive ve- 
locity across the saddle hypersurface. 


Dynamical Theories  Several authors have argued that it is altogether questionable
whether the atom remains at the saddle point long enough to acquire thermal
equilibrium (especially if the driving force for diffusion is very large or there is
strong co-operation between successive or simultaneous atom jumps). Dropping the
assumption of thermal equilibrium, it was consequently attempted to derive the atom
jump frequency from a different argument involving lattice dynamics. In these
“dynamical” theories of thermal activation (Rice 1958; Slater 1959; Manley 1960) the
motion of a jumping atom was considered as a superposition of n harmonic normal modes
of the crystal. A transition event is supposed to take place once the amplitude exceeds
a certain limit, and at the same time atoms surrounding the transition path “move out
of the way”, enabling the jumping atom to pass. A statistical treatment of the atom
displacements in the framework of classical thermodynamics yields a transition
frequency where the energy barrier for the jumping atom and the energies needed to
displace the surrounding atoms figure in Boltzmann factors. The attempt frequency which
appears as a prefactor has in the numerator the normal frequencies of the system, and
in the denominator the normal frequencies with the coordinate $q_1$ of the jumping atom
constrained to the saddle point, making this expression very similar to Eq. (81) in
transition state theory.


“Diffusion” Theory of Thermal Activation  Sometimes in thermal activation the saddle
point of the energy surface in phase space corresponding to the critical state is not a
sharp maximum as a function of the transition variable. Then it is to be expected that
the representative system points, by their stochastic interaction with the heat bath,
perform a random motion in a region of phase space near the saddle point with an energy
difference not exceeding $k_BT$. In this case the second assumption of transition state
theory (that a positive velocity at the saddle point is not reversed) cannot be upheld
any more and we have to resort to “diffusion” theories of thermal activation (Kramers
1940; Chandrasekhar 1943; Brinkman 1956). In analogy with classical nucleation theory,
a steady-state flow of representative system points over the saddle point can then be
derived. The result is that strong damping together with a “soft” potential (low
curvature at the saddle point) decreases the transition rate (jump frequency) in
comparison with the value predicted by transition state theory (Kramers 1940; Schoeck
1980).
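The trend described above can be made quantitative with the standard Kramers result for moderate to strong damping, in which the transition state theory rate is multiplied by a factor $\kappa = \sqrt{1 + (\gamma/2\omega_b)^2} - \gamma/(2\omega_b)$. This expression is not given in the text and is quoted here as an assumption of the sketch; `gamma` (friction rate) and `omega_b` (angular frequency associated with the barrier curvature) are hypothetical parameter names in arbitrary but equal units.

```python
import math

def kramers_factor(gamma, omega_b):
    """Ratio of the Kramers rate to the transition state theory rate
    for friction rate gamma and barrier frequency omega_b (same units)."""
    x = gamma / (2.0 * omega_b)
    return math.sqrt(1.0 + x * x) - x

# Stronger damping (larger gamma) or a softer barrier (smaller omega_b)
# both reduce the rate relative to transition state theory.
for gamma, omega_b in [(0.0, 1.0), (1.0, 1.0), (10.0, 1.0), (1.0, 0.5)]:
    kappa = kramers_factor(gamma, omega_b)
    print(f"gamma = {gamma:5.1f}, omega_b = {omega_b:3.1f}: kappa = {kappa:.4f}")
```

The factor equals one without friction and falls off as $\omega_b/\gamma$ for strong damping, so the transition state value acts as an upper bound in this regime.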


Molecular Dynamics  Most classical treatments of thermal activation such as transition
state theory and dynamical theory rest on the harmonic approximation of the crystal and
therefore neglect nonlinear behavior near the saddle point. By employing modern,
computer-based methods this limitation and any restrictive assumptions about the
activated state of the system can be overcome. The most widely used method is molecular
dynamics (MD; cf. Chapter 10), which amounts to numerical solution of the equations of
motion of the jumping atom and its surroundings. A valid description of interaction
potentials is of course necessary, the EAM (embedded-atom method) potentials (Daw and
Baskes 1984) being the most popular form besides the Finnis-Sinclair type (Finnis and
Sinclair 1984). Molecular dynamics is the preferred method of numerical simulation for
diffusion processes in materials which have no regular crystal structure, such as
liquid, amorphous, or organic materials, or in cases where the diffusion pathways are
dimensionally restricted as on the crystal surface, in grain boundaries, interfaces,
and dislocation cores (cf. Section 5.4.3).


Nudged Elastic Band Method Activation energies are inferred from MD simula- 
tions only indirectly from an Arrhenius analysis of the temperature dependence 
of the transition rates. More direct approaches to numerical determination of the 
migration barriers consist in static energy calculations where the jumping atom 
is displaced from its original equilibrium position toward a new equilibrium state 
by small steps, relaxing at each step toward the configuration of minimum en- 
ergy. The interactions are calculated by fitting various potentials or employing 
ab-initio density functional methods (cf. Chapter 11). Besides the fact that relaxed 
configurations may represent a poor picture of the actual, dynamical diffusion 
process, the main disadvantage is that in most of these calculations the jump 
path is unnecessarily restricted. Allowance is made for all degrees of freedom in 
the nudged elastic band (NEB) method (Mills et al. 1995), where between the ini- 
tial equilibrium state of the system (A in Fig. 5.26) and the final equilibrium state 
(B in Fig. 5.26) a set of replicas of the system is interposed which reflect the 
changing state of the system during the transition. The replicas are linked by 
spring forces in an additional dimension of phase space. All coordinates of all 
replicas are numerically relaxed until a minimum energy configuration is 
reached. The set of connected replicas has then settled like an elastic band across 
a saddle surface to a “most comfortable fit” where it is least stretched. This 
method can even be applied when calculating the energy on an ab-initio basis if 
the number of degrees of freedom is not too large. 
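The idea of the band of replicas can be tried out on a toy problem. The following Python sketch relaxes a chain of images on a hypothetical two-dimensional double-well potential (minima at (-1, 0) and (1, 0), saddle point at (0, 0.5) with barrier height 1); the potential, the function names, and all numerical settings are assumptions chosen for illustration. It uses a simple variant of the method in which the true force acts only perpendicular to the band and the spring force only along it, with plain steepest-descent relaxation.

```python
import math

def potential(x, y):
    # Hypothetical double well with a curved valley between the minima.
    d = y - 0.5 * (1.0 - x * x)
    return (x * x - 1.0) ** 2 + 5.0 * d * d

def gradient(x, y):
    d = y - 0.5 * (1.0 - x * x)
    return 4.0 * x * (x * x - 1.0) + 10.0 * d * x, 10.0 * d

def relax_band(start, end, n_images=11, k_spring=5.0, step=0.01, n_iter=3000):
    """Relax a band of images between two fixed end points."""
    # Straight-line initial guess; end points stay fixed.
    band = [(start[0] + (end[0] - start[0]) * i / (n_images - 1),
             start[1] + (end[1] - start[1]) * i / (n_images - 1))
            for i in range(n_images)]
    for _ in range(n_iter):
        new_band = list(band)
        for i in range(1, n_images - 1):
            (xp, yp), (x, y), (xn, yn) = band[i - 1], band[i], band[i + 1]
            # Unit tangent estimated from the two neighboring images.
            tx, ty = xn - xp, yn - yp
            norm = math.hypot(tx, ty)
            tx, ty = tx / norm, ty / norm
            gx, gy = gradient(x, y)
            g_par = gx * tx + gy * ty
            # Spring force acts along the band and equalizes image spacing.
            f_spring = k_spring * (math.hypot(xn - x, yn - y)
                                   - math.hypot(x - xp, y - yp))
            # True force: only the component perpendicular to the band.
            fx = -(gx - g_par * tx) + f_spring * tx
            fy = -(gy - g_par * ty) + f_spring * ty
            new_band[i] = (x + step * fx, y + step * fy)
        band = new_band
    return band

band = relax_band((-1.0, 0.0), (1.0, 0.0))
barrier = max(potential(x, y) for x, y in band) - potential(-1.0, 0.0)
print(f"estimated migration barrier: {barrier:.3f}")
```

The straight-line initial path crosses the ridge at an energy well above the true barrier; after relaxation the highest image sits close to the saddle point, so the energy maximum along the band approximates the migration barrier.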


5.2.2 
Solid Solutions 


5.2.2.1 Random Walk 
Atoms migrate in solids by random jumps of defects between sites of a generally
three-dimensional lattice.$^{10)}$ The symmetry properties and topological connectivity
of the crystal lattice can be almost arbitrarily complex. In many cases (e.g.,
long-range ordered alloys) it will be necessary to classify the lattice sites as
belonging to sublattices, and the jump probability generally depends on the sublattice
in which the jump starts or ends. Nevertheless, we will talk of a random walk
proper$^{11)}$ as long as the jump probability does not depend on local composition or
composition gradients and is independent of any previous steps (i.e., uncorrelated
walk). In this section we will for the sake of simplicity assume that the concept of a
random walk applies to a migrating atom as well as to a migrating defect.

10) In some situations, the jumps may be restricted to two dimensions or one
dimension (cf. Section 5.4.3).

The random walk is a well-known mathematical problem (Rudnick and Gas- 
pari 2004) with many applications in statistical physics. It is employed in the 
study of organic systems, and even more abstract fields such as sociology. Some 
of its elementary properties can already be gleaned from the simplest case, a one- 
dimensional random walk, where the atom makes jumps of equal length along a 
straight line. As discussed in Section 5.2.1, the jump probabilities are determined 
by the properties of the atom potential. In Fig. 5.27 (Manning 1968) different pos- 
sibilities are displayed schematically. In Fig. 5.27(a) jumps to the left or right are 
equally probable; in Fig. 5.27(b) there is a constant driving force such as on ions 
in an electric field, so the probability is biased toward one side; in Fig. 5.27(c) the 
properties of the potential and thus the diffusion coefficient are changing with 
position. In this latter case we will not talk of a random walk but of a diffusion 
walk. 

Macroscopic behavior may now be deduced from the random jump events by 
following two possible approaches. Either (1) we consider the net atom flux be- 
tween parallel lattice planes of the crystal or (2) we trace the random walk of 
atoms along some distance and learn about the transport of macroscopic quanti- 
ties of matter by calculating averages. 

Following approach (1) it is an easy exercise to connect microscopic jump properties
with the macroscopic diffusion laws. Consider for simplicity a simple cubic lattice
with jump distance $l$ and diffusion normal to the {100} planes, which reduces
diffusion to a one-dimensional problem. If the number of, say, A atoms in plane 1 per
unit area is $n_1$ and in the adjacent plane $n_2$, and the respective jump frequencies
according to the scenario of Fig. 5.27(b) are $\Gamma_{12} = \Gamma_0(1 + \varepsilon)$
and $\Gamma_{21} = \Gamma_0(1 - \varepsilon)$, then the net flux per unit area (i.e.,
flux density) will be$^{12)}$

$$j = n_1\Gamma_{12} - n_2\Gamma_{21} = \Gamma_0(n_1 - n_2) + \Gamma_0\varepsilon(n_1 + n_2) = -\Gamma_0 l^2 \frac{\partial C}{\partial x} + 2\Gamma_0\varepsilon C l \tag{86}$$

where we have used the relationship $Cl = (n_1 + n_2)/2$ for the volume concentration
C of A atoms.

11) Terminology in the literature may draw the line of distinction in different
places. Some authors refer to a random walk only when all jump probabilities are equal,
others consider stochastic motion by any set of probabilities a random walk. We take an
intermediate viewpoint.

12) Here, as everywhere else in Chapter 5, upper-case C will be used for
concentrations as atoms per unit volume whereas lower-case c will be reserved for site
fractions. Note that $c = \Omega C$, where $\Omega$ is the atomic volume.

Fig. 5.27 Three possible potential landscapes for random walks
(Manning 1968). (a) Unbiased random walk; (b) random walk with
driving force; (c) diffusion walk in nonhomogeneous environment.

A driving force F will lead to an energy difference $Fl$ within the distance between
adjacent lattice planes, and $\varepsilon = Fl/(2k_BT)$. Defining the diffusion
coefficient as $D = \Gamma_0 l^2$ eventually puts the flux in the form

$$j = -D\frac{\partial C}{\partial x} + \frac{DF}{k_BT}\,C \tag{87}$$
in which we recognize Fick’s first law of diffusion with a drift term. If we consider
jumps that do not coincide with the diffusion direction but subtend angles $\alpha_i$
with it, one has to sum up the projections. Allowing only nearest-neighbor jumps to
happen, we get Eq. (88).

$$D = \frac{1}{2}\Gamma_0 l^2 \sum_{i=1}^{z} \cos^2\alpha_i \tag{88}$$

For cubic crystal structures, this can be shown to result in Eq. (89),

$$D = \frac{1}{6} l^2 \Gamma_0 z = \frac{1}{6} l^2 \Gamma \tag{89}$$

with the total jump frequency $\Gamma = \Gamma_0 z$, independent of the diffusion
direction. z is the number of nearest neighbors, the coordination number. An expression
for the hcp structure can be found in Manning (1968), p. 48.
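The step from Eq. (88) to Eq. (89) rests on the fact that, for cubic lattices, the sum of $\cos^2\alpha_i$ over the z nearest-neighbor jump vectors equals z/3 for any diffusion direction. The short Python check below illustrates this for the sc, bcc, and fcc lattices; the function names are hypothetical and the neighbor vectors are given in units that drop the common length factor, which cancels in the cosine.

```python
import itertools
import math

def neighbor_vectors(lattice):
    """Nearest-neighbor jump vectors (arbitrary common length unit)."""
    cube = itertools.product((-1, 0, 1), repeat=3)
    if lattice == "sc":    # 6 neighbors of type <100>
        return [v for v in cube if sum(abs(c) for c in v) == 1]
    if lattice == "bcc":   # 8 neighbors of type <111>
        return list(itertools.product((-1, 1), repeat=3))
    if lattice == "fcc":   # 12 neighbors of type <110>
        return [v for v in cube if sum(abs(c) for c in v) == 2]
    raise ValueError(f"unknown lattice: {lattice}")

def cos2_sum(lattice, direction):
    """Sum of cos^2(alpha_i) between each jump vector and the direction."""
    d_norm = math.sqrt(sum(c * c for c in direction))
    total = 0.0
    for v in neighbor_vectors(lattice):
        v_norm = math.sqrt(sum(c * c for c in v))
        cos_a = sum(a * b for a, b in zip(v, direction)) / (v_norm * d_norm)
        total += cos_a * cos_a
    return total

for lattice in ("sc", "bcc", "fcc"):
    z = len(neighbor_vectors(lattice))
    s = cos2_sum(lattice, (1.0, 2.0, 3.0))   # arbitrary direction
    print(f"{lattice}: z = {z}, sum cos^2 = {s:.6f}, z/3 = {z / 3:.6f}")
```

Because the result does not depend on the chosen direction, the diffusion coefficient of a cubic crystal is a scalar, as stated in the text.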

Applying the continuity equation for the one-dimensional flux

$$\frac{\partial C}{\partial t} + \frac{\partial j}{\partial x} = 0 \tag{90}$$

to Eq. (87) (or directly considering the change of atom numbers in the middle one of
three adjacent planes) we get Eq. (91),

$$\frac{\partial C}{\partial t} = D\frac{\partial^2 C}{\partial x^2} - \frac{DF}{k_BT}\frac{\partial C}{\partial x} \tag{91}$$

having thus derived Fick’s second law for one-dimensional diffusion.
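The plane-by-plane bookkeeping behind this derivation can be integrated directly. The Python sketch below, with hypothetical function names and arbitrary parameter values, follows the occupation of a stack of parallel planes when atoms jump to the right with frequency $\Gamma_0(1+\varepsilon)$ and to the left with $\Gamma_0(1-\varepsilon)$, using a simple explicit Euler step and the plane spacing as length unit. Starting from a single occupied plane, the mean position should drift by $2\Gamma_0\varepsilon t$ and the variance should grow as $2Dt$ with $D = \Gamma_0$ in these units.

```python
def evolve_planes(n, gamma0, eps, dt, n_steps):
    """Explicit Euler integration of the plane occupations n[k]."""
    right = gamma0 * (1.0 + eps)   # jump frequency toward +x
    left = gamma0 * (1.0 - eps)    # jump frequency toward -x
    n = list(n)
    for _ in range(n_steps):
        new = n[:]
        # Outermost planes are left untouched; the profile never reaches them.
        for k in range(1, len(n) - 1):
            gain = right * n[k - 1] + left * n[k + 1]
            loss = (right + left) * n[k]
            new[k] += dt * (gain - loss)
        n = new
    return n

def moments(n):
    """Total content, mean plane index, and variance of the profile."""
    total = sum(n)
    mean = sum(k * v for k, v in enumerate(n)) / total
    var = sum((k - mean) ** 2 * v for k, v in enumerate(n)) / total
    return total, mean, var

n_planes, center = 201, 100
start = [0.0] * n_planes
start[center] = 1.0                      # all atoms in one plane at t = 0
gamma0, eps, dt, n_steps = 1.0, 0.1, 0.01, 1000
total, mean, var = moments(evolve_planes(start, gamma0, eps, dt, n_steps))
t = dt * n_steps
print(f"total      = {total:.6f}")
print(f"mean shift = {mean - center:.4f}  (2*gamma0*eps*t = {2 * gamma0 * eps * t:.4f})")
print(f"variance   = {var:.4f}  (2*D*t = {2 * gamma0 * t:.4f})")
```

The small deviation of the variance from $2Dt$ is a time-step effect of the Euler scheme and shrinks as `dt` is reduced.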

Let us now have a closer look at the second approach, in which we follow an atom along
its path. Steps will be made in random sequence to the right or to the left with
probabilities p and $q = 1 - p$. Assuming that the right and left probabilities differ
ever so slightly, we set $p = (1 + \varepsilon)/2 + O(\varepsilon^2)$ and
$q = (1 - \varepsilon)/2 + O(\varepsilon^2)$. Then after N steps the probability of
having advanced a net distance of m steps to the right can be expressed by the binomial
distribution, which can be shown to converge in the limit of a large number of steps to
a Gaussian probability distribution. We suppose now that the particle on average
suffers $2\Gamma_0$ displacements per unit time. Introducing the continuous variable
$x = ml$ when the step width is $l$, we can write for the differential probability
$W(x,t)\Delta x$ that the particle has arrived in the interval $[x, x + \Delta x]$ at
time t,

$$W(x,t) = \frac{1}{2\sqrt{\pi Dt}} \exp\left(-\frac{(x - \bar{u}t)^2}{4Dt}\right) \tag{92}$$

where we have written $D = \Gamma_0 l^2$ (diffusion coefficient) and
$\bar{u} = 2\Gamma_0 l\varepsilon$ (drift velocity). With Eq. (92) we have at the same
time found Green’s function for solving the initial-value problem of diffusion
(Glicksman 2000) under the restrictive assumptions made here: no dependence of
diffusion coefficient or driving force on x. As Fick’s second law, Eq. (91), is a
linear differential equation, solutions can be constructed by superposition. Equation
(92) gives the solution of Eq. (91) for an initial distribution of one A atom exactly
at x = 0, which in continuum language means a Dirac $\delta$ function centered at
x = 0. The general solution of Eq. (91) for any initial distribution
$C_0(x) = C(x,0)$ is therefore found as the convolution

$$C(x,t) = \int_{\xi=-\infty}^{+\infty} C_0(x - \xi)\, W(\xi,t)\, d\xi \tag{93}$$


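where W is defined by Eq. (92). Since the random behavior of the atom as considered in
this section does not depend on previous history, we may write quite generally

$$C(x', t + \tau) = \int_{x=-\infty}^{+\infty} C(x' - x, t)\, W(x,\tau)\, dx \tag{94}$$

which means that the state at time $t + \tau$ develops by random walks of all atoms
with duration $\tau$ from the state at a previous time t. Expanding the left-hand side
of Eq. (94) in $\tau$ and the integrand in x we obtain Eq. (95).

$$C(x',t) + \tau\frac{\partial C}{\partial t} + \cdots = \int_{x=-\infty}^{+\infty} \left(C(x',t) - x\frac{\partial C}{\partial x'} + \frac{1}{2}x^2\frac{\partial^2 C}{\partial x'^2} - \cdots\right) W(x,\tau)\, dx \tag{95}$$

Defining the nth moment of the probability distribution W as

$$\langle x^n \rangle = \int_{-\infty}^{+\infty} x^n\, W(x,\tau)\, dx \tag{96}$$

we can write Eq. (95) for short times $\tau$ as

$$\frac{\partial C}{\partial t} = \frac{\langle x^2 \rangle}{2\tau}\frac{\partial^2 C}{\partial x^2} - \frac{\langle x \rangle}{\tau}\frac{\partial C}{\partial x} \tag{97}$$

Comparing this form with Fick’s second law, Eq. (91), we can give the constants
appearing therein a statistical meaning (Einstein 1905):

$$D = \lim_{\tau \to 0} \frac{\langle x^2 \rangle}{2\tau}, \qquad \frac{DF}{k_BT} = \lim_{\tau \to 0} \frac{\langle x \rangle}{\tau} \tag{98}$$

Equations (98) are general formulations, in the sense that they apply even when
the probability distribution W does not represent true random walk behavior.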
We note that when directly calculating the first and second moments of the
probability distribution, Eq. (92), for finite time t, then $\langle x \rangle = [FD/(k_B T)]\,t = \bar{u}t$, but
$\langle x^2 \rangle = 2Dt$ only if there is no driving force or in the limit for short times.
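The statement about the second moment can be checked numerically. The sketch below assumes that Eq. (92) has the usual form of a Gaussian of variance 2Dt whose center drifts with velocity ū (this explicit form, the function names, and the parameter values are assumptions made for illustration). Under that assumption the second moment is $2Dt + (\bar{u}t)^2$, which reduces to 2Dt for vanishing force or short times.

```python
import math

def w_drift(x, t, D, u):
    """Assumed form of W(x, t): Gaussian of variance 2*D*t centered at u*t."""
    return math.exp(-(x - u * t) ** 2 / (4.0 * D * t)) / math.sqrt(4.0 * math.pi * D * t)

def moments(t, D, u, half_width=12.0, n=20001):
    """Return (<x>, <x^2>) of W by a simple Riemann sum over +/- half_width sigma."""
    sigma = math.sqrt(2.0 * D * t)
    x0 = u * t - half_width * sigma
    dx = 2.0 * half_width * sigma / (n - 1)
    m1 = m2 = 0.0
    for k in range(n):
        x = x0 + k * dx
        w = w_drift(x, t, D, u) * dx
        m1 += x * w
        m2 += x * x * w
    return m1, m2

D, u = 1.0e-12, 1.0e-9  # illustrative values in m^2/s and m/s
for t in (1.0, 1.0e6):
    m1, m2 = moments(t, D, u)
    # ratios: <x>/(u t), <x^2>/(2 D t), <x^2>/(2 D t + (u t)^2)
    print(t, m1 / (u * t), m2 / (2 * D * t), m2 / (2 * D * t + (u * t) ** 2))
```

For the short time the second ratio stays very close to 1, whereas for the long time it deviates noticeably while the third ratio remains 1.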

Definition (98) in the case of a random walk refers to the tracer diffusion
coefficient, which is usually denoted as D* when it is necessary to distinguish it from
other types of diffusion coefficient (cf. Section 5.4). To get an idea of mean
random walk distances we note that in substitutional fcc alloys near the melting
point a typical value is $D \approx 10^{-12}$ m² s⁻¹. The mean diffusion distance for one
day results as $\sqrt{\langle x^2 \rangle} \approx 0.4$ mm. For small interstitial atoms a typical value of D
near the melting point is $10^{-9}$ m² s⁻¹. This would yield a mean diffusion
distance after one day of $\sqrt{\langle x^2 \rangle} \approx 13$ mm.
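The two distances quoted above follow from $\langle x^2 \rangle = 2Dt$ with t equal to one day. A minimal sketch of the arithmetic (the function name is illustrative only):

```python
import math

SECONDS_PER_DAY = 24 * 3600

def rms_distance(D, t):
    """One-dimensional root-mean-square displacement sqrt(<x^2>) = sqrt(2 D t)."""
    return math.sqrt(2.0 * D * t)

for label, D in (("substitutional", 1.0e-12), ("interstitial", 1.0e-9)):
    # D in m^2/s, result converted to millimeters
    print(f"{label}: {rms_distance(D, SECONDS_PER_DAY) * 1e3:.2f} mm")
```

Because the distance scales with the square root of D, the factor of 1000 between the two diffusivities changes the distance only by a factor of about 32.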

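The expansion according to Eq. (95) can also be performed in the general three-
dimensional case of diffusion leading to a general three-dimensional formulation
of Fick's second law (small τ is again implied so as to make higher-order terms
negligible):

$$\frac{\partial C}{\partial t} = -\sum_i \frac{\langle x_i \rangle}{\tau} \frac{\partial C}{\partial x_i} + \sum_{i,j} \frac{\langle x_i x_j \rangle}{2\tau} \frac{\partial^2 C}{\partial x_i\, \partial x_j}$$   (99)

where the diffusion coefficient D is defined as the second-rank tensor

$$D_{ij} = \lim_{\tau \to 0} \frac{\langle x_i x_j \rangle}{2\tau}$$   (100)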


The off-diagonal elements vanish if an appropriate set of principal axes is chosen,
the diagonal elements then corresponding to the diffusion coefficients along
those axes. When choosing these axes we can make use of the fact that the
symmetry properties of the diffusion tensor $D_{ij}$ reflect the symmetry of the crystal.
Cubic crystals exhibit a very high degree of symmetry, with the consequence that
any three orthogonal axes are principal axes and diffusion is isotropic:
$\langle x_1^2 \rangle = \langle x_2^2 \rangle = \langle x_3^2 \rangle$. Then we have for the three-dimensional displacement
$\mathbf{R} = (x_1, x_2, x_3)$





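$$D = \lim_{\tau\ \mathrm{small}} \frac{\langle R^2 \rangle}{6\tau}$$   (101)

so that for isotropic diffusion we may write three-dimensional forms of Fick's
first law:

$$\mathbf{j} = -D\nabla C + \frac{\mathbf{F} D C}{k_B T}$$   (102)

and Fick's second law:

$$\frac{\partial C}{\partial t} = D\left[\nabla^2 C - \nabla\cdot\left(\frac{\mathbf{F} C}{k_B T}\right)\right]$$   (103)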
The three-dimensional generalization of Eq. (92) follows if we consider that the
random walk can be decomposed into three independent walks in the three
principal diffusion directions, so the probability density is just the product of the
probability densities in the three directions:

$$W(x_1, x_2, x_3) = W(x_1,t)\, W(x_2,t)\, W(x_3,t)$$   (104)

In the absence of driving forces,

$$W(x_1, x_2, x_3) = \frac{1}{8\sqrt{\pi^3 D_1 D_2 D_3 t^3}} \exp\left[-\left(\frac{x_1^2}{4D_1 t} + \frac{x_2^2}{4D_2 t} + \frac{x_3^2}{4D_3 t}\right)\right]$$   (105)


We see that the surfaces of constant concentration (or probability) are ellipsoids
whose axes are determined by the principal diffusion coefficients $D_1$, $D_2$, and
$D_3$. In the case of isotropic diffusion (as in cubic crystals) the ellipsoids turn into
spheres.


Heterogeneity of Phase Transformations by Limited Vacancy Mobility

Equation (105) tells us that the random walk of atoms or defects such as vacancies defines
ellipsoids or spheres of probability which grow with time. Let us consider a
system where diffusion takes place via a vacancy mechanism, which is not in
equilibrium for t = 0. Any change in atom configuration will begin in restricted areas
within the ellipsoids defined above. If the mobility of the vacancies is low (low
temperature, high enthalpy of migration $\Delta H_m$), and their density not too high,
these areas will remain separate during reasonable observation times. This
means that structural changes such as phase transformations or changes of order
parameter, which should by thermodynamic arguments proceed homogeneously
in all volume elements of the sample, may by reasons of vacancy mobility lead to
a heterogeneous configuration, at least in the initial stages of the transformation.
This effect, usually called "local ordering," has been recognized by various
authors (Beeler 1965; Allen and Cahn 1976; Athènes et al. 1996; Belashchenko and
Vaks 1998; Soffa et al. 2003). The initially "disjunct" state of kinetics cannot end
before the probability ellipsoids [Eq. (105)] impinge on one another. This criterion
is however not sufficient because the migrating vacancy does not fill out the
probability ellipsoid densely enough (Reith et al. 2005).


5.2.2.2 Correlated Walk - the Interaction of Defect and Atom 

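We consider a walk in the absence of driving forces, using Eq. (101) for the
definition of the tracer diffusion coefficient. The displacement R results from a
succession of n jumps $\mathbf{r}_1, \mathbf{r}_2, \ldots, \mathbf{r}_n$ which are not necessarily of equal length or
direction. The averaged squared displacements can be written as Eq. (106).

$$\langle R^2 \rangle = \left\langle \left( \sum_{i=1}^{n} \mathbf{r}_i \right)^2 \right\rangle = \sum_{i=1}^{n} \langle r_i^2 \rangle + 2 \sum_{i=1}^{n-1} \sum_{j=i+1}^{n} \langle \mathbf{r}_i \cdot \mathbf{r}_j \rangle$$   (106)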
Fig. 5.28 Vacancy mechanism. The atom (black) originally
at position 1 has changed places with a vacancy.


For a completely uncorrelated random walk the cross-terms cancel out on the av- 
erage since steps in any given direction are equally probable to those in the re- 
verse direction. If they do not, it is an indication that there is some “memory” 
of the previous path of the diffusing atom. How this comes about is easily dem- 
onstrated, taking the vacancy mechanism as an example. 

A tracer atom (black) in Fig. 5.28 has just changed places with a vacancy, ad- 
vancing from position 1 to 2. There is a more than equal probability that the 
next jump of the atom will be an exchange with the same vacancy, putting it 
back to position 1. Note that the motion of the atom is correlated even if there is 
no preference of the vacancy to lie in the vicinity of any atom, which means that 
the movement of the vacancy itself is completely uncorrelated. Another example 
is the correlation arising from the interplay of interstitial and regular site atoms 
in an interstitialcy mechanism (Fig. 5.29). The tracer atom (black), initially in an 
interstitial position, moves on to a regular lattice site, and the atom originally sit- 
ting there now becomes an interstitial. There is an enhanced probability that the 
next jump of the black atom will be just the reverse, because the necessary defect 
(an interstitial atom) is still at hand. 

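The usual method of quantitatively describing the correlation effects in atom
diffusion is to compare $\langle R^2 \rangle$ of Eq. (106) with $\langle R^2 \rangle_{\mathrm{random}}$ of a totally random
walk. The quotient of both quantities is called the correlation factor:

$$f = \lim_{n \to \infty} \frac{\langle R^2 \rangle}{\langle R^2 \rangle_{\mathrm{random}}} = 1 + 2 \lim_{n \to \infty} \frac{\sum_{i=1}^{n-1} \sum_{j=i+1}^{n} \langle \mathbf{r}_i \cdot \mathbf{r}_j \rangle}{\sum_{i=1}^{n} \langle r_i^2 \rangle}$$   (107)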


Fig. 5.29 Interstitialcy mechanism. An interstitial atom (black) moves
to a regular lattice site (white). The atom which is sitting there now
becomes an interstitial atom.

Much effort has been put over the years into calculation and measurement of
correlation factors in systems with various symmetries and diffusion mechanisms
and several jump types. Elaborate systematic techniques based on probability
concepts (Bardeen and Herring 1952), matrix methods (Mullen 1961; LeClaire and
Lidiard 1956), and analogies to electric circuits (Compaan and Haven 1956) and
different numerical methods have been reviewed by Manning (1968). In less
symmetric lattices, correlation factors in the principal diffusion directions may be
different and must be defined by equations analogous to Eq. (107) with
displacements in the respective directions. In cubic lattices, the high symmetry reduces
correlation factors to rather simple expressions involving the average cosine of
the angle between two successive jumps, yielding for the vacancy mechanism


$$f = \frac{1 + \langle \cos\theta \rangle}{1 - \langle \cos\theta \rangle}$$   (108)

and for the interstitialcy mechanism

$$f = 1 + \langle \cos\theta' \rangle$$   (109)

θ′ being the angle between the interstitial-to-lattice jump and the subsequent
lattice-to-interstitial jump of the tracer atom.
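As a quick consistency check of Eq. (108), the relation can be inverted for the mean cosine. Taking as an example the two-dimensional square lattice, for which the vacancy correlation factor is known to be f = 1/(π − 1) ≈ 0.467, one finds

```latex
\langle \cos\theta \rangle = \frac{f - 1}{f + 1} = \frac{2}{\pi} - 1 \approx -0.363
```

The negative value expresses the enhanced probability of a jump reversal: on average, a jump points partly back along the direction of the preceding one.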

Table 5.6 (Murch 2001) lists various correlation factors which result for differ- 
ent diffusion mechanisms, different lattices, and different dimensionalities. The 
correlation factor for vacancy diffusion is a characteristic number always smaller 
than unity depending on the specific lattice and its connectivity. Interstitial diffu- 
sion of an infinitely dilute solute is uncorrelated, as diffusion does not depend on 
an ancillary defect. If, however, the interstices are filled with solute atoms so that 
only a few free places remain where the atoms can go, we revert to the situation 
of vacancy-assisted diffusion with the same correlation factors as found there. 


Table 5.6 Correlation factors for various lattice structures and diffusion mechanisms.

Lattice                      Mechanism                     Correlation factor, f
Honeycomb                    vacancy                       1/3
Square                       vacancy                       1/(π − 1)
Triangular                   vacancy                       0.56006
Diamond cubic                vacancy                       1/2
Body-centered cubic          vacancy                       0.72714
Simple cubic                 vacancy                       0.65311
Face-centered cubic          vacancy                       0.78146
Face-centered cubic          divacancy                     0.4579 ± 0.0005
All lattices                 interstitial                  1
NaCl structure               collinear interstitialcy      2/3
CaF₂ structure (F⁻ ion)      noncollinear interstitialcy   0.9855
CaF₂ structure (Ca²⁺ ion)    collinear interstitialcy      4/5
CaF₂ structure (Ca²⁺ ion)    noncollinear interstitialcy   1
Fig. 5.30 Correlation factor for vacancy diffusion in the fcc lattice (Murch 1975).


Conversely, if in substitutional diffusion by a vacancy mechanism more and more 
vacancies are introduced, correlation with specific vacancies is lost as the atom 
becomes free to move almost anywhere, because a vacancy is always around. 
Both cases are characteristic of a situation where a large number of objects fill 
an available space until they behave collectively like the one vacant lot they have 
left free. 

Figure 5.30 (Murch 1975) illustrates this change of regimes in terms of the cor- 
relation factor for vacancy diffusion in the fcc lattice. The correlation factor goes 
from the value for single vacancies (near 0.8) up to 1 as the vacancy concentration 
rises from 0 to 1. The same diagram can in principle be read from right to left for 
interstitial diffusion as the solute atom concentration increases from 0 at the 
right-hand side of the diagram to 1 at the left-hand side. In the latter case f de- 
creases as the number of available sites is depleted. 

In cubic crystals all jumps are of the same length l, and we have for the
correlation factor

$$f = \frac{\langle R^2 \rangle}{n l^2}$$   (110)

an especially useful form if one wants to obtain f by direct computer simulation.
With the help of Eq. (101) and the total jump frequency Γ = n/τ the diffusion
coefficient can therefore be written as

$$D^* = \frac{1}{6}\, f\, \Gamma l^2$$   (111)

which is the usual representation of the diffusivity as an uncorrelated part and a
factor taking account of the correlation effects. We note that in alloys with a
high concentration of defects, for instance nonstoichiometric intermetallic
compounds or fast ion conductors, the apparent activation energy of D* may contain
an important contribution from the temperature dependence of f (via the T-
dependence of the defect concentrations).
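To illustrate how a correlation factor can be read off a simulated walk as the ratio of the mean squared displacement to n l², the following sketch uses a deliberately simple toy model rather than a real vacancy mechanism: a one-dimensional walk with unit jump length in which every jump repeats the direction of the previous jump with probability q and reverses it otherwise, so that ⟨cos θ⟩ = 2q − 1. The model, the function name, and all parameter values are assumptions chosen for illustration. For such a walk the result should approach (1 + ⟨cos θ⟩)/(1 − ⟨cos θ⟩) for large n.

```python
import random

def simulated_correlation_factor(mean_cos, n_jumps=200, n_walkers=5000, seed=1):
    """Estimate f = <R^2> / (n l^2) for a 1D walk with unit jump length.

    Toy model (an assumption, not the vacancy mechanism itself): each jump
    keeps the direction of the previous one with probability q and reverses
    it otherwise, so that <cos(theta)> = 2q - 1 for successive jumps.
    """
    rng = random.Random(seed)
    q = 0.5 * (1.0 + mean_cos)
    sum_r2 = 0.0
    for _ in range(n_walkers):
        step = rng.choice((-1, 1))      # first jump: direction chosen at random
        position = step
        for _ in range(n_jumps - 1):
            if rng.random() >= q:       # reversal with probability 1 - q
                step = -step
            position += step
        sum_r2 += position * position
    return sum_r2 / (n_walkers * n_jumps)

mean_cos = -0.4
f_sim = simulated_correlation_factor(mean_cos)
f_formula = (1.0 + mean_cos) / (1.0 - mean_cos)
print(f"simulated f = {f_sim:.3f}, (1 + <cos>)/(1 - <cos>) = {f_formula:.3f}")
```

The statistical scatter of the estimate decreases only with the square root of the number of walkers, so a realistic lattice simulation needs considerably more sampling than this sketch.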

There are alloys where any diffusion jump leads to a different sublattice of the 
long-range ordered structure (cf. Section 5.2.3). An example is B2 (FeAl, NiAl, ...)
with all nearest neighbors of a given atom belonging to a different sublattice. If 
we assume nearly perfect order, and allow only nearest-neighbor jumps to occur 
(which in many cases is the most probable hypothesis) then a diffusion jump of 
a “regular” atom is bound to create an antisite defect. As the new configuration 
in general is at a higher level of energy, there will be an enhanced probability of 
an immediate back-jump, leading sometimes to very low correlation factors. Al- 
though we note a jump reversal effect as in the usual case of correlation due to 
the cooperation with a vacancy, the cause is now different. For the special situa- 
tion of diffusion and the correlation effects in highly ordered alloys, refer to Sec- 
tion 5.2.3. 


Correlation effects in impurity diffusion: the five-frequency model

Whereas in tracer diffusion f can be given as a simple number depending on lattice
geometry, the correlation factor of impurity atoms depends on the relative value of their
jump frequency as compared to the jump frequency of the solvent. The obvious
reason for the particular temperature dependence of impurity diffusion is that
both ω values are a function of temperature. In Fig. 5.31 (Murch and Thorn
1978) f is plotted as a function of $\omega_2/\omega_0$, the ratio of the vacancy exchange
frequency $\omega_2$ of the impurity and $\omega_0$ of the host atom. The curves have been
calculated for the walk of a sufficiently dilute impurity atom in the fcc lattice.

If $\omega_0 \gg \omega_2$, the vacancy exchanges much more rapidly with a host atom than
with the impurity atom. It will therefore move away quickly from the impurity
atom, whose next jump will probably not involve the same vacancy. The
consequence is weak correlation, viz., f ≈ 1. Vice versa, if $\omega_0 \ll \omega_2$ there is little
inducement for the vacancy to exchange with one of the sluggish host atoms, and
therefore the next exchange of the vacancy will be with the impurity atom again.
The result is strong correlation and f ≪ 1.

Fig. 5.31 Correlation factor for impurity diffusion. The upper and lower
curves refer to the upper and lower x axis labels and represent the two
regimes $\omega_2/\omega_0 < 0$ and $\omega_2/\omega_0 > 0$. After Murch and Thorn (1978).

In order to describe the impurity diffusion and to derive the correlation factors
plotted in Fig. 5.31, the phenomenological so-called five-frequency model (Murch
and Thorn 1978) was used. The influence of different atomic neighborhoods is
accounted for by introducing five distinct exchange frequencies, as explained in
Fig. 5.32. Here $\omega_1$ is the frequency for the exchange of the host atom with a
vacancy if both are nearest neighbors to the impurity, and $\omega_2$ is the exchange
frequency of the impurity with the vacancy, as above; $\omega_3$ is the frequency of a
dissociative jump of the host atom, which takes the host atom away from the impurity
atom, and finally $\omega_4$ refers to the opposite movement which brings the host atom
into a nearest-neighbor position to the impurity. The fifth frequency is $\omega_0$ for the
exchange of a host atom with a vacancy in the absence of impurities. We note that


$$\frac{\omega_4}{\omega_3} = \exp\left(-\frac{E_B}{k_B T}\right)$$   (112)


$E_B$ being the impurity-vacancy binding energy (negative if the vacancy is
attracted to the impurity). Manning (1959), within the framework of this model,
has made a rigorous derivation of the impurity correlation factor:

$$f_2 = \frac{2\omega_1 + 7F\omega_3}{2\omega_1 + 2\omega_2 + 7F\omega_3}$$   (113)


where the dependence on $\omega_0$ enters through the fraction F of the vacancies
permanently lost from a site. F is given by the very good approximate expression
(114).

$$7F = 7 - \frac{10\alpha^4 + 180.5\alpha^3 + 927\alpha^2 + 1341\alpha}{2\alpha^4 + 40.2\alpha^3 + 254\alpha^2 + 597\alpha + 436}, \qquad \alpha = \frac{\omega_4}{\omega_0}$$   (114)

Fig. 5.32 Five-frequency model for impurity diffusion (Murch and Thorn
1978). Open square: vacancy; open circle: host atom; filled circle:
impurity. $\omega_3$ is the frequency of a dissociative jump of the host atom
away from the impurity. The fourth frequency $\omega_4$ is the corresponding
associative jump, the fifth frequency, $\omega_0$, refers to the exchange of a
host atom far from an impurity, with the vacancy.







Analogous expressions have also been derived for other lattice structures, e.g., for
bcc and the diamond structure (Manning 1964), the hcp structure (Huntington
and Ghate 1962; Ghate 1964) and for impurity diffusion by interstitialcy jumps
(Manning 1959).
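The five-frequency expressions for the fcc lattice are easily evaluated numerically. The sketch below implements Manning's impurity correlation factor together with his approximate expression for 7F in the commonly quoted form, with α = ω₄/ω₀; the function names and the test frequencies are illustrative assumptions. A useful sanity check is the self-diffusion limit in which all five frequencies are equal, where the result should be close to the fcc tracer value of Table 5.6.

```python
def seven_f(w0, w4):
    """7F from Manning's approximate expression, with alpha = w4 / w0."""
    a = w4 / w0
    numerator = 10 * a**4 + 180.5 * a**3 + 927 * a**2 + 1341 * a
    denominator = 2 * a**4 + 40.2 * a**3 + 254 * a**2 + 597 * a + 436
    return 7.0 - numerator / denominator

def impurity_correlation_factor(w0, w1, w2, w3, w4):
    """Impurity correlation factor of the five-frequency model (fcc lattice)."""
    escape = seven_f(w0, w4) * w3
    return (2 * w1 + escape) / (2 * w1 + 2 * w2 + escape)

# Frequencies in arbitrary but common units (hypothetical values).
print("self-diffusion limit:", impurity_correlation_factor(1, 1, 1, 1, 1))
print("fast impurity       :", impurity_correlation_factor(1, 1, 100, 1, 1))
print("slow impurity       :", impurity_correlation_factor(1, 1, 0.01, 1, 1))
```

The fast impurity gives a correlation factor much smaller than 1 and the slow impurity a value close to 1, in line with the qualitative argument given for the two limiting cases of the frequency ratio.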


5.2.2.3 Diffusion Walk with Chemical Driving Force 

When deriving the diffusion flux in Eq. (87) we assumed that the enthalpy of the
crystal was unchanged after an atom jump had taken place, a premise that can be
upheld in a homogeneous alloy or in the case of an ideal solution. In Fick's first
law the concentration gradient formally plays the role of a driving force. The net
flux arises, however, only because there are more atoms on plane 1 than on the
adjacent plane 2. We might also say that it is the gradient of the entropy of
mixing which drives diffusion in an ideal solution.13) In the general case of a
non-ideal solution we have to expect, however, an enthalpy difference if the
environment of the jumping atom has changed as a result of the jump, for instance if
the number of neighboring A atoms has increased. The jump probability
according to the usual ansatz of transition state theory, Eq. (83), depends on the Gibbs
free energy difference between the initial state and the saddle point state.14) We
should remember that the Gibbs free energy difference used here contains a
contribution from lattice vibration entropy, but does not contain mixing entropy.15)

13) It is readily verified that this is equivalent to a concentration gradient, if the
gradient is not too large.

Any kinetic model must be formulated consistently so that the system
approaches a true thermodynamic equilibrium state at long times. It is usual to
invoke dynamical equilibrium for every single pair of an atom jump and its reverse,
a requirement known as the principle of detailed balance. Regarding
an ensemble of systems and calling $n_i^0$ the number of systems in state i in
equilibrium, we have for the fluxes of the ensembles between both states

$$n_1^0\, \omega_{12} = n_2^0\, \omega_{21}$$   (115)

or

$$n_1^0\, \nu \exp\left(-\frac{G_S - G_1}{k_B T}\right) = n_2^0\, \nu \exp\left(-\frac{G_S - G_2}{k_B T}\right)$$   (116)

from which

$$\frac{n_2^0}{n_1^0} = \exp\left(-\frac{G_2 - G_1}{k_B T}\right)$$   (117)

Thus it is seen that the occupation numbers of systems in equilibrium in states 1 
and 2 correspond to the respective ensemble densities, as demanded by statistical 
mechanics. 
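The approach to this equilibrium can be followed in a minimal two-state sketch. It assumes transition-state jump rates with a common attempt frequency for the jump and its reverse and integrates the rate equation for the two occupation numbers with a simple explicit Euler scheme; the function name, the energies, and the step size are illustrative assumptions. Whatever the starting occupation, the ratio n₂/n₁ should relax to the Boltzmann factor of the Gibbs free energy difference between the two states.

```python
import math

def relax_two_state(G1, G2, GS, kT, nu=1.0, n1=1.0, n2=0.0, steps=2000):
    """Relax occupations n1, n2 under jump rates over a common saddle point GS.

    Energies and kT share one (arbitrary) unit; nu is the attempt frequency,
    assumed equal for the jump and its reverse.
    """
    w12 = nu * math.exp(-(GS - G1) / kT)   # rate for jumps 1 -> 2
    w21 = nu * math.exp(-(GS - G2) / kT)   # rate for jumps 2 -> 1
    dt = 0.1 / (w12 + w21)                 # small compared with the relaxation time
    for _ in range(steps):
        net = (n1 * w12 - n2 * w21) * dt   # net number transferred from 1 to 2
        n1 -= net
        n2 += net
    return n1, n2

n1, n2 = relax_two_state(G1=0.0, G2=0.1, GS=0.5, kT=0.05)
print(n2 / n1, math.exp(-(0.1 - 0.0) / 0.05))
```

The saddle point energy only sets the time scale of the relaxation; it drops out of the final occupation ratio.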

If equilibrium has not yet been reached, the net diffusion flux depends on
driving forces toward equilibrium. An obvious driving force is the Gibbs free energy
difference incurred by the moving atoms. This difference is conveniently
described by a change in local chemical potential, which is defined as the Gibbs
free energy change when an atom is added at a given lattice site.16) For the
chemical potential of atom species i we write

$$\mu_i = \frac{\partial G}{\partial N_i} = \mu_0 + k_B T \ln c_i + \mu_i^{\mathrm{ex}}$$   (118)


14) Whether we choose Eq. (82) or (83) for the jump probability depends on if we
want to make allowance for the change in lattice vibration spectrum on displacing
the atom from the equilibrium position to the saddle point (and on to the new
equilibrium position). To remain in accordance with the main body of literature,
we decide to use Eq. (83). There still remains the delicate question of whether the
attempt frequencies ν of Eq. (83) are the same for a jump and its reverse. Keeping
in mind the approximations made in transition state theory, instead of
overstretching this model we invoke the principle of detailed balance, requiring
the attempt frequencies to be the same if the proper Gibbs free energy differences
have been used.

15) The Gibbs free energy level of the new equilibrium state after the jump enters
only insofar as it determines the probability for the back jump.

16) Sometimes, especially in chemical physics, chemical potential is defined per mol.
where $\mu_0$ is the chemical potential of a reference state and the second
contribution is due to mixing entropy and is the only part remaining in an ideal solution.
If we regard the jump of one particular atom, we deliberately take the change of
mixing entropy out of our consideration because it just comes about by the
statistics of many atoms jumping. The free energy difference which enters the jump
probability explicitly is therefore only the third contribution in Eq. (118), an
excess chemical potential which arises only for non-ideal solutions. It yields the
following Gibbs free energy change, equivalent to the effect of a driving force F
acting along a jump of length l:

$$G_2 - G_1 = l\, \frac{\partial \mu_i^{\mathrm{ex}}}{\partial x} = -Fl$$   (119)





According to the conventions of chemical thermodynamics $\mu_i$ is often written in
terms of the activity $a_i$ and the activity coefficient $\gamma_i$ describing the deviation from
the behavior of an ideal solution:

$$\mu_i - \mu_0 = k_B T \ln a_i = k_B T \ln c_i + k_B T \ln \gamma_i$$   (120)

Thus by comparison with Eq. (118),

$$\mu_i^{\mathrm{ex}} = k_B T \ln \gamma_i$$   (121)

the activity coefficient being nothing else than the excess chemical potential
appearing in a different guise. The bias in jump frequencies due to a driving force
F is given by $\varepsilon = Fl/(2k_B T)$. In the case of a chemical driving force we therefore
get Eq. (122).

$$\varepsilon = -\frac{l}{2}\, \frac{\partial \ln \gamma_i}{\partial x}$$   (122)


Following the same procedure which has led us to Eq. (87), but now using the
chemical driving force F in Eq. (119) together with Eq. (121), we obtain Eq. (123)
directly from Eq. (87).

$$j_i = -D_i^* \frac{\partial C_i}{\partial x} - D_i^* C_i \frac{\partial \ln \gamma_i}{\partial x} = -D_i^* \left( 1 + \frac{\partial \ln \gamma_i}{\partial \ln c_i} \right) \frac{\partial C_i}{\partial x}$$   (123)

Here we have introduced for later convenience the site fraction $c_i = \Omega C_i$, Ω being
the atomic volume. We can therefore describe diffusion in a chemical driving
force by the introduction of a thermodynamic factor φ [Eq. (124)] to the diffusion
coefficient.

$$\varphi = 1 + c_i \frac{\partial \ln \gamma_i}{\partial c_i} = 1 + \frac{\partial \ln \gamma_i}{\partial \ln c_i}$$   (124)
We have indicated by the asterisk and the index that the diffusion coefficient
appearing in Eq. (123) is a tracer diffusion coefficient valid for species i. For the
diffusion coefficient, e.g., in cubic crystal structures, we have to take the expression
of Eq. (89),

$$D_i^* = \frac{z\, \Gamma_0 l^2}{6}$$   (125)

where the effective jump frequency $\Gamma_0$ follows from the atom-defect exchange
frequency by incorporating the appropriate factor for defect concentration and
another factor for correlation so that

$$\Gamma_0 = \omega f c_V = \nu f c_V \exp\left(-\frac{G_S - G_1}{k_B T}\right)$$   (126)

which is to be compared with Eq. (116).
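The thermodynamic factor φ = 1 + ∂ln γ/∂ln c can be made concrete with a model for the activity coefficient. The sketch below assumes a regular solution, for which ln γ = W(1 − c)²/(k_B T) with an interaction parameter W; this model, the function names, and the numerical values are assumptions for illustration and are not taken from the text. For this model the factor has the closed form 1 − 2Wc(1 − c)/(k_B T), which the numerical derivative should reproduce.

```python
def ln_gamma_regular(c, w_over_kT):
    """ln(gamma) of a regular solution, W (1 - c)^2 / (k_B T) (model assumption)."""
    return w_over_kT * (1.0 - c) ** 2

def thermodynamic_factor(c, w_over_kT, h=1e-6):
    """phi = 1 + c * d(ln gamma)/dc, evaluated by a central finite difference."""
    slope = (ln_gamma_regular(c + h, w_over_kT)
             - ln_gamma_regular(c - h, w_over_kT)) / (2.0 * h)
    return 1.0 + c * slope

for w_over_kT in (-1.0, 0.0, 1.0, 2.0):
    phi = thermodynamic_factor(0.5, w_over_kT)
    closed_form = 1.0 - 2.0 * w_over_kT * 0.5 * (1.0 - 0.5)
    print(f"W/kT = {w_over_kT:+.1f}: phi = {phi:.4f} (closed form {closed_form:.4f})")
```

In this model the ideal solution (W = 0) gives φ = 1, a positive W reduces φ, and φ vanishes at c = 0.5 when W/(k_B T) = 2.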


5.2.2.4 Diffusion Walk in an Inhomogeneous Crystal 

It goes without saying that not only the thermodynamic factor by its dependence 
on composition, but also the other quantities involved, make diffusivity inhomo- 
geneous in the general case. This is especially true if crystal defects or boundaries 
constitute enhanced diffusion pathways (see Section 5.4.3) or if for energetic rea- 
sons the diffusion-mediating defect is confined in some region of the crystal such 
as vacancies in the antiphase boundaries of a long-range ordered structure, an ef- 
fect which becomes strikingly evident when studying domain growth by kinetic 
Monte Carlo simulations. Mass transport in a general diffusion field then de- 
pends on the local properties of the random walk, and for a time interval t we 
can write Eq. (127) in terms of the flux in one dimension (Manning 1968; Le 
Claire 1958): 
The composition change it causes consequently results as (Allnatt and Rice 1960):

The concentration dependence of the diffusion coefficient implied in Eq. (128)
can lead to unexpected diffusion behavior in alloys, such as transient sharpening
of an interface concentration profile, as was recently discovered (Erdélyi et al.
2004).










Up to now we have treated the diffusion of the atom species considered with 
respect to a fixed reference frame, and especially the behavior of the diffusion- 
mediating defect (most often a vacancy) as independent of the diffusion of other 
species. If, however, several atom species diffuse with different rates, a net flow of 
vacancies is to be expected, and the vacancy presents itself with unequal probabil- 
ity on different sides of the moving atom. The effects of this vacancy flow, ques- 
tions of the reference frame against which the fluxes have to be measured, and 
the necessity to distinguish between different types of diffusion coefficients will 
be discussed in Section 5.4.1. 


5.2.3 Atom Migration in Ordered Alloys


5.2.3.1 Experimental Approach to Atom Kinetics in Ordered Alloys 

Ordered intermetallic compounds (intermetallics) are favorite subjects of study in
alloy physics because of their interesting strength and electromagnetic properties,
which are linked to the long-range ordered structure. For thermodynamic reasons
the degree of long-range order (LRO) depends on temperature. Out of
equilibrium, the stability of the ordered state is limited by the kinetic processes altering
it. It is therefore of great practical relevance to study and characterize atom
kinetics in ordered intermetallics. How can details of the atomic movement be
observed in these alloys? The usual diffusion experiments, e.g., by measuring tracer
diffusivity (Frank et al. 1997; Divinski et al. 1998) or recently by secondary ion
mass spectrometry (SIMS) analysis of the diffusion of a corresponding stable
isotope (Frank et al. 1995, 2001), give information on atom jumps during long-range
diffusion at a constant average degree of LRO ("steady-state diffusion"). The same
holds for the observation of the atomic jump vector by various scattering methods
under equilibrium conditions (Sepiol et al. 1998a; Thiess et al. 2001).

A completely different approach concerns so-called “order-order” relaxation ex- 
periments (Kozubski 1997; Pfeiler 2000; Pfeiler and Sprusil 2002). In this case, 
the system under observation is pushed slightly out of its thermodynamic equilib- 
rium state by the application of a small temperature shift, and the subsequent re- 
laxation into a new equilibrium is observed. During such order relaxation experi- 
ments just those atom jumps are observed which entail a change in the degree of 
LRO. 

Interestingly, there is up to now only one experimental method with the neces- 
sary high precision so that the very small changes of the degree of order corre- 
sponding to slight changes of temperature can be resolved: residual resistometry 
(REST), which is known for its ultrahigh resolution of structural changes (Pfeiler 
1997). Here the sample is cooled to low temperature for each measuring point so 
as to virtually eliminate the phonon contribution to electrical resistivity, thereby 
improving the signal-to-noise ratio. REST measurements yield an integral signal 
composed of several effects, and considerable experience is necessary to separate 
them correctly. We are of course interested in interpreting the macroscopic kinetics
in the light of what is happening at the atomic scale. What is needed is firstly
information which is as detailed as possible on the single atom jump in different
environments, and secondly a method to sum up the individual atom jumps sta- 
tistically and therefore to arrive at the mesoscopic and/or macroscopic kinetic 
behavior. The first requirement can be met by employing ab-initio jump profile 
calculations based on modern density functional methods (Pfeiler et al. 2004). A 
natural statistical method of evaluating the cumulative effect of all single atom 
jumps is Monte Carlo simulation as it is practically free from additional assump- 
tions (Pfeiler et al. 2004). 


5.2.3.2 Jumps Within and Between Sublattices 

Some special considerations apply for atom kinetics in long-range ordered alloys. 
Any approximation based on the solution being dilute is no longer valid since the 
stoichiometry usually corresponds to a simple numerical proportion (or at least 
approximately so), putting these alloys in the intermediate composition range. 
The elementary cell of ordered alloys contains atoms sitting on different sublat- 
tices. Jumps may carry an atom from one sublattice to another or to a neighbor- 
ing position on the same sublattice. Therefore in the general case a certain num- 
ber of nonequivalent jump types arises, which we have to account for when we 
write down the diffusion coefficient in terms of atomistic kinetic properties.
Taking Eq. (101) as a definition of D we again regard, as in Eq. (106), the atom
displacement R as composed of possibly different jump vectors $\mathbf{r}_i$ (i = 1, ..., n)
which can be classified into M different jump types with respective jump length
$r_\mu$ and jump number $n_\mu$. For cubic crystal structures, Eq. (129) can then be shown
(Manning 1968) to apply,

$$D = \frac{f}{6\tau} \sum_{\mu=1}^{M} n_\mu r_\mu^2$$   (129)


with the correlation factor f measuring the effectiveness of this ensemble of 
jumps for long-range atom transport. This formulation was used by Bakker and 
Westerveld (1988) (see also Bakker 1987) to obtain expressions for the diffusion 
coefficient in ordered intermetallics in terms of the jump frequencies involved, 
assuming nearest-neighbor (NN) exchange with a single vacancy. For instance, 
in the B2 structure, Fig. 5.18(a), we can write for the total rate of A atom jumps 


$$\frac{n_A}{\tau} = \frac{N_A^\alpha}{N_A}\, \Gamma_A^{\alpha\beta} + \frac{N_A^\beta}{N_A}\, \Gamma_A^{\beta\alpha}$$   (130)

Here $N_A^\alpha$ means the number of A atoms sitting on the α sublattice, $N_A$ the total
number of A atoms, and $\Gamma_A^{\alpha\beta}$ the jump rate of an A atom from the α to the β
sublattice. The jump rate will depend on the availability of a vacancy according to Eq.
(131),

$p_V^\beta$ being the probability of finding a vacancy on β next to an A atom on α and
$\omega_A^{\alpha\beta}$ the atom-vacancy exchange rate under this premise. Invoking the principle
of detailed balance for jumps in equilibrium,

$$N_A^\alpha\, \Gamma_A^{\alpha\beta} = N_A^\beta\, \Gamma_A^{\beta\alpha}$$   (132)

the total rate of A atom jumps can be written as Eq. (133).

$$\frac{n_A}{\tau} = 2\, \frac{N_A^\alpha}{N_A}\, \Gamma_A^{\alpha\beta}$$   (133)

Given these ingredients it is easy to write down the diffusion coefficient for
species A:

$$D_A = f_A\, \frac{c_A^\alpha}{c_A}\, p_V^\beta\, \omega_A^{\alpha\beta}\, a^2$$   (134)


with $f_A$ the relevant correlation factor, $c_A^\alpha$ and $c_A$ the fraction of A atoms on the α
sublattice and in total, respectively, and a the lattice constant. It is to be noted that
the quantities appearing here, especially the correlation factor and the
concentration of defects, are very sensitive to the state of long-range order and to deviations
from ideal stoichiometry. Figure 5.33 illustrates the strong dependence of the
correlation factor on the degree of order. It is the result of computer simulation of
diffusion in the B2 structure (Bakker et al. 1976). The system considered here
has an order-disorder transition temperature $T_t$ given by $v/(k_B T_t) = 0.3$ where v
is the mixed interaction energy, or "ordering energy" (cf. p. 244). Above this
temperature the correlation factor f is almost constant and close to 1, indicating a
weakly correlated diffusion mechanism. Below $T_t$, the correlation factor
decreases sharply with temperature, because strong correlation effects hamper
diffusion to an ever-increasing extent. The reason is that any mass transport in B2
has to start with a local disruption of LRO. In an almost perfectly ordered state at
low temperature, the Gibbs free energy cost of disordering is high as the entropy
increase associated with generating an antisite defect will then no longer
compensate for the enthalpy expenditure.

Fig. 5.33 Logarithm of correlation factor for diffusion of one atom
species in the B2 structure versus reduced reciprocal temperature
(Bakker et al. 1976). $v/(k_B T) = 0.3$ corresponds to the order-disorder
transition temperature $T_t$.

Fig. 5.34 (a) L1₂ ordered structure; (b) nearest neighbors for a majority
A atom; (c) nearest neighbors of a minority B atom (after Numakura et
al. 1998).

Regarding the L12 structure (stoichiometry A3B, Fig. 5.34(a)) as another example
of ordered intermetallics, we note an important qualitative difference in comparison
to B2. In the B2 structure, all nearest neighbors (NN) of an atom belong
to the other sublattice. In L12, the minority B atoms are likewise surrounded only
by unlike atoms (12 A atoms), but majority A atoms are surrounded by eight A atoms
and only four B atoms as nearest neighbors. This means that any NN jump in B2 ends
on the other sublattice, but in L12 majority atoms can diffuse within their own
sublattice by NN jumps. The different possibilities for atom jumps are represented
schematically in Fig. 5.35. We have to distinguish between atom jumps on the
majority α sublattice (intra-sublattice), which are neutral with respect to order
("0" in Fig. 5.35, arrows in the vertical direction), and jumps between the
sublattices (inter-sublattice; Fig. 5.35(a): horizontal arrows, "+" increases the
number of antisites by one, "−" decreases the number of antisites by one). The
restricted possibilities for the B2 lattice are displayed in Fig. 5.35(b) as a contrast.
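The neighbor counts quoted above can be checked by brute force. The following is a minimal sketch, assuming ideal cubic cells with fractional coordinates; the function name `nn_counts` and the basis lists are illustrative choices made here, not notation from the text.

```python
from itertools import product

# Assumed ideal cubic bases (fractional coordinates, lattice constant = 1).
L12_BASIS = [("B", (0.0, 0.0, 0.0)),            # minority atom on the cube corner
             ("A", (0.0, 0.5, 0.5)),            # majority atoms on the face centers
             ("A", (0.5, 0.0, 0.5)),
             ("A", (0.5, 0.5, 0.0))]
B2_BASIS = [("A", (0.0, 0.0, 0.0)),
            ("B", (0.5, 0.5, 0.5))]


def nn_counts(basis, site):
    """Count the nearest neighbors of `site` by species.

    All atoms in the 3 x 3 x 3 block of cells around the origin are scanned,
    and those at the smallest nonzero distance from `site` are tallied.
    """
    neighbors = []
    for (i, j, k), (species, (x, y, z)) in product(
            product(range(-1, 2), repeat=3), basis):
        d2 = ((x + i - site[0]) ** 2 + (y + j - site[1]) ** 2
              + (z + k - site[2]) ** 2)
        if d2 > 1e-12:                          # skip the central atom itself
            neighbors.append((d2, species))
    d2_min = min(d2 for d2, _ in neighbors)
    counts = {}
    for d2, species in neighbors:
        if abs(d2 - d2_min) < 1e-9:
            counts[species] = counts.get(species, 0) + 1
    return counts


print(nn_counts(L12_BASIS, (0.0, 0.0, 0.0)))    # minority B atom in L12
print(nn_counts(L12_BASIS, (0.0, 0.5, 0.5)))    # majority A atom in L12
print(nn_counts(B2_BASIS, (0.0, 0.0, 0.0)))     # A atom in B2
```

A majority atom in L12 finds eight like neighbors, so a vacancy can travel on that sublattice without creating antisites, whereas in B2 every nearest neighbor is of the other species.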

Fig. 5.35 (a) Atom jump types in L12; (b) atom jump types in B2.
Schematic representation of sublattices as vertical lines.

It is clear that any experimental method such as REST, which is sensitive to
changes in the degree of order, will see only the effect of order-disorder
inter-sublattice atom jumps ("+", "−"), whereas classical diffusion measurements
will see mainly the effect of long-range atom transport by intra-sublattice jumps
along the majority sublattice. Adequate caution must therefore be exercised when
comparing activation enthalpies obtained from different methods of measurement. For
example, Arrhenius analysis of order-order relaxation times observed in residual
resistivity measurements on Ni3Al yields an activation energy of 4.6 eV (Kozubski
and Pfeiler 1996), whereas in standard Ni tracer diffusion experiments in Ni3Al
an activation energy of about 3 eV is found (Hancock 1971; Hoshino et al. 1988).
(See Section 5.2.3.3 for a discussion of a possible co-operative jump mechanism.)
Distinguishing between α → α intra-sublattice jumps and α ↔ β inter-sublattice
jumps, the expressions analogous to Eqs. (130)-(134) read for the L12 lattice
Applying the principle of detailed balance again leads to
= TE A okok + por Ja? (137) 


Dg follows by replacing A by B, 
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Dg = zh (pavos + phog )a? (138) 


Equations (137) and (138) correspond to a six-frequency model of diffusion in the
ordered L12 structure, where two of the frequencies do not appear explicitly as
they have been eliminated by the principle of detailed balance. Analogous formulae
for the D03 (Fe3Si), the A15 (Cr3Si), and the C1 (CaF2) structures and the
Laves phase C15 (MgCu2) can be found in the original work of Bakker and
Westerveld (1988).
Of course expressions like Eqs. (134), (137), and (138) must remain of academic
value only if the quantities appearing therein cannot be determined by theoretical 
or experimental methods. The probability of finding a vacancy next to a given 
atom follows from the vacancy concentration on the respective sublattice and the 
binding energy of the vacancy to the atom species considered. Both can be ob- 
tained from solution models on various levels of approximation (details in Sec- 
tion 5.1), or from more direct, modern ab-initio methods based on density func- 
tional theory (cf. Chapter 11 and Section 5.2.3.3), followed by a treatment 
according to the principles of statistical thermodynamics in order to obtain the 
temperature dependence (cf. Chapter 10). The atom-vacancy interchange frequency
ω can be assessed by a calculation of the energy profile of atom migration,
using approximate methods based on potentials such as the EAM or again by an 
ab-initio treatment. The activation process of the atom—vacancy interchange may 
be analyzed either in a static formulation, if possible allowing for the relaxation of 
a fairly large number of coordinates (nudged elastic band method; cf. Chapter 11 
and Section 5.2.1.2), or in a molecular dynamics version (see Chapter 12 and
Section 5.2.1.2). A more difficult problem is the determination of the tracer
correlation factors f_A and f_B. One possible strategy is kinetic Monte Carlo simulation
(Bakker et al. 1976; Murch 1984), others are random-walk approaches (Manning 
1971; Bakker 1979; Stolwijk 1981) or the path probability method (Sato et al. 
1985). 

In an extended study comprising a series of papers, Kikuchi and Sato (1969, 
1970, 1972) have presented a thorough analysis of diffusion in B2 ordered binary 
alloys by means of the path probability method, revealing the role of the various 
factors in the tracer diffusion coefficient of a component, and how the two sub- 
lattices contribute to them. A direct calculation of the correlation factor f was 
achieved. It proved convenient to consider two different correlation factors f_αβ
and f_βα, dependent on the tracer atom (always belonging to the same species,
e.g., a B atom) jumping from a site on either the α or the β sublattice. Equation
(108) is applied separately to each of the two cases. The overall correlation factor
is then the harmonic mean

$$ f = \frac{2 f_{\alpha\beta} f_{\beta\alpha}}{f_{\alpha\beta} + f_{\beta\alpha}} \qquad (139) $$


and shows, especially for a stoichiometric alloy, the expected sharp decrease (tan- 
tamount to strong correlation) as perfect LRO is established. The tracer diffusion 
coefficient of one of the components, say the B component, can be factorized ac- 
cording to Eq. (140), 


whose factors include an attempt frequency, a vacancy availability factor V_B, and
an effective jump frequency factor W_B, which is in effect a Boltzmann factor
containing in the exponent the energy of the bonds broken by the atom when jumping.
While Eq. (140) is formally equivalent to what appears in Eq. (126) together
with Eq. (125), the interesting information is contained in the behavior
of the individual factors. Both V_B and W_B turn out to be quite sensitive to
stoichiometry and state of order, in fact even more so than the correlation factor
f itself. Like the correlation factor, they can be decomposed into contributions
from the two sublattices, as a weighted arithmetic mean in the case of V_B and
another harmonic mean in the case of W_B. The behavior of the three contributing
factors is illustrated in Fig. 5.36 (Kikuchi and Sato 1972). Here reduced versions
of the quantities V_B and W_B are plotted together with the correlation factor
f versus the reciprocal temperature normalized to the order-disorder transition
temperature. The parameter α = c_A − 0.5 is a measure of deviation from the
stoichiometric composition, and the parameter U is a measure of energy asymmetry,
defined as U = (ε_AA − ε_BB)/v with v = ε_AA + ε_BB − 2ε_AB, ε being the
respective pair interaction energy between atom species. The reduced quantities
are V_B/c_V, c_V being the overall vacancy concentration, and
W_B exp(−7ε_AB β), with β = 1/(k_B T).

In L12 ordered intermetallic compounds such as Ni3Al, Ni3Ga and Ni3Ge, diffusion
takes place mainly on the majority sublattice, where even the minority
atoms are able to diffuse rapidly as an antistructural atom population (Numakura 
et al. 1998). This explains also why the dependence on stoichiometry of the mi- 
nority atom diffusion coefficient is practically identical to that of the fraction of 
B atoms in antisite position (Belova and Murch 1999). A possible role of six- 
jump cycles (see Section 5.2.3.2) could be ruled out because this would require B 
atoms to diffuse faster than A atoms, a fact which is not supported by the experi- 
mental findings. 

Another ordered structure which has gained much attention in recent years is
L10, Fig. 5.18(c), also derived from the fcc structure, but with stoichiometry 1:1.
The structure consists of alternate layers of A and B atoms. The lattice cell is
tetragonally distorted, making the tetragonal c axis (normal to the layers) an easy
axis for magnetization in the case of ferromagnetic alloys such as FePd, FePt,
CoPt. These materials are under investigation as future high-density magnetic or
magneto-optic recording media, envisioned in the form of thin films or
nanostructures. As in L12, intra-sublattice jumps provide an easy diffusion path.
Diffusion in the direction normal to the A/B layers requires, however,
inter-sublattice jumps, as the sublattices are not connected in this direction.
Likewise, inter-sublattice jumps are necessary to produce any change of LRO. The
anisotropy of diffusion and the role of different jumps for both constituent
elements in L10 ordered FePt and TiAl have recently been discussed by Nakajima
et al. (2005), generally confirming the expected behavior (see also Kushida et al.
2003; Nosé et al. 2005). Experiments comprising residual resistivity measurements,
X-ray diffraction, magnetization, and Mössbauer absorption measurements have been
done on FePd, FePt, and CoPt thin films and foils by Issro et al. (2006) with the
aim of shedding light on order kinetics. Thin films of FePd deposited by MBE
show originally an orientation of the L10 c axis normal to the film plane. After
a temperature treatment, an increasing fraction of domain variants with the


Fig. 5.36 Reduced vacancy availability factor V_B, reduced effective
frequency factor W_B, and correlation factor f (all on logarithmic scales)
as functions of the reciprocal reduced temperature 1/T_r. α: stoichiometry
parameter; U: energy asymmetry parameter (see text). After Kikuchi and
Sato (1972).


c axis in-plane was found. Monte Carlo computer simulations of atom kinetics in 
FePt (Kozlowski et al. 2005) show the same trend, which can be understood as 
relaxation toward an equilibrium position of lower chemical and magnetic field 
energy. 
Fig. 5.37 Six-jump cycles (6JC) proposed for the B2 lattice, here NiAl.


5.2.3.3 Jump Cycles and Cooperative Atom Jumps 

Long-range atom transport in highly ordered alloys can be imagined by several 
possible mechanisms. A primary requirement, arising from the high price in 
terms of Gibbs free energy, is that any disturbance of order be local and transitory 
only. If a sublattice is continuous by nearest-neighbor connections it offers an 
easy "highway" of diffusion. Such is the case, for instance, in the L12 structure
(majority sublattice), and for the L10 structure, although in only two of the three
directions in space. In B2, however, NN jumps necessarily affect order. (See below 
for a discussion of next-nearest-neighbor (NNN) jumps.) 


Six-jump cycles Co-operative jump cycles have therefore been proposed, most 
notably the six-jump cycle (6JC), originally introduced by Elcock and McCombie 
(1958), after a suggestion by H. B. Huntington, which on completion leave the 
pristine ordered state and an atom interchanged with a vacancy. In Fig. 5.37 we 
show different kinds of possible six-jump cycles proposed for the B2 structure. 
The arrows in Fig. 5.37 indicate the path the vacancy takes around a quadrangle 
of lattice sites, completing one and a half turns. Having finished the six-jump 
cycle the vacancy has changed places with a Ni atom distant by a face diagonal 
([110] cycle) or by a cube edge ([100] cycle). The state of order is destroyed locally 
during the first part of the cycle and restored by the second, symmetric part. The 
six successive steps need not really be separate atom jumps. Thus Mishin et al. 
(2003) found by computer calculations with EAM potentials in NiAl that after 
the first NN jump of the Ni vacancy an unstable state follows (which is to be ex- 
pected considering the extreme scarcity of Al vacancies in this alloy; see below) 
so that the first and the second jump are performed simultaneously and
collectively.¹⁷⁾ For the [110] cycle, even the middle configuration corresponds to a local
maximum of energy instead of a minimum, making this 6JC effectively a three- 
jump cycle by merging together of pairs of successive jumps in a collective mo- 
tion. Because of an altogether lower energy barrier, the [110] cycle seems to be 


17) For reasons of symmetry, this is also true for the fifth and sixth jumps. 
the predominant jump cycle in NiAl. Essentially the same result was obtained in
the computer simulations of Farkas and Soulé des Bas (2001), who discovered 
also a minor contribution of more complex correlated cycles with 10 and 14 
jumps. 

As the mean square displacement of an A atom in a series of six-jump cycles of 
an α vacancy is twice that of a B atom and vice versa, it can be reasoned (Elcock
and McCombie 1958) that the ratio of the tracer diffusion coefficients of the two 
species must for low enough temperatures obey the inequality in Eq. (141). 


$$ \frac{1}{2} < \frac{D_A^*}{D_B^*} < 2 \qquad (141) $$


The validity of Eq. (141) is confirmed by experiment for a series of B2 alloys, such 
as AgMg (Domian and Aaronson 1964), AuCd (Huntington et al. 1961; Gupta 
et al. 1967), AuZn (Gupta and Lieberman 1971), CoGa (Bose et al. 1979), making 
the 6JC eligible as a mechanism for self-diffusion in these materials. Results con- 
flicting with this requirement were, however, reported for AuZn (Hilgedieck et al. 
1983), CoGa (Stolwijk et al. 1977), NiGa (Donaldson and Rawlings 1976) and 
NiAl (Hancock and McDonnell 1971; Lutze-Birk and Jacobi 1975), inducing 
some workers to propose alternative diffusion mechanisms like the antistructure 
bridge mechanism or the triple defect mechanism (see below). 

Generally, it has to be kept in mind that the importance of clearly defined cor- 
related jump cycles rapidly diminishes when the number of defects increases and 
the state of almost perfect order or the region of ideal stoichiometry is left. This is 
perfectly plausible as a jump cycle is a very low-entropy path allowing few alterna- 
tive variants. Even so, according to the statistical behavior of atom jumps which 
can be observed in kinetic Monte Carlo (KMC) simulations, correlated jump 
cycles should not be imagined as a strict step-by-step sequence of events. Rather, 
they result from many futile attempts and reversals of incomplete cycles. KMC 
investigations with improved residence-time algorithms (Athènes et al. 1997)
have revealed in the B2 structure the importance of alternate, asymmetrical cycles 
branching off from conventional 6JC, together with new antisite-assisted cycles 
combining properties of the 6JC and the antistructure bridge mechanism (see be- 
low). The relative contribution of these mechanisms is strongly dependent on 
asymmetry of the like-atom interaction energies, deviation from ideal stoichiometry,
and temperature. As was shown by Athènes et al. (1997), their interplay can
produce a negative curvature of the Arrhenius plot of the diffusion coefficient, 
without having recourse to a divacancy (triple-defect) mechanism. 
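The elementary step of a residence-time (rejection-free) KMC scheme can be sketched as follows. This is only the generic textbook form, not the improved algorithm of Athènes et al. (1997); the event labels and rate values are hypothetical and serve purely as an illustration.

```python
import math
import random


def residence_time_step(rates, rng):
    """Perform one rejection-free KMC step.

    rates: dict mapping an event label to its rate (inverse time units).
    rng:   a random.Random instance.
    Returns the chosen event and the time increment of this step.
    """
    total = sum(rates.values())
    threshold = rng.random() * total
    cumulative = 0.0
    chosen = None
    for event, rate in rates.items():
        cumulative += rate
        chosen = event
        if threshold < cumulative:      # event picked with probability rate/total
            break
    # Exponentially distributed residence time with mean 1/total.
    dt = -math.log(1.0 - rng.random()) / total
    return chosen, dt


# Hypothetical example: a vacancy that can complete or reverse a jump.
rng = random.Random(0)
example_rates = {"forward": 3.0, "back": 1.0}
tally = {"forward": 0, "back": 0}
elapsed = 0.0
for _ in range(20000):
    event, dt = residence_time_step(example_rates, rng)
    tally[event] += 1
    elapsed += dt
```

Every step executes an event, so the many reversals of incomplete cycles mentioned above appear explicitly in the event sequence instead of being hidden in rejected trials.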

Six-jump cycles have also been proposed for the L12 structure by Hancock
(1971). Figure 5.38 shows two variants of such cycles for Ni3Al. Debiaggi et al.
(1996) showed by simulation based on EAM potentials that, at least in Ni3Al,
long-range diffusion of both atom species on the majority sublattice is for energy
reasons preferred over inter-sublattice diffusion via 6JC. The same conclusion
was also drawn by other authors (Numakura et al. 1998; Belova and Murch
1999) on the basis of experimental data (dependence of diffusivity on
stoichiometry).

Fig. 5.38 Six-jump cycle in the L12 structure: (a) bent; (b) straight (From Debiaggi et al. 1996).

The condition imposed on the tracer diffusion coefficients (Eq. (141) for B2)
reads, for the L12 lattice (Maeda et al. 1993):


$$ 0.1064 < \frac{D_A^*}{D_B^*} < 0.8524 \qquad (142) $$





It is on this basis that 6JC diffusion can be excluded for Ni3Al.
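The two windows for the ratio of tracer diffusion coefficients can be collected in a small helper that tests a measured ratio against them. This is a sketch only: the function name and the dictionary are illustrative, the ratio is taken as D_A*/D_B* with A the majority species in L12, and the numerical inputs in the usage lines are hypothetical, not measured values.

```python
# Bounds on D_A*/D_B* for a pure six-jump-cycle mechanism at low temperature.
SIX_JUMP_CYCLE_WINDOW = {
    "B2": (0.5, 2.0),           # Elcock and McCombie (1958)
    "L12": (0.1064, 0.8524),    # Maeda et al. (1993)
}


def compatible_with_6jc(ratio_da_db, structure):
    """Return True if the tracer diffusivity ratio lies inside the 6JC window."""
    lower, upper = SIX_JUMP_CYCLE_WINDOW[structure]
    return lower < ratio_da_db < upper


# Hypothetical ratios, for illustration only.
print(compatible_with_6jc(1.0, "B2"))    # inside the B2 window
print(compatible_with_6jc(1.0, "L12"))   # majority atoms not slower: outside
```

A ratio inside the window makes the 6JC eligible but does not prove it; a ratio outside it rules out the 6JC as the sole mechanism.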


Cooperative Generation of Anti-site Pairs Not only atom jumps leading to long- 
range atom transport but also those responsible for a change of order (cf. Section 
5.2.3.1) may involve co-operative atom movements. In Ni3Al it is known from 
KMC simulations (Oramus et al. 2000) that the faster of two exponential pro- 
cesses of disordering consists mainly of the generation of antisite pairs. A co- 
operative mechanism of two atom jumps was therefore proposed (Schweiger 
et al. 2001), Fig. 5.39. 
Fig. 5.39 Cooperative jump sequence leading to an antisite pair
(Schweiger et al. 2001). From left to right: An Al atom (open circle) 
jumps into a Ni vacancy. A Ni atom (filled circle) occupies the Al 
vacancy and prevents the Al atom from jumping back to its original 
position. 
First, an Al atom jumps into a vacancy on the majority (Ni) lattice. For energy
reasons, there is a high probability of immediate reversal of this jump. If, how- 
ever, a Ni atom almost simultaneously jumps into the Al vacancy, the back jump 
is blocked, and an antisite pair results. Ab-initio calculation of the energy surface 
for simultaneous jumps of the Al and the Ni atom shows that the activation en- 
ergy of such cooperative jumps following representative paths is significantly 
higher than in the case of single atom jumps (Schweiger et al. 2001), and closely 
approaches the value measured in the resistivity measurements when the energy 
of formation of the initial Ni vacancy is superimposed. It is well known that the 
density of Al (β) vacancies for near-stoichiometric Ni3Al is by several orders of
magnitude smaller than the density of Ni (α) vacancies. By taking advantage of
an Al vacancy in statu nascendi a Ni atom can create an antisite, but at an energy 
cost which the system is willing to pay, as it were, because there is no alternative 
mechanism available. 


Antistructure Bridge (ASB) Mechanism The conventional six-jump cycle (Elcock 
and McCombie 1958), although able to explain diffusion behavior in B2 ordered 
compounds in the vicinity of the ideal stoichiometric composition at low enough 
Fig. 5.40 Antistructure bridge (ASB) mechanism of diffusion in an
ordered B2 lattice (Kao and Chang 1993). Uppermost unit cell: arrows I
and II designate two NN jumps leading to an effective NNN jump. In 
the cells below other possibilities of effective jumps to more distant 
neighborhood positions are shown. Each effective jump consists of two 
NN jumps of the vacancy, the second involving a pre-existing 
antistructure (antisite) atom. 
temperatures, quickly loses importance at large deviations from stoichiometry. As
an alternative in this region the antistructure bridge (ASB) mechanism was intro- 
duced as a model by Kao and Chang (1993), Fig. 5.40. It depends on a sufficient 
concentration of antistructure (antisite) atoms, which serve as “bridges” by which 
a vacancy can migrate through the lattice without changing the ordered state. In 
the topmost unit cell of Fig. 5.40, two NN jumps of the vacancy result in an effec- 
tive jump to an NNN position. In the elementary cells below, other possible effec- 
tive jumps are shown, which lead to lattice positions in higher-order coordination 
shells of the initial position of the vacancy. All of these jumps, however, are equiv- 
alent to two NN vacancy jumps, which together leave the state of LRO un- 
changed. 


Triple-Defect (TD) Mechanism This mechanism was proposed by Stolwijk et al. 
(1980) to explain the tracer diffusion data in B2 ordered CoGa, where the require- 
ment of Eq. (141) for the ratio of the tracer diffusion coefficients of the two spe- 
cies was not fulfilled. Frank et al. (2001) reverted to the TD mechanism to explain 
the diffusion behavior of NiAl, especially on the Ni-rich side. In this alloy, triple 
defects (cf. Section 5.1) play a very prominent role, Ni vacancies occurring more 
frequently than Al vacancies by several orders of magnitude. Figure 5.41 shows
the steps of the TD mechanism (Frank et al. 2001). By a series of vacancy
exchanges, a Ni atom in a triple defect position is exchanged with an adjacent Al
atom.

Fig. 5.41 Triple-defect (TD) mechanism in NiAl (Frank et al. 2001).
An atom in an antisite position changes places with an NNN atom in
a regular position with the help of a vacancy pair. The antisite atom
together with the two vacancies constitutes a triple defect.


Next-nearest-neighbor (NNN) jumps We have hitherto made the tacit assump- 
tion that diffusion jumps in intermetallics always end up in an NN vacancy posi- 
tion. The possibility of next-nearest-neighbor (NNN) jumps in the B2 structure 
cannot be excluded a priori, however, in view of the great advantage of keeping 
the ordered state intact, and bearing in mind that the NNN jump distance is 
only 15% larger than the NN distance. This mechanism was first proposed by 
Donaldson and Rawlings (1976) for NiGa. Exchange of an atom with an NNN 
vacancy represents a viable alternative to correlated jump cycles if the energy 
of migration is smaller than the energy needed to create disorder. In Table 5.7 
(Neumann 1980) we have arranged some common B2 alloys according to their 
enthalpies of formation which are correlated with the energy needed to create an
antisite defect. The higher this value, the more frequent NNN jumps should 
become as they are relatively more favored. 
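The figure of 15% quoted above follows directly from the geometry of the B2 cell. As a short worked check, with a the cubic lattice constant and the symbols d_NN and d_NNN introduced here only for this purpose:

```latex
\[
  d_{\mathrm{NN}} = \frac{\sqrt{3}}{2}\,a \approx 0.866\,a, \qquad
  d_{\mathrm{NNN}} = a, \qquad
  \frac{d_{\mathrm{NNN}}}{d_{\mathrm{NN}}} = \frac{2}{\sqrt{3}} \approx 1.155 .
\]
```

The NN jump runs along half a body diagonal to a site of the other sublattice, whereas the NNN jump runs along a cube edge and stays on the same sublattice.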

In B2 ordered FeAl, the results of quasielastic Mössbauer spectroscopy (Vogl
and Sepiol 1994) and nuclear resonant scattering (Sepiol et al. 1998b) indicated
that diffusion of Fe takes place via NN jumps, which was further supported by
Monte Carlo simulations (Weinkamer et al. 1999). Measurements of quasielastic 
neutron scattering in alloys with intermediate formation energies, namely CoGa 
(Kaisermayr et al. 2001) and NiGa (Kaisermayr et al. 2000), could not be
reconciled with the assumption of NNN jumps of Ni, nor could the diffusion
mechanism be attributed to any specific correlated cycle. Rather, a great variety of jump
modes involving several defects seems to be operative. (For a more detailed dis- 
cussion concerning the application of nuclear scattering methods to investigate 
elementary steps of diffusion, see Chapter 13.1.) 

In NiAl, NNN jumps were suggested as a plausible diffusion mechanism espe- 
cially on the Al-rich side, where there is an ample supply of constitutional Ni va- 
cancies (Mishin and Farkas 1997, 1998). An activation energy of about 2 eV found 
in previous experiments by Hancock and McDonnell (1971) agreed well with 
NNN migration energies computed with EAM potentials (Mishin and Farkas 
1997, 1998). This conclusion was, however, contested by Frank et al. (2001) on 
the basis of their own experimental results showing an almost constant diffusivity 


Table 5.7 Formation enthalpies for some B2 intermetallics (Neumann 1980).

Material    Formation enthalpy, ΔH_f [eV]
FeAl        0.34
CoGa        0.37
NiGa        0.47
NiAl        0.72
at Al-rich compositions, at odds with an NNN jump mechanism, where diffusivity
should increase markedly as more constitutional Ni vacancies become
available. Frank et al. (2001) instead proposed the TD mechanism as the principal
mode of diffusion at the stoichiometric composition and on the Al-rich side. An 
increase of diffusivity on the Ni-rich side was explained by an increased contribu- 
tion of the ASB mechanism. As a part of the ongoing controversy on diffusion 
mechanisms in NiAl, NNN jumps were recently defended with better EAM and 
ab-initio calculations (Mishin et al. 2003). 

In conclusion, it might be said that at the present state of knowledge details of 
diffusion in intermetallics, especially in the B2 ordered structure, are still very 
much a matter of active research and animated discussion. 


5.3 
Statistical Methods: from Single Jump to Configuration Changes 


Modern materials science is pursuing the ambitious but ever more realistic aim 
of describing and understanding the development of microstructure on a coher- 
ent progression of scales, reaching from the atomistic details to the macroscopic 
world (multiscale approach). The problem is related to the classical question of 
statistical physics: How does a multitude of elementary events (e.g., collisions of 
molecules) determine the behavior of the very few observables (e.g., pressure, 
temperature) we are really interested in? The difference is, however, that in alloy 
physics we have to deal with subsystems like nuclei, precipitate particles, do- 
mains of order, diffusion zones, defects of various dimensionality and extent, 
etc. Therefore, statistical behavior is very often the result of cooperation of not 
quite so many atoms or molecules, sometimes only a few hundred, in most cases 
definitely less than Avogadro’s number. It may well be doubted that it is legiti- 
mate to transfer concepts liberally from macroscopic thermodynamics, such as 
entropy and free energy, to very small subsystems, as has often been done in the 
past. There are some indications of cases where the difference might count. For 
instance, Miyazaki et al. (2005) observed in various alloys a precipitate nucleation
rate deviating from the predictions of classical theory, leading them to adopt a new
entropy concept by Tsallis (1988, 2003), which takes account of long-range inter- 
actions and therefore is nonextensive. The peculiar properties of very small sys- 
tems have become a natural topic in the treatment of nanostructures (see also 
Chapter 9) for which surface and interface interactions cannot be disregarded 
in comparison to volume interactions, and certain approximations such as using 
a simplified version of Stirling’s approximation formula must be applied with 
caution.¹⁸⁾


18) From the simplified version of Stirling's formula ln(N!) ≈ N ln N − N we get for
100 atoms 360.5, whereas the exact value is ln(100!) = 363.7. The approximate value
thus deviates by about 1%. For 30 atoms the respective values are 72.0 and 74.7,
corresponding to an error of about 4%.
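The numbers in the footnote are easy to reproduce. The sketch below uses Python's `math.lgamma`, which returns ln Γ(N + 1) = ln(N!); the function names are chosen here for illustration.

```python
import math


def stirling_simple(n):
    """Simplified Stirling approximation: ln(N!) ~ N ln N - N."""
    return n * math.log(n) - n


def exact_log_factorial(n):
    """Exact ln(N!) via the log-gamma function, ln Gamma(N + 1)."""
    return math.lgamma(n + 1)


def relative_error(n):
    """Relative deviation of the simplified formula from the exact value."""
    exact = exact_log_factorial(n)
    return (exact - stirling_simple(n)) / exact


for n in (30, 100):
    print(n, round(stirling_simple(n), 1), round(exact_log_factorial(n), 1),
          f"{100 * relative_error(n):.1f}%")
```

The relative error falls only slowly with N, which is why the approximation deserves care for clusters of a few tens or hundreds of atoms.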




In the following we will give a very concise overview and critical assessment of 
different methods that have been developed to describe how many single atom 
jumps co-operate to change the configuration of an alloy over a time interval of 
observation. Most of these concepts and techniques will receive detailed treat- 
ment in the respective dedicated chapters. 


5.3.1 
Master Equation Method 


Quite generally a master equation gives the rate of change in the occupation of a 
certain state of a system as a balance between transitions leading to it from other 
possible states, and of transitions away from it to other states, so that we can write 


$$ \frac{dP_m}{dt} = \sum_n \left( W_{nm} P_n - W_{mn} P_m \right) = (S\mathbf{P})_m \qquad (143) $$


where the occupation number P_m of state m is changed by transitions with
frequency W_nm leading toward it and W_mn leading away from it. This form of
kinetics is valid whenever a system changes its states according to a Markov process
(see Section 12.2.1.2). The transition can be written in a more compact form as a 
transfer matrix applied to the state vector P. 
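A minimal numerical sketch may make the structure of the master equation concrete. It assumes a hypothetical three-state system with made-up energies and a simple rate law chosen so that detailed balance holds; none of these choices is taken from the text.

```python
import math


def transfer_matrix(energies, kT, nu=1.0):
    """Build S with S[m][n] = W_nm (n != m) and S[m][m] = -sum_n W_mn.

    Assumed rate law: W_nm = nu * exp(-(E_m - E_n) / (2 kT)) for the
    transition n -> m, which satisfies detailed balance.
    """
    size = len(energies)
    S = [[0.0] * size for _ in range(size)]
    for m in range(size):
        for n in range(size):
            if n != m:
                w_nm = nu * math.exp(-(energies[m] - energies[n]) / (2.0 * kT))
                S[m][n] += w_nm     # gain of state m from state n
                S[n][n] -= w_nm     # corresponding loss of state n
    return S


def evolve(P, S, dt, steps):
    """Integrate dP/dt = S P with explicit Euler steps."""
    P = list(P)
    size = len(P)
    for _ in range(steps):
        dP = [sum(S[m][n] * P[n] for n in range(size)) for m in range(size)]
        P = [p + dt * d for p, d in zip(P, dP)]
    return P


# Hypothetical energies (eV) and thermal energy kT (eV).
energies = [0.0, 0.1, 0.2]
kT = 0.05
S = transfer_matrix(energies, kT)
P_final = evolve([1.0, 0.0, 0.0], S, dt=0.01, steps=5000)

weights = [math.exp(-e / kT) for e in energies]
boltzmann = [w / sum(weights) for w in weights]
```

Because each column of S sums to zero, the total occupation is conserved, and with detailed-balance rates the stationary solution is the Boltzmann distribution.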

In an advanced formulation of the master equation method (Vaks 1996; Belash- 
chenko et al. 1999) the occupation of each lattice site is left free to vary individu- 
ally. A high-dimensional configuration space is subtended by all these degrees of 
freedom of occupation, and the time evolution of an ensemble density on this 
configuration space is considered. The transfer matrix S of Eq. (143) can be
expressed in terms of single atom jumps taking place according to transition state
theory (cf. Section 5.2.1.1). Vaks and coworkers adopted a direct-exchange model 
after showing its equivalence to the vacancy-exchange model in advanced stages 
of phase transformation (Belashchenko and Vaks 1998). The underlying difficulty 
of this method is, of course, how to express the transfer matrix S as a function of
the current state of the system, that is, by means of local concentrations and
correlations. This cannot be done without some kind of averaging if one wants to
avoid outright simulation. Vaks and coworkers used various kinds of cluster 
mean-field approximations. Ordering kinetics and the development of long-range 
order domains in two dimensions could be described very well by this technique. 
Taking averages over statistical distributions deprives the method, however, of the 
possibility of describing phenomena that rest upon kinetic paths of nonmaxi- 
mum probability, such as nucleation (cf. Chapter 7 and Section 5.4.2.2). 


5.3.2
Continuum Approaches to Microscopic Diffusion and their Interrelationship with 
Atom Jump Statistics 


Another possible way to obtain microscopic diffusion equations is to start from 
continuum arguments and to set changes of field variables proportional to
thermodynamic driving forces. A field variable can be any local variable describing
the state of the system, such as an order parameter (nonconserved) or a local con- 
centration (conserved). This approach leads to the time-dependent Ginzburg- 
Landau equations (TDGL; see Chapter 10, Eq. (124)). If the field variable is con- 
centration, then the diffusion fluxes can be shown to obey Eq. (144) (Chapter 10, 
Eq. (128)): 


$$ \mathbf{j} = -M \nabla \left( \frac{\delta F}{\delta c} \right) \qquad (144) $$


i.e., the flux is proportional, by a mobility M, to the gradient of the local chemical 
potential. Together with the continuity equation and a suitable form of the free 
energy functional comprising a gradient-energy term (see Eq. (118) in Chapter 
10), this expression for the fluxes results in the well-known Cahn-Hilliard diffu- 
sion equation. This formalism is also the point of departure for a host of phase- 
field methods which in recent times have become popular in the investigation of 
complex phase transformations (see Chapter 12). 
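How the flux law and the continuity equation combine can be seen in a one-dimensional finite-difference sketch. Everything specific here is an assumption made for illustration: the double-well free energy density f(c) = c²(1 − c)², the constant mobility, the gradient-energy coefficient `kappa`, the periodic grid with unit spacing, and the explicit Euler time step.

```python
import math


def laplacian(u):
    """Second difference on a periodic grid with unit spacing."""
    n = len(u)
    return [u[(i + 1) % n] - 2.0 * u[i] + u[i - 1] for i in range(n)]


def chemical_potential(c, kappa):
    """mu = f'(c) - kappa * laplacian(c), with f(c) = c^2 (1 - c)^2."""
    lap = laplacian(c)
    return [2.0 * ci * (1.0 - ci) * (1.0 - 2.0 * ci) - kappa * li
            for ci, li in zip(c, lap)]


def free_energy(c, kappa):
    """Discrete free energy: bulk term plus gradient-energy term."""
    n = len(c)
    return sum(ci ** 2 * (1.0 - ci) ** 2
               + 0.5 * kappa * (c[(i + 1) % n] - ci) ** 2
               for i, ci in enumerate(c))


def cahn_hilliard_step(c, mobility, kappa, dt):
    """dc/dt = -div j with j = -M grad(mu), i.e. dc/dt = M laplacian(mu)."""
    lap_mu = laplacian(chemical_potential(c, kappa))
    return [ci + dt * mobility * li for ci, li in zip(c, lap_mu)]


def run_demo(n=64, steps=2000, dt=0.01, mobility=1.0, kappa=1.0):
    """Evolve a weakly modulated, initially almost uniform alloy."""
    c0 = [0.5 + 0.01 * math.cos(2.0 * math.pi * 7 * i / n) for i in range(n)]
    c = list(c0)
    for _ in range(steps):
        c = cahn_hilliard_step(c, mobility, kappa, dt)
    return c0, c


c_start, c_end = run_demo()
```

The mean concentration is conserved by construction while the free energy decreases as the modulation grows into concentration-rich and concentration-poor regions. Setting the mobility constant is a simplification of exactly the kind whose justification is at issue in the analysis that follows.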

Martin (1990) has given a revealing analysis of the interconnection between a 
stochastic model based on individual atom jumps in a master equation formula- 
tion and the TDGL approach, Eq. (144) (see Chapter 8). In a one-dimensional dis- 
crete diffusion model, an analogue of the local chemical potential could be con- 
structed, which must be uniform for an equilibrium state. The dynamic steady 
state of diffusion jumps was identified with thermodynamic equilibrium and 
shown to have a probability density equal to that derived from the canonical en- 
semble. Special care was devoted to the question of under what conditions it 
might be justified to set the mobility M constant, as is routinely done in the 
Cahn-Hilliard formulation. It was recognized that M contained a second-order 
term in the deviation from uniform concentration, which should become impor- 
tant whenever the gradient-energy contribution is nonzero. 

An atomistic application of the TDGL equations was used by Khachaturyan and 
coworkers (Chen et al. 1994), where the change in occupation of a lattice site at 
position r was made proportional to the variation of free energy by atom jumps 
to neighboring lattice sites: 





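$$ \frac{dn(\mathbf{r},t)}{dt} = \sum_{\mathbf{r}'} L(\mathbf{r}-\mathbf{r}') \frac{\delta F}{\delta n(\mathbf{r}',t)} \qquad (145) $$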


The kinetic coefficients L should be proportional to the probability of elementary 
diffusion jumps. The free energy was formulated so as to account for elastic 
strain energy, and the kinetic equation (145) was put in Fourier form. The model 
was able to explain both ordering and segregation within the same formalism and 
reproduce a wealth of precipitate morphologies including such interesting effects 
as the splitting due to elastic interaction of overgrown coherent particles in Ni-Al 
alloys (Miyazaki et al. 1982). 


Both in this variant of TDGL kinetics and in the more continuum-oriented 
methods of Cahn-Hilliard and the phase-field family, the same caveats have to 
be kept in mind: (1) these approaches are essentially based on linearization valid 
in the vicinity of equilibrium and must be viewed with caution in highly nonequi- 
librium situations; and (2) they incorporate mean-field concepts that have to be 
revised in complex alloys with non-negligible higher-order correlations. 


5.3.3
Path Probability Method 


Kikuchi (1951) devised the cluster variation method (CVM) in order to include 
multibody correlations up to arbitrary order in the configuration entropy expres- 
sion for an alloy. The most probable (i.e., equilibrium) state of the system was 
then determined by the interplay, via free energy, between internal energy and 
CVM entropy. A further logical step was to extend this concept by carrying it 
into the time dimension, thus creating the path probability method (PPM) de- 
scribed in Section 10.2.6; see also Kikuchi (1966): the most probable kinetic path 
between two states for a given energy difference is the one with the highest num- 
ber of microscopic transition possibilities. Transitions between a hierarchy of 
cluster configurations are expressed by path variables such as $Y_{ij,kl}$ giving the
probability of changing the atom pair ij to kl by an elementary kinetic step (mod- 
eled as a spin flip in an Ising system). The total number of transition possibilities 
at each step is measured by a path probability function 


\[ P = P_1 P_2 P_3 \qquad (146) \]


where the first factor is connected with the unbiased spin flip probability per unit 
time, the second is a Boltzmann factor of the energy difference before and after 
the transition, and the third a combinatorial expression, akin to CVM entropy, de- 
pendent on the path variables and taking account of the number of microscopic 
transition paths. P is now maximized at each step with respect to the path vari- 
ables, which in their turn describe the change of the configuration. 

The advantage of the PPM is that it can be used even far from thermodynamic 
equilibrium since it does not explicitly use the concept of free energy. However, 
the complexity of the formalism limits it practically to homogeneous systems 
and precludes the treatment of a vacancy or direct-exchange diffusion mecha- 
nism. 


5.3.4 
Monte Carlo Simulation Method 


This method, which is described in detail in Chapter 12, can be viewed as a way 
of doing statistics by computer experiment. No averaging is carried out before 
or on the level of basic calculation, which consists of performing random atom 
jumps on a lattice. Statistical evaluation takes place only after the jump sequence, 
in at least two ways: first by accumulating the results of several runs with the
same starting configuration and the same macroscopic parameters, but each 
time a different sequence of random numbers, and second by calculating integral 
quantities by summing over the calculation cell. 

At the microscopic level, however, there is no restriction on the freedom of 
atom jumps. In principle all correlations and all interactions are described as re- 
alistically as the jump probabilities and the underlying Hamiltonian can be for- 
mulated. (For a discussion see Chapter 12 and Section 5.2.1.1.) Specifically, low- 
probability paths do get chosen by the system with their respective statistical 
weights, and thermally activated processes are observed regularly if the activation 
enthalpy is in a viable range for the given temperature. Therefore, a nucleation 
process need not be introduced into the kinetics artificially. Likewise, the estab- 
lishment and coarsening of ordered domains happen quite naturally together 
with the ordering process itself. The power to describe rich microstructural evolu- 
tion out of a single jump process without any intervening additional assumptions 
is certainly one of the reasons for the immense popularity of this method. 
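
The procedure described above can be sketched in a few lines of code. The example below is only a minimal illustration under assumptions that are not taken from Chapter 12: a two-dimensional square lattice with periodic boundaries, a nearest-neighbor pair energy (`eps`, positive for an unmixing tendency), direct exchange of unlike neighbors instead of a vacancy mechanism, and Metropolis acceptance of each attempted jump. All function names are hypothetical.

```python
import math
import random

NEIGHBORS = ((1, 0), (-1, 0), (0, 1), (0, -1))

def site_energy(lattice, x, y, eps):
    """Energy of the four bonds of site (x, y): -eps for like, +eps for unlike pairs."""
    n = len(lattice)
    s = lattice[x][y]
    return sum(-eps if lattice[(x + dx) % n][(y + dy) % n] == s else eps
               for dx, dy in NEIGHBORS)

def total_energy(lattice, eps):
    """Integral quantity summed over the cell (each bond is seen from both ends)."""
    n = len(lattice)
    return 0.5 * sum(site_energy(lattice, x, y, eps)
                     for x in range(n) for y in range(n))

def attempt_exchange(lattice, eps, kT, rng):
    """One attempted exchange of a random nearest-neighbor pair (Metropolis rule)."""
    n = len(lattice)
    x, y = rng.randrange(n), rng.randrange(n)
    dx, dy = rng.choice(NEIGHBORS)
    x2, y2 = (x + dx) % n, (y + dy) % n
    if lattice[x][y] == lattice[x2][y2]:
        return False  # exchanging like atoms changes nothing
    before = site_energy(lattice, x, y, eps) + site_energy(lattice, x2, y2, eps)
    lattice[x][y], lattice[x2][y2] = lattice[x2][y2], lattice[x][y]
    after = site_energy(lattice, x, y, eps) + site_energy(lattice, x2, y2, eps)
    d_e = after - before  # the doubly counted shared bond cancels in the difference
    if d_e <= 0 or rng.random() < math.exp(-d_e / kT):
        return True
    # rejected: restore the previous configuration
    lattice[x][y], lattice[x2][y2] = lattice[x2][y2], lattice[x][y]
    return False

def run(n=16, attempts=2000, eps=1.0, kT=0.05, seed=1):
    """Random 50:50 start configuration followed by a sequence of attempted jumps."""
    rng = random.Random(seed)
    lattice = [[rng.choice((0, 1)) for _ in range(n)] for _ in range(n)]
    e_start = total_energy(lattice, eps)
    for _ in range(attempts):
        attempt_exchange(lattice, eps, kT, rng)
    return lattice, e_start, total_energy(lattice, eps)
```

Repeating `run` with different values of `seed` corresponds to the first kind of statistical evaluation mentioned above, and `total_energy` is an example of an integral quantity obtained by summing over the calculation cell.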


5.4 
Macroscopic Diffusion 


5.4.1 
Formal Description 


5.4.1.1 Fick’s Laws 

When studying the random walk of atoms in a crystal (Section 5.2.2.1) we have 
already seen that for a concentration gradient in an ideal solid solution Fick’s first 
diffusion law follows directly 


\[ j = -D\,\nabla C \qquad (147) \]

Applying the continuity equation (mass conservation), 


\[ \frac{\partial C}{\partial t} = -\nabla \cdot j \qquad (148) \]


Fick’s second diffusion law results, 


\[ \frac{\partial C}{\partial t} = \nabla \cdot (D\,\nabla C) \qquad (149) \]

which describes changes of a concentration field due to the diffusion process. 
For a constant diffusion coefficient D, Eq. (149) simplifies to the linear
diffusion equation


\[ \frac{\partial C}{\partial t} = D\,\nabla^2 C \qquad (150) \]


For the case of mixing in isotropic solids the diffusion coefficient D is a scalar. In 
general, however, atom flux is determined by a symmetric second-rank diffusion 
tensor $D_{ij}$.
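
As an illustration of how the linear diffusion equation can be integrated numerically, the following sketch uses an explicit finite-difference scheme in one dimension. It assumes a constant scalar D, a uniform grid, and zero flux through both ends of the grid; the function names are hypothetical. The helper `surface_deposit_profile` gives the known Gaussian solution for a deposit M (per unit area) on the surface of a half-space, against which the numerical result can be compared.

```python
import math

def diffuse_half_space(c, D, dx, dt, steps):
    """Explicit finite-difference integration of dC/dt = D d2C/dx2
    on a 1D grid with zero flux through both ends."""
    r = D * dt / dx ** 2
    if r > 0.5:
        raise ValueError("explicit scheme unstable: need D*dt/dx**2 <= 0.5")
    c = list(c)
    for _ in range(steps):
        left = [c[0]] + c[:-1]    # mirror cell at the surface: no flux
        right = c[1:] + [c[-1]]   # mirror cell at the far end: no flux
        c = [ci + r * (cl - 2.0 * ci + cr)
             for ci, cl, cr in zip(c, left, right)]
    return c

def surface_deposit_profile(M, D, t, x):
    """Gaussian profile for a deposit M on the surface (x = 0) of a half-space."""
    return M / math.sqrt(math.pi * D * t) * math.exp(-x * x / (4.0 * D * t))
```

With the whole deposit placed in the first cell (`c[0] = M / dx`), the total amount `sum(c) * dx` stays constant, and for $Dt \gg dx^2$ the computed profile approaches the Gaussian evaluated at the cell centers $(i + 1/2)\,dx$.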

Solving the linear diffusion equation under specific boundary conditions is a 
standard problem in technical applications for which we refer to the correspond- 
ing literature (see, for instance, Carslaw and Jaeger 1984; Crank 1994). An exam- 
ple of special relevance for the measurement of diffusion properties in alloys is 
the behavior of diffusion couples. When a thin film of solute is attached to an al- 
loy under investigation, for example to determine tracer diffusion coefficients, the 
“thin-film solution” of Fick’s second law for an initial condition of
$C(x, t = 0) = M\,\delta(x)$ is


\[ C(x,t) = \frac{M}{\sqrt{\pi D t}} \exp\!\left( -\frac{x^2}{4Dt} \right) \qquad (151) \]


M being the mass of the thin-film diffusion source (mass per area). Note that Eq. 
(151) corresponds to Eq. (92) of Section 5.2 for the case of a half-space. In prac- 
tical situations, chemical diffusion coefficients often turn out as composition- 
dependent. For specific boundary conditions Fick’s second law can be integrated 
in this case. A classical method is to solve the inverse diffusion problem: if the 
concentration field C(x,t) is known from experiment, the diffusion coefficient 
can be determined as a function of composition. Matano (1933) has demonstrated 
a method to solve the corresponding transformed diffusion equation. Let us con- 
sider the concentration profile of a diffusion couple which was step-like at t = 0 
(Fig. 5.42). The solution for the diffusion coefficient can be written as Eq. (152). 








Fig. 5.42 Boltzmann-Matano geometry for a diffusion couple (after Glicksman (2000)).

\[ D(C') = -\frac{1}{2t} \left( \frac{\mathrm{d}x}{\mathrm{d}C} \right)_{C'} \int_{C_R}^{C'} (x - x_M)\,\mathrm{d}C \qquad (152) \]





Here $x_M$ is the coordinate of the Matano plane, for which the shaded areas in
Fig. 5.42 balance. Equation (152) can be solved graphically if $x_M$ is
taken as the origin of the x axis.
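
Instead of the graphical solution, the Boltzmann-Matano evaluation can also be carried out numerically. The sketch below assumes a measured profile given as lists `x` (increasing) and `c` (monotonic) for one annealing time `t`, locates the Matano plane from the condition $\int (x - x_M)\,\mathrm{d}C = 0$, and evaluates the integral and the slope in Eq. (152) by the trapezoidal rule and central differences. The function names and the error-function test profile are illustrative assumptions.

```python
import math

def matano_plane(x, c):
    """Coordinate x_M for which the integral of (x - x_M) dC over the profile vanishes."""
    moment = sum(0.5 * (x[i] + x[i + 1]) * (c[i + 1] - c[i])
                 for i in range(len(x) - 1))
    return moment / (c[-1] - c[0])

def boltzmann_matano(x, c, t):
    """Return x_M and a list of (C', D(C')) pairs for the interior points."""
    xm = matano_plane(x, c)
    n = len(x)
    # integral of (x - x_M) dC from the right-hand end (C_R) to each point
    integral = [0.0] * n
    for i in range(n - 2, -1, -1):
        mean_x = 0.5 * (x[i] + x[i + 1]) - xm
        integral[i] = integral[i + 1] + mean_x * (c[i] - c[i + 1])
    result = []
    for k in range(1, n - 1):
        dc = c[k + 1] - c[k - 1]
        if dc == 0.0:
            continue  # flat part of the profile: slope dx/dC undefined
        dx_dc = (x[k + 1] - x[k - 1]) / dc
        result.append((c[k], -integral[k] * dx_dc / (2.0 * t)))
    return xm, result

def error_function_couple(x, D, t):
    """Test profile of a couple with constant D (C = 1 on the left, 0 on the right)."""
    return 0.5 * math.erfc(x / math.sqrt(4.0 * D * t))
```

For the error-function profile of a couple with constant D the evaluation should return that same D at every composition, which is a convenient check; in the flat tails of a real profile the slope $\mathrm{d}x/\mathrm{d}C$ is poorly defined and the result becomes unreliable.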

Very recently, in a simple but very effective way, equilibrium intermetallic com- 
pounds and solid-solution phases of various compositions were created by long- 
term annealing of junctions of three or more metals or alloy phases (Zhao 2005; 
Zhao et al. 2005; Zhao 2006). This “diffusion-multiple” approach enables the 
generation of a large number of alloy phases and compositions for efficiently 
mapping phase diagrams, phase properties, and kinetics. Many critical alloy data 
are obtained in this way with a high efficiency: phase diagrams, diffusion
coefficients, hardness, elastic modulus, and thermal conductivity, but also precipitation
kinetics and solution and precipitation strengthening effects. It is planned to ex- 
tend the diffusion-multiple approach in order to determine optical, magnetic, me- 
chanical, and other properties, by the development of corresponding microscale 
screening tools. It seems that thereby an extremely useful method for alloy de- 
sign has been generated, as shown in Fig. 5.43. 


Fig. 5.43 Schematic representation showing how the “diffusion-multiple”
approach can be used for alloy design and discovering new
functional materials (Zhao et al. 2005). EPMA: Electron Probe
Micro-Analysis; EBSD: Electron Back Scatter Diffraction.

5.4.1.2 Nonreciprocal Diffusion, the Kirkendall Effect

Since diffusion coefficients of constituent species in alloys are usually of the same 
order of magnitude, any mixing of atoms due to concentration gradients is 
brought about by jumps of all atoms. It is well known that the interdiffusion (or 
chemical diffusion) of atoms in a binary alloy can still be characterized by a single 
diffusion coefficient (the interdiffusion coefficient or chemical diffusion
coefficient D). This is straightforward if both atoms, A and B, diffuse at the same rate
(reciprocal diffusion), where Fick’s first law reads 


\[ j_i = -D\,\nabla C_i \quad (i = A, B) \qquad (153) \]


with $D_A = D_B = D$. If, on the other hand, the two species in a diffusion couple do
not move at the same rate (nonreciprocal diffusion), atomic planes are shifted 
(lattice flow) to compensate for a net flux of vacancies (vacancy wind). This phe- 
nomenon is known as the Kirkendall effect (Smigelskas and Kirkendall 1947). 

If a net flux of atoms occurs in a diffusion couple due to unequal diffusion
coefficients, two frames of reference can be used. (a) In the laboratory reference
frame, the ends of the sample remain unchanged, but lattice positions within the
diffusion zone, identified by surface markers, move with a velocity $v_M$ (marker
velocity) because of the unbalanced diffusion. Here, as before, a single diffusion
coefficient, the interdiffusion coefficient D, describes the macroscopic interdiffusion
of both species. (b) In the lattice reference frame, which is fixed to the atomic
planes in the diffusion zone, the ends of the couple move and two intrinsic
diffusion coefficients, $D_A$ and $D_B$, describe the microscopic diffusion process.

At any instant of time t, the fluxes of A and B atoms within the lattice frame are 
given by Fick’s first law: 


\[ j_A = -D_A \frac{\partial C_A}{\partial x} \quad \text{and} \quad j_B = -D_B \frac{\partial C_B}{\partial x} \qquad (154) \]


If $D_A \neq D_B$, a net atomic flux results as $j_{net} = j_A + j_B$. The rate at which mass is
accumulated in the diffusion zone can be related to the velocity of a marker in the 
diffusion couple, 


\[ v_M = -j_{net}\,\Omega \qquad (155) \]


where $\Omega$ is the atomic volume of the alloy, which is assumed constant. Noting
that with this assumption $C_A + C_B$ is constant, it can be shown that the velocity
can be expressed in terms of the concentration gradients:


\[ v_M = (D_A - D_B)\,\Omega\,\frac{\partial C_A}{\partial x} = (D_B - D_A)\,\Omega\,\frac{\partial C_B}{\partial x} \qquad (156) \]


The occurrence of a marker velocity, verified in the famous experiment by Smi- 
gelskas and Kirkendall (1947) using a diffusion couple of pure copper and 70-30 
brass, gives evidence for the motion of lattice planes (advective motion). Assuming
a conservation of lattice sites in the crystal, a net flux of vacancies has to
compensate for the unbalanced atom fluxes:


\[ j_V + j_A + j_B = 0 \qquad (157) \]

Inserting the atom fluxes according to Fick’s first law, Eq. (154), and considering
that $C_A + C_B$ can still be regarded as constant for the usual low concentrations of
vacancies, the vacancy flux in the lattice frame is obtained:


\[ j_V = (D_A - D_B)\,\frac{\partial C_A}{\partial x} = (D_B - D_A)\,\frac{\partial C_B}{\partial x} \qquad (158) \]


It is seen that the net flux of vacancies is proportional to the difference in the in- 
trinsic diffusion coefficients. The flux of vacancies compensating for the flow of 
crystal lattice is called the vacancy wind, which “blows” through the lattice 
against the faster-diffusing species. This means that in the diffusion zone the va- 
cancy concentration changes locally to values below or above the local equilib- 
rium value. As a consequence, lattice planes are gradually created by generation 
of vacancies and in turn are destroyed by vacancy annihilation at sinks (e.g., jogs, 
dislocation loops and stacking faults). The motion of lattice planes is therefore 
high where the concentration of vacancies is far from equilibrium which is a con- 
sequence of the different atoms diffusing at different rates. 

The Kirkendall effect has of course a consequence for Fick’s second law, which 
has to account for the advective motion of the lattice planes. It can be shown that 
Fick’s second law can be modified to give the diffusion advection equation: 


\[ \frac{\partial C}{\partial t} = \nabla \cdot (D\,\nabla C) + v_M \cdot \nabla C \qquad (159) \]

For the time-independent case and for constant D, Eq. (159) simplifies to the 
steady-state diffusion advection equation: 


\[ \nabla^2 C + \frac{v_M \cdot \nabla C}{D} = 0 \qquad (160) \]


Several applications, especially in the field of solidification and crystal growth, for 
a steadily moving interface can be treated using Eq. (160) when transforming to 
the lattice reference frame (cf. Chapter 3). 

Shortly after the discovery by Smigelskas and Kirkendall, Darken (1948) analyzed
the effect in a purely phenomenological treatment. He obtained the first
Darken equation,


\[ v_M = D_A \frac{\partial c_A}{\partial x} + D_B \frac{\partial c_B}{\partial x} \qquad (161) \]

corresponding with Eq. (156).¹⁹⁾ In addition, he obtained an essential relationship
between the “overall” interdiffusion coefficient D and the intrinsic diffusion
coefficients (the second Darken equation):


\[ D = c_B D_A + c_A D_B \qquad (162) \]


This means that, by determining the interdiffusion coefficient D, e.g., by the
Boltzmann-Matano analysis, and simultaneous measurement of the marker
velocity $v_M$, the intrinsic diffusion coefficients $D_A$ and $D_B$ involved can be obtained.
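
The inversion of the two Darken equations can be written down explicitly. Using $c_A + c_B = 1$, so that $\partial c_B/\partial x = -\partial c_A/\partial x$, the first equation reads $v_M = (D_A - D_B)\,\partial c_A/\partial x$, and together with $D = c_B D_A + c_A D_B$ this gives $D_A = D + c_A v_M/(\partial c_A/\partial x)$ and $D_B = D - c_B v_M/(\partial c_A/\partial x)$. The following sketch implements this step; the function and argument names are hypothetical, and all quantities are assumed to refer to the marker position.

```python
def intrinsic_diffusivities(d_inter, v_marker, c_a, dca_dx):
    """Solve the two Darken equations for D_A and D_B.

    d_inter  : interdiffusion coefficient D at the marker composition
    v_marker : marker velocity v_M
    c_a      : site fraction of A at the marker (c_B = 1 - c_A)
    dca_dx   : gradient of the site fraction of A at the marker (nonzero)
    """
    if dca_dx == 0.0:
        raise ValueError("the concentration gradient at the marker must be nonzero")
    c_b = 1.0 - c_a
    ratio = v_marker / dca_dx      # equals D_A - D_B
    d_a = d_inter + c_a * ratio
    d_b = d_inter - c_b * ratio
    return d_a, d_b
```

A vanishing marker velocity returns $D_A = D_B = D$, i.e., the reciprocal case of Eq. (153).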


5.4.1.3 Nonideal Solutions 

In an alloy, each atom species and vacancy diffuses with its own diffusivity. We 
can define a generalized diffusion driving force, which causes a certain drift ve- 
locity of a species i: 


\[ u_i = B_i F_i \qquad (163) \]


The mobility $B_i$ of the specific atom is in general a tensor, which relates the drift
velocity and the driving force. The generalized force driving the diffusion of a spe- 
cies i can be written as a gradient of the corresponding chemical potential, 


\[ F_i = -\nabla \mu_i \qquad (164) \]


linked to the concentration gradient. The diffusion flux, being the product of local 
concentration and drift velocity, can be written as 


\[ j_i = C_i u_i = C_i B_i F_i = -C_i B_i \nabla \mu_i \qquad (165) \]


The chemical potential $\mu_i$ is not directly accessible to experiment but can be
written in terms of the activity $a_i$, a measurable thermodynamic quantity,
conveniently expressed by the activity coefficient $\gamma_i$
($\mu_i = \mu_i^0 + k_B T \ln a_i$, $a_i = c_i \gamma_i$) describing the deviation
from the behavior of an ideal solution ($\gamma_i = 1$). Then we can write Eq. (166)
for the driving force using the site fraction $c_i$ ($= \Omega C_i$) instead
of the concentration $C_i$:


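\[ \begin{aligned}
\nabla \mu_i &= k_B T\,\nabla \ln a_i = k_B T\,(\nabla \ln c_i + \nabla \ln \gamma_i) \\
&= k_B T \left( \frac{\partial \ln c_i}{\partial c_i} + \frac{\partial \ln \gamma_i}{\partial c_i} \right) \nabla c_i \\
&= \frac{k_B T}{c_i} \left( 1 + \frac{\partial \ln \gamma_i}{\partial \ln c_i} \right) \nabla c_i
\end{aligned} \qquad (166) \]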











19) The site fractions $c_i$ are identical to the mole fractions in disordered
alloys of low vacancy concentrations and are related to the concentrations
as $c_i = \Omega C_i$.

This yields Eq. (167) for the atom flux:


\[ j_i = -k_B T B_i \left( 1 + \frac{\partial \ln \gamma_i}{\partial \ln c_i} \right) \nabla C_i \qquad (167) \]


If this is compared with Fick’s first law, Eq. (147), the corresponding intrinsic
diffusion coefficient $D_i$ can be identified:





\[ D_i = k_B T B_i \left( 1 + \frac{\partial \ln \gamma_i}{\partial \ln c_i} \right) \qquad (168) \]


The term within the parentheses is called the thermodynamic factor $\phi$ and takes
into account that for diffusion in nonideal solutions the driving concentration 
gradient has to be modified. Note that this result was obtained independently in 
Section 5.2 [Eq. (123)] using the bias in individual atom jump probabilities. 

The thermodynamic factor entering the diffusion coefficient for nonideal, con- 
centrated solid solutions has a remarkable consequence for unmixing systems. 
We show below that the thermodynamic factor is proportional to the second
derivative of a local Gibbs free energy per atom with respect to concentration, $g''(c)$,
if an adequate local concentration can be defined. First we write the local Gibbs
energy for an A-B alloy ($c_A + c_B = 1$) as $g = (1 - c_B)\mu_A + c_B \mu_B$. The first
derivative, $g' = \partial g/\partial c_B$, is given as





\[ g' = -\mu_A + (1 - c_B) \frac{\partial \mu_A}{\partial c_B} + \mu_B + c_B \frac{\partial \mu_B}{\partial c_B} = \mu_B - \mu_A \qquad (169) \]


Note that in the first expression the sum of the second and the fourth term is
zero due to the Gibbs-Duhem relationship $c_A\,\mathrm{d}\mu_A + c_B\,\mathrm{d}\mu_B = 0$. Expressing the
chemical potentials by activities, i.e.,


\[ g' = \mu_B^0 + k_B T \ln a_B - \mu_A^0 - k_B T \ln a_A \qquad (170) \]


the second derivative, $g'' = \partial^2 g/\partial c_B^2$, is found to be


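\[ g'' = k_B T\,\frac{\partial}{\partial c_B} \left( \ln a_B - \ln a_A \right) = k_B T \left( \frac{1}{c_B} + \frac{1}{c_A} \right) \phi = \frac{k_B T}{c_A c_B}\,\phi \qquad (171) \]

where $\phi = 1 + \partial \ln \gamma_i/\partial \ln c_i$ is the thermodynamic factor, which by the
Gibbs-Duhem relationship is the same for both species.
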
The interdiffusion coefficient can therefore be written as:


\[ D = c_B D_A + (1 - c_B) D_B = k_B T \left[ c_B B_A + (1 - c_B) B_B \right] \phi = \left[ c_B B_A + (1 - c_B) B_B \right] c_A c_B\, g'' = M_0\, g'' \qquad (172) \]





The thermodynamic factor $\phi$, and in consequence the diffusion coefficient D, is
proportional to $g''$, which can be either positive or negative. For a decomposing
alloy system the two curves $g_A(c)$ and $g_B(c)$ (where c is either $c_A$ or $c_B$) may be
connected, having a maximum between the two minima. The two inflection
points ($g'' = 0$) of the combined g(c) curve lie on the spinodal, which connects the
inflection points for all temperatures. For all concentrations within the spinodal
($g'' < 0$) the diffusion flux turns from downhill diffusion (down the concentration
gradient) to uphill diffusion (up the concentration gradient); i.e., the diffusion
coefficient changes its sign to D < 0. This can be interpreted as an instability of alloys
in this composition range with respect to infinitesimally small concentration
fluctuations: any concentration fluctuation, however small, starts the decomposition
process. This spontaneous decomposition is called spinodal decomposition,
in contrast to decomposition by nucleation and growth when D > 0.
A detailed treatment of this distinction is given in Chapter 7 on phase
transformations.
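
The sign change of the thermodynamic factor can be made concrete with a simple model. The sketch below assumes a regular solution, which is not discussed in the text: $g(c) = W c (1 - c) + k_B T [c \ln c + (1 - c) \ln(1 - c)]$ with $c = c_B$ and an interaction parameter W > 0, so that $\ln \gamma_B = W (1 - c)^2/k_B T$. For this model the thermodynamic factor is $1 - 2Wc(1 - c)/k_B T$, which equals $c_A c_B\, g''/k_B T$ and becomes negative inside the spinodal. All names are hypothetical.

```python
import math

def g_regular(c, W, kT):
    """Gibbs energy per atom of a regular solution (c = site fraction of B)."""
    return W * c * (1.0 - c) + kT * (c * math.log(c) + (1.0 - c) * math.log(1.0 - c))

def g_second(c, W, kT):
    """Analytic second derivative g''(c) of the regular-solution energy."""
    return -2.0 * W + kT / (c * (1.0 - c))

def thermodynamic_factor(c, W, kT):
    """1 + d ln(gamma_B) / d ln(c_B) with ln(gamma_B) = W (1 - c)**2 / kT."""
    return 1.0 - 2.0 * W * c * (1.0 - c) / kT

def spinodal_compositions(W, kT):
    """Compositions with g'' = 0, or None if no spinodal exists (kT > W/2 or W <= 0)."""
    if W <= 0.0:
        return None
    disc = 1.0 - 2.0 * kT / W
    if disc < 0.0:
        return None
    root = math.sqrt(disc)
    return 0.5 * (1.0 - root), 0.5 * (1.0 + root)
```

Between the two compositions returned by `spinodal_compositions` both `g_second` and `thermodynamic_factor` are negative, which corresponds to D < 0 and uphill diffusion.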


5.4.2 
Phase Transformations as Diffusion Phenomena 


5.4.2.1 Spinodal Decomposition 

As we discussed above, if $g''(c) < 0$ we expect phase separation to happen
spontaneously by “uphill” diffusion. Cahn and Hilliard (1958) write the free energy of a
crystal as a functional of concentration and concentration gradient²⁰⁾ (cf. Sections
7.5 and 10.2.1):





\[ \Delta G = \int \left[ g(c) - g(c_0) + \kappa\,(\nabla c)^2 \right] \mathrm{d}V \qquad (173) \]


In the classical treatments of spinodal decomposition the diffusion flux density is 
set proportional (by a mobility M) to the gradient of the chemical potential.²¹⁾
Generalizing the chemical potential as a variational derivative of the integrand in 
Eq. (173), the flux density results as 


\[ j(x,t) = -M\,\nabla \left( \frac{\partial g}{\partial c} - 2\kappa\,\nabla^2 c \right) \qquad (174) \]

20) Here we have omitted the term taking account of elastic strain energy.

21) That M is taken to be independent of composition c can be justified in the
vicinity of the critical point but is certainly not true in the general case.

Applying the continuity equation to the atom flux we finally arrive at a diffusion
equation, which is a generalized version of Fick’s second law:





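\[ \frac{\partial c}{\partial t} = \nabla \cdot \left[ M\,\nabla \left( \frac{\partial g}{\partial c} - 2\kappa\,\nabla^2 c \right) \right] \qquad (175) \]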


For small concentration amplitudes this equation becomes linear and is solved by 
sinusoidal concentration modulations growing or decaying exponentially in time. 
For a detailed discussion, see Chapter 7. 
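
The linearized behavior can be made explicit. Assuming a constant mobility M, one spatial dimension, and $g''$ evaluated at the mean composition, inserting a modulation $\cos(\beta x)\,e^{Rt}$ of small amplitude into Eq. (175) gives the growth rate $R(\beta) = -M\beta^2 (g'' + 2\kappa\beta^2)$. For $g'' < 0$ and $\kappa > 0$, modulations with $\beta < \beta_c = \sqrt{-g''/2\kappa}$ grow, the fastest at $\beta_m = \beta_c/\sqrt{2}$. The sketch below evaluates these expressions; the function names are hypothetical.

```python
import math

def growth_rate(beta, g2, kappa, M):
    """Exponential growth rate R of a sinusoidal modulation with wavenumber beta
    (g2 is g'' at the mean composition, kappa the gradient-energy coefficient)."""
    return -M * beta ** 2 * (g2 + 2.0 * kappa * beta ** 2)

def critical_wavenumber(g2, kappa):
    """Largest wavenumber that still grows; requires g2 < 0 and kappa > 0."""
    if g2 >= 0.0 or kappa <= 0.0:
        raise ValueError("growing modulations require g2 < 0 and kappa > 0")
    return math.sqrt(-g2 / (2.0 * kappa))

def fastest_wavenumber(g2, kappa):
    """Wavenumber of maximum growth rate, beta_c / sqrt(2)."""
    return critical_wavenumber(g2, kappa) / math.sqrt(2.0)
```

Outside the spinodal ($g'' > 0$) the growth rate is negative for every wavenumber, so that all small modulations decay.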


Spinodal ordering The long-range ordered state can be seen as a concentration 
fluctuation with a very short wavelength equal to the periodicity of the ordered 
lattice, a so-called ordering wave. Instability of a homogeneous, disordered alloy 
with respect to such a fluctuation with infinitesimal amplitude leads to a behavior 
analogous to spinodal decomposition, appropriately called spinodal ordering (de 
Fontaine 1975; Khachaturyan 1978). Of course, the diffusion jumps connected 
with the growth of a fluctuation with such a short wavelength cannot be ade- 
quately rendered by concepts like continuum diffusion fluxes. In a discrete model 
of spinodal decomposition (Hillert 1961; Cook et al. 1969) the possibility of such 
short-wavelength concentration instabilities was already recognized and it was
remarked that in phase-separating systems the instability was driven by negative
$g''$, originating from the first term in Eq. (175), whereas in spinodal ordering $g''$
is positive, and the instability is driven by negative $\kappa$, the gradient-energy
coefficient. Statistical concepts like the path probability method (PPM) have also been
profitably applied to the kinetics of spinodal ordering: In a computer experiment 
(Mohri 1994) an AB alloy with stoichiometric composition was quenched to a 
temperature below the ordering temperature and an initial ordering composition 
fluctuation imposed. It was demonstrated how above the spinodal ordering tem- 
perature only an initial fluctuation with sufficient amplitude could grow (nuclea- 
tion regime) whereas below the spinodal ordering temperature even a very small 
amplitude of fluctuation spontaneously led to a long-range ordered state (spino- 
dal regime). 


5.4.2.2 Nucleation, Growth, Coarsening 

If the composition of an alloy is in the two-phase field but outside the spinodal 
region, i.e., g”(c) > 0, small-amplitude spontaneous fluctuations decay rather 
than grow, and the kinetics of the phase transformation can no longer be de- 
scribed by macroscopic diffusion flows. Instead we have to consider the jumps in- 
dividually and allow for kinetic paths of nonmaximum probability. This is done in 
classical nucleation theory, which regards a sequence of subcritical clusters (em- 
bryos) where a cluster of size i+ 1 is created by the addition of one atom to a 
cluster of size i. Since the Gibbs free energy of the system increases as a sub- 
critical cluster grows, the net local diffusion flux is always directed outward from 
the cluster. Nevertheless, after a certain incubation time a population of nuclei of 
each size is established so that there is a practically equal flux of growing and 
shrinking nuclei, $N(i)\,\omega_i^{+} = N(i+1)\,\omega_{i+1}^{-}$, where $\omega^{+}$ and $\omega^{-}$ are the probabilities
per unit time of growth and shrinkage. The important point is that although
$\omega_i^{+} < \omega_{i+1}^{-}$, the net flux between cluster size classes is not directed from i + 1
toward i; because N(i) > N(i + 1), the smaller nuclei restore the balance by
sheer numerical superiority. The kinetics of the size distribution function N(i, t)
can be cast in the form of a Fokker-Planck-like generalized diffusion equation in
phase space,


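\[ \frac{\partial N(i,t)}{\partial t} = \frac{\partial}{\partial i} \left[ \omega^{+}(i)\,N_e(i)\,\frac{\partial}{\partial i} \left( \frac{N(i,t)}{N_e(i)} \right) \right] \qquad (176) \]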








where $N_e(i)$ means an ideal equilibrium distribution of nuclei. For a more
detailed discussion including the computation of the nucleation rate (at which the
nuclei grow beyond critical size) as a steady-state solution of Eq. (176), see
Chapter 7.
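
The discrete counterpart of Eq. (176) is easily integrated numerically. In the sketch below the net flux from size class i to i + 1 is $J_i = \omega^{+} N(i) - \omega_{i+1}^{-} N(i+1)$, and the shrinkage probability is fixed by the balance condition $\omega^{+} N_e(i) = \omega_{i+1}^{-} N_e(i+1)$. The form of the cluster energy, the constant growth probability, the closed range of subcritical sizes, and all names are assumptions made only for this illustration; the nucleation rate itself is not computed.

```python
import math

def equilibrium_distribution(n_max, a=1.5, b=0.3):
    """Hypothetical N_e(i) = exp(-dG(i)/kT) with dG(i)/kT = a*i**(2/3) - b*i."""
    return [math.exp(-(a * i ** (2.0 / 3.0) - b * i)) for i in range(1, n_max + 1)]

def relax_cluster_sizes(n_eq, omega_plus=1.0, dt=0.1, steps=20000):
    """Explicit integration of dN(i)/dt = J(i-1) - J(i) in a closed range of sizes.
    List index k stands for cluster size k + 1."""
    n_max = len(n_eq)
    # balance condition: shrinkage rate of size i+1 follows from growth rate of size i
    omega_minus = [omega_plus * n_eq[k] / n_eq[k + 1] for k in range(n_max - 1)]
    N = [1.0] + [0.0] * (n_max - 1)   # start with single atoms only
    for _ in range(steps):
        J = [omega_plus * N[k] - omega_minus[k] * N[k + 1] for k in range(n_max - 1)]
        for k in range(n_max - 1):
            N[k] -= dt * J[k]         # clusters leaving size class k+1 ...
            N[k + 1] += dt * J[k]     # ... arrive in size class k+2
    return N
```

After a sufficiently long time the ratio $N(i)/N_e(i)$ becomes the same for all sizes: the fluxes of growing and shrinking clusters balance although the shrinkage probability exceeds the growth probability in every size class.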

Once supercritical nuclei have been created, their growth takes place by the 
usual macroscopic diffusion. In a steady-state regime a parabolic growth law is 
followed as long as the depletion zones around the growing particles do not yet 
impinge on one another. For spherical particles the radius then grows as 
$R \propto \sqrt{Dt}$.

When the equilibrium volume fraction of the precipitate phase has been
reached, large particles grow at the expense of smaller ones, exchanging atoms
via the matrix phase. The driving force of this coarsening process is interfacial
energy. In the classical treatments of Lifshitz and Slyozov (1961) and Wagner (1961)
(LSW theory), the particle size distribution function f(r) (r being the particle
radius) is shown to reach under certain conditions a normalized standard form.
The average particle size then grows as $\bar{r} \propto t^{1/3}$. The $t^{1/3}$ law changes to other
power law dependencies if coagulation and splitting of precipitate particles are
enabled (Binder and Heermann 1985). For a detailed discussion, again see
Chapter 7.

Summing up this section, we recognize that, in the phase transformation
scenarios mentioned, (a) local diffusion fluxes may very well be directed up the
concentration gradient; (b) for nucleation, atom jumps have to be considered as to
their effect upon the cluster size distribution function instead of taking local
averages; and (c) cluster and precipitate size distribution functions evolve in phase
space according to generalized diffusion equations.


5.4.3 
Enhanced Diffusion Paths 


Whereas a homogeneous and infinitely extended single crystal containing only 
point defects is a welcome simplification for the theoretical treatment of diffu- 
sion, real materials rarely meet such a condition. What is more, the extended de- 
fects that they are endowed with not only produce correction terms to the ideal 
behavior but very often they constitute fast diffusion paths (short-circuit paths)
which exceed bulk diffusion by several orders of magnitude and thus influence in
a qualitative and quantitative manner the diffusion properties. 

Mechanisms of diffusion along extended defects are less thoroughly under- 
stood than volume diffusion, and there is often no obvious connection between 
the structure of a defect and the rate of diffusion within it. In any case, it is plau- 
sible that in regions of the crystal where the lattice structure is loosened or bro- 
ken up atom transport may happen more easily. In experiments, defect diffusion 
is always intertwined with volume diffusion taking place simultaneously, and is 
therefore difficult to separate from it. An early treatment of this interconnection 
by Hart (1957) relied on random-walk arguments and resulted in the hardly sur- 
prising formula for an effective diffusion coefficient 


\[ D_{eff} = (1 - f)\,D_V + f\,D_D \qquad (177) \]


with a volume fraction f of the defect(s), a diffusivity $D_D$ in the defected volume,
and a diffusivity $D_V$ in the matrix volume.

Solute tends to segregate toward defects, especially high-angle grain
boundaries, so that in equilibrium the concentrations in the defect and the matrix
volume are related by a segregation coefficient $k = C_D/C_V$, entering the equations
whenever we consider solute diffusion by defects (as distinguished from self- 
diffusion or tracer diffusion). 

An important distinction between moving and static defects should not be over- 
looked. Often defects such as dislocations transport solute atoms into a matrix 
where they have to be distributed subsequently by volume diffusion. If the defects 
are at rest, a region of higher solute concentration builds up in the matrix sur- 
rounding the defect, thus hindering effectively the flow of the solute along the de- 
fect. A moving defect, on the other hand, is able to shed solute copiously into 
ever-fresh crystal volume, or to effect fast phase transformation where pure vol- 
ume diffusion would be too slow. Driving forces exerted by concentration fields 
lead to chemically induced dislocation or grain boundary motion, which is also 
part of the mechanism of discontinuous precipitation (Purdy 1990). 


5.4.3.1 Dislocation-Core Diffusion 


Computer simulation studies Diffusion along dislocations (pipe diffusion) can be
imagined as taking place along a cylinder with a radius of a few ångströms where
the lattice is widened, especially in edge dislocations. Experimental access to pure
defect diffusion is notoriously difficult. For an explanation and a quantitative
understanding of the diffusion mechanism along dislocations, atomistic
calculations, and especially Monte Carlo and MD simulations, have turned out to be
indispensable. MD simulations of diffusion along dissociated edge dislocations in
copper were made by Huang et al. (1989, 1990), a kinetic Monte Carlo study for
the same material using an EAM potential was done by Hoagland et al. (1998),
and a hyper-MD simulation by Fang and Wang (2000). These works were
critically discussed recently by Vegge and Jacobsen (2002), who performed a nudged
elastic-band (NEB) study on the same system.


Fig. 5.44 Dislocation pipe diffusion model by Le Claire and
Rabinovitch: solute is transported by volume diffusion with coefficient
$D_V$ and pipe diffusion with $D_p$. Dislocations are modeled as cylinders
of radius a set at mutual distance 2L normal to the surface in a
hexagonal array through which the figure shows a cut. Surface
concentration is $C_0$, average solute concentration in a thin slice at
depth y is <c(y)> (Purdy 1990).


Experimental determination A well-established method of determining diffusion
coefficients experimentally is to apply a thin coating of the diffusing species
(often a radioactive tracer atom) to the surface of the specimen. After diffusion
annealing, the specimen is cut into thin slices. Measuring the amount of
diffusing species that has penetrated to depth y then allows evaluation of the diffusivity
according to the appropriate solution [Section 5.4.1.1, Eq. (151)] of Fick’s second
law. Striving to adapt this method to the case where dislocations modify diffusion
behavior, Le Claire and Rabinovitch (1981, 1982, 1983, 1984) in a series of papers
developed a classical model of this situation, which is depicted in Fig. 5.44. Under
the condition that at the boundaries between dislocations and matrix both the
concentrations and the diffusion fluxes are continuous,²²⁾ analytical solutions
could be obtained for the two standard cases: (a) a constant surface
concentration and (b) a thin surface film containing a specified initial amount of the
diffusing species per unit area. In plots of <c(y)> vs. normalized penetration depth
$\eta = y/\sqrt{Dt}$ the dislocation contribution shows up as characteristic linear “tails”
as in Fig. 5.45, the slope of which is a function of the ratio of diffusion
coefficients $D_V$ (volume) and $D_p$ (dislocation core, i.e., pipe). Knowing $D_V$, the tail
can therefore be used to estimate dislocation pipe diffusivity, which in the case
of solute diffusion contains the segregation coefficient k (Le Claire and
Rabinovitch 1984).


22) If we regard solute diffusion, due to the segregation factor k the
continuity condition at the boundary reads $C_D = k C_V$ instead of $C_D = C_V$.


Fig. 5.45 Average concentration vs. penetration depth plot of an
annealing experiment with boundary condition (b), showing diffusion of
Ga atoms in a single crystal of germanium (Ahlborn 1979). The linear
tail shows the contribution of dislocation pipe diffusion.


5.4.3.2 Grain-Boundary Diffusion 

Grain boundaries can be distinguished qualitatively by the amount of misorienta- 
tion between the grains.”? Symmetric tilt boundaries with up to about 15° mis- 
orientation (low-angle grain boundaries) can be represented by equivalent arrays 
of edge dislocations (Fig. 5.46), where the spacing 4 of dislocations with Burgers 
vector b is connected to the tilt angle 0 by A = b/[2 sin(0/2)]. The diffusion prop- 
erties of these grain boundaries can be broken down into those of the correspond- 
ing dislocation pipes with the boundary diffusion coefficient D, = D, /A and the 


23) For a review on grain boundaries see, for instance, Gleiter (1982). 
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array of edge dislocations with spacing 4. After 


Fig. 5.46 Small-angle tilt boundary consisting of an 
| Bocquet et al. (1983). 


[100] [100] 


[ool] , 


obvious consequence that diffusion within the boundary along the dislocation 
pipes (in the direction of the tilt axis) is faster than in a direction normal to 
them. As can be seen from these simple relationships, diffusivity in low-angle 
grain boundaries is generally proportional to the misorientation angle. 
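
The proportionality follows from the spacing relation quoted above: for small tilt angles the dislocation spacing falls off as one over the angle, so the number of pipes per unit boundary length grows linearly with it. A minimal numerical sketch, assuming an illustrative Burgers vector of 0.25 nm (the function name and the numbers are ours, not taken from the text):

```python
import math

def dislocation_spacing(b, theta_deg):
    """Spacing of edge dislocations in a symmetric tilt boundary,
    lambda = b / (2 sin(theta / 2)), with theta given in degrees."""
    theta = math.radians(theta_deg)
    return b / (2.0 * math.sin(theta / 2.0))

b = 0.25e-9  # assumed Burgers vector length in metres (illustrative value)

for angle in (1.0, 2.0, 4.0, 8.0):
    lam = dislocation_spacing(b, angle)
    # Pipes per unit length of boundary scale as 1 / lambda.
    print(f"theta = {angle:4.1f} deg   spacing = {lam / b:6.1f} b")
```

Doubling the angle halves the spacing to a very good approximation over the whole low-angle range, which is the sense in which the boundary diffusivity scales with misorientation.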

For larger misorientation, the cores within the grain boundaries begin to over- 
lap, and the grain boundary structure, although periodic, can no longer be de- 
scribed in terms of lattice dislocations. The detailed structure has been unraveled 
by atomistic simulation (Vitek 1984), in some instances supported by high- 
resolution electron microscopy. Diffusivities in high-angle grain boundaries of a 
given system tend to be in the same range, especially if the orientations do not 
correspond to high coincidence, and as a rule of thumb the activation enthalpy 
may be assumed to be roughly one half of the value in the bulk phase (Peterson 
1983). 

As regards experimental measurement, a standard geometry of combined grain
boundary/volume diffusion has been treated by Whipple (1954) after an approximate
solution by Fisher (1951) according to Fig. 5.47: diffusion takes place from a
surface where a constant concentration $c_0$ is being maintained, into a bicrystal
with a grain boundary of width $\delta$ normal to the surface. Taking again the average
concentration $\langle c(y)\rangle$ of a slice at distance $y$ from the surface, the experimental
results are most conveniently plotted in the form of $\langle c\rangle$ versus $y^{6/5}$ curves, in
which the contribution of grain boundary diffusion is visible as linear tails, from
which the grain boundary diffusion coefficient can be evaluated (Whipple 1954).

In conclusion, it can be stated that according to a distinction by Harrison (1961)
short-circuit networks of dislocations or grain boundaries can contribute to diffusion
in one of three regimes. (a) When the bulk diffusion penetration depth $\sqrt{Dt}$
is larger than a characteristic length $L$ of the network (grain size or link length of
a dislocation network) then the diffusion zones of neighboring short circuits overlap
and Hart’s formula, Eq. (177), is valid. (b) When there is bulk diffusion but
the penetration depth is smaller than $L$ the “tails” in a $\langle c\rangle$ versus $y$ plot (cf. Fig.
5.45) can be used to deduce defect-diffusion properties. (c) When bulk diffusion is
completely negligible but the penetration depth along the network is larger than
Fig. 5.47 Iso-concentration plots for diffusion into a bicrystal
containing a grain boundary of width $\delta$ (Whipple 1954).


$L$, the concentration profile is similar to bulk diffusion with an effective diffusivity
$D' \propto D_p/L^2$ instead of $D_V$.


5.4.3.3 Diffusion along Interfaces and Surfaces 

Inner surfaces such as those along a crack or a void can offer short-circuit diffu- 
sion paths and may be treated formally along the lines of grain-boundary diffu- 
sion (cf. Fig. 5.47) where one of the grains is missing, if a reasonable value for 
the thickness of the surface layer can be chosen. 

The diffusion properties at free surfaces play a pre-eminent role in vacuum
physics and for epitaxial growth processes. At the same time, surface structures 
are ideally accessible by modern analytical techniques such as scanning tunneling 
microscopy (STM), atomic force microscopy (AFM) and updated versions of field- 
ion microscopy (FIM). Surface diffusion is largely determined by the geometry of 
the surface, which in turn depends on its orientation. Figure 5.48 shows the rele- 
vant structures of a surface near a low-index orientation. 

The surface consists of terraces (T) along close-packed planes. The deviation of 
the surface from a close-packed plane is accommodated by ledges (L) of atomic 
height which themselves contain kinks (K). Single ad-atoms can be created by de- 
taching them from a kink or a ledge or extracting them from a terrace, leaving 
behind an ad-vacancy. The first of these processes is by far the most likely to
become thermally activated. Formation energies of both defects are comparable for
most orientations, so both contribute to mass transport at the surface to a
significant extent. Migration energies for fcc crystals increase approximately with
surface roughness and are therefore higher for less close-packed surfaces. It 
must be remembered, however, that migration is quite anisotropic. It is markedly 
Fig. 5.48 Terrace (T)–ledge (L)–kink (K) model of a crystal surface. The
processes of creating an ad-atom out of a kink or out of a terrace with 
the simultaneous creation of an ad-vacancy are shown. After Bocquet 
et al. (1983). 


easier along dense rows or channels than in a direction crossing them. Above a
certain temperature $T_R = 0.5\,\varepsilon/k_B$, where $\varepsilon$ is the energy of the nearest-neighbor
bond, ledges can be created freely by thermal activation and the surface becomes
delocalized (roughness transition; Leamy and Gilmer 1974). Motion of atoms at
the surface takes place by single-atom jumps if the temperature is sufficiently
low ($T < 0.15\,T_m$). At higher temperatures new mechanisms come into play,
such as correlated and collective jumps of multidefects, and finally
evaporation-condensation-like scenarios, where the moving atom is flying over the surface.

An overview of recent developments in surface diffusion research can be found 
in a monograph (Tringides 1996). Experimental approaches to measuring the sur- 
face diffusion coefficient fall into either of two large classes. In one of them, the 
random walk of single ad-atoms is followed by direct imaging methods such as 
FIM and STM, and the tracer diffusion coefficient is derived from the mean
square distance covered by them during time $t$, $D_s = \langle R^2\rangle/(4t)$, in direct analogy
to Eq. (101) for the three-dimensional case. The second large class of methods
deduces surface diffusivity from the collective behavior of many atoms. Due to the
interactions between ad-atoms a “collective” or interdiffusion coefficient is thus 
obtained [cf. Eq. (168)]. A multitude of methods falls into this class; see the table 
in the introductory chapter of Tringides (1996). It must be remembered in any 
case that surface diffusion is extremely sensitive to impurities, which must there- 
fore be kept at a low level. 
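
The first class of methods can be mimicked with a toy random walk. The sketch below is an illustration only: it assumes uncorrelated nearest-neighbor jumps on a square lattice (real surfaces are anisotropic, as noted above), and the function names, lattice and parameter values are ours. It evaluates the tracer coefficient from the mean square displacement divided by four times the elapsed time.

```python
import random

def simulate_msd(n_walkers, n_jumps, jump_length, seed=0):
    """Mean square displacement of independent ad-atoms performing
    uncorrelated nearest-neighbor jumps on a square lattice."""
    rng = random.Random(seed)
    steps = [(1, 0), (-1, 0), (0, 1), (0, -1)]
    total = 0.0
    for _ in range(n_walkers):
        x = y = 0
        for _ in range(n_jumps):
            dx, dy = rng.choice(steps)
            x += dx
            y += dy
        total += (x * x + y * y) * jump_length ** 2
    return total / n_walkers

def surface_diffusivity(msd, t):
    """Tracer surface diffusion coefficient, D_s = <R^2> / (4 t)."""
    return msd / (4.0 * t)

# Assumed illustrative numbers: jump length 0.25 nm, jump rate 1e3 per second.
a, rate, n_jumps = 0.25e-9, 1.0e3, 100
msd = simulate_msd(n_walkers=2000, n_jumps=n_jumps, jump_length=a)
t = n_jumps / rate
print(f"D_s from simulation : {surface_diffusivity(msd, t):.3e} m^2/s")
print(f"D_s = rate * a^2 / 4: {rate * a * a / 4.0:.3e} m^2/s")
```

For an uncorrelated walk the two printed values agree within statistical scatter; correlated or collective jumps would show up as a deviation from the simple rate estimate.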

Mass transport along interfaces and surfaces is only one of the many processes 
by which atoms interact with surfaces and thus has many cross-references with 
neighboring topics like gas adsorption and desorption, growth of solids from liq- 
uid or gas phases, mixing and segregation of thin layers with substrates or of 
multiple layers, establishment of ordered structures, and development of nano- 
structures in various environments and conditions. There is an immense and 
steadily growing literature on all these questions to which we cannot make
reference in this concise chapter on diffusion in alloys, and which should therefore be
sought under the headings of surface science, nanostructures, and phase trans- 
formations. 


5.5 
Outlook 


After about a century of research on diffusion, we can claim a fairly good under- 
standing of diffusion mechanisms in solids provided the conditions are not too 
exotic. This means that established methods generally rely on aggregates of 
atoms whose number is large enough to apply classical statistical mechanics, 
and field gradients small enough to allow linearization and expansion. In the 
treatment of atomistic diffusion processes, some kind of adiabatic approximation 
such as the Born-Oppenheimer behavior of electrons with respect to the ions 
or the steady maintenance of equilibrium in large parts of the system, even in a 
transition state, is usually invoked. 

Modern developments of the theory will therefore go together with the succes- 
sive removal of these restrictions and the introduction of new paradigms and 
principles. One recent example of this evolution is the introduction of the nudged 
elastic band method of path optimization instead of calculating the migration 
profile along a fixed path. Another example is ab-initio molecular dynamics in 
those cases where a static profile calculation will not suffice, for instance in 
highly nonequilibrium situations, for weakly bound atoms at surfaces, and in ca- 
talysis. 

Soaring computer power has put kinetic simulation of comparatively large, 
complex structures comprising millions of atoms within easy reach, and progress 
in ab-initio calculations delivers the necessary potentials. The path has been 
opened to treat diffusion in special environments such as grain boundaries, dis- 
locations, and complex nanostructures. More economical simulation techniques 
for KMC further increase the ability to handle large aggregates, e.g., by exploiting 
statistics to find shortcuts, and more realistic jump frequencies based on a more 
detailed knowledge about jump paths are on the verge of being introduced 
(Pfeiler et al. 2005). In the mesoscale range, advanced phase field methods enable 
the treatment of intricate structure evolutions in systems with rich phase trans- 
formation behavior. 

The application of simulation methods is already ubiquitous in present-day re- 
search, and it is to be expected that these tools will go on solving very specific dif- 
fusion problems in the emerging materials of interest, among which nanostruc- 
tures and thin-layer structures seem to be most important in the near future. It 
is very clear that with nanoscale microstructures where the atomistic nature is 
beginning to be felt, continuum-based calculations must give way to customized 
models based on individual atom jumps, making simulation methods such as 
MC and MD a natural choice. Ever more realistic simulation of complex problems 
in realistic materials will undoubtedly dominate diffusion theory in the foreseeable
future, although some surprising effects in seemingly well-understood “classical”
diffusion scenarios may still lie in wait, like the unexpected sharpening of
concentration profiles due to concentration dependence of the diffusion coefficient
which was discovered recently (Erdelyi et al. 2004).

Whereas in past years diffusion research centered on the determination of fun- 
damental parameters such as diffusion coefficients, activation energies, and cor- 
relation factors, attention during the last 15 years or so has shifted toward the 
treatment of complex processes. The topics involve multicomponent systems, dif- 
fusion flows between different phases in various shapes, and the interaction with 
other physical properties, e.g., diffusion under mechanical stress, under the influ- 
ence of magnetic fields, and concomitant with chemical reaction. A recent confer- 
ence volume (Danielewski et al. 2005) reflects this shift of research interest to a 
remarkable diversity of applied problems. 

On the experimental side, modern development is characterized by a steady in- 
crease in new methods appearing as new acronyms in the scientific language, 
which give insight into diffusion processes on ever smaller spatial and temporal
scales. Many of them are scattering methods with nuclear particles or radiation, 
such as neutron spin echo (NSE) spectroscopy, quasi-elastic neutron scattering 
(QNS), and nuclear resonant scattering (NRS), or are based on nuclear reaction, 
such as quasielastic Mössbauer scattering (QMS), positron annihilation, and
perturbed angular correlation (PAC). The burgeoning field of surface science is
contributing a host of modern analysis techniques which can be chosen according to
the requirements of the alloy system and the boundary conditions, such as field
ion microscopy (FIM), atomic force microscopy (AFM), scanning tunneling
microscopy (STM), magnetic force microscopy (MFM), photoemission electron
microscopy (PEEM), scanning Auger microscopy (SAM), secondary-ion mass
spectrometry (SIMS), scanning electron microscopy (SEM), and low-energy electron
diffraction (LEED). Modern microscopic tools such as high-resolution transmission
electron microscopy (HRTEM) offer glimpses of configuration changes and
phase transformations at unprecedented resolution and will continue to be
improved. Without doubt, powerful radiation sources such as high-flux reactors 
or synchrotrons contribute to the expansion of experimental ability. They also 
open a temporal window on the very initial stages of phase transformations or 
ordering when the scattering amplitude is still very weak (time-resolved X-ray 
scattering). 

Owing to computer and software progress and a cross-fertilization from the 
biomedical sciences, tomography by X-rays or neutrons has gained a firm stand- 
ing in materials science also, offering a more integral view on the evolving micro- 
structure in three dimensions. 

As a conclusion, we venture to state the view that the basic problems of diffu- 
sion in alloys may be considered as solved by now. It is in complex, not so famil- 
iar situations found in real materials, where many parameters interact, and in 
submicroscopic detail that exciting discoveries are to be expected in the future. 
6
Dislocations and Mechanical Properties 


Daniel Caillard 


6.1 
Introduction 


It has been known for a long time that metals and alloys — and more generally all 
crystalline materials — deform by shear along dense lattice planes and dense lat- 
tice directions. This property was easily interpreted by assuming that the shear 
displacement is a multiple of one of the smallest translation lattice vectors. It 
was also demonstrated that a global shear would require a very high stress, of 
the order of $\mu/2\pi$, where $\mu$ is the shear modulus, i.e., several tens of gigapascals
(GPa). Under such conditions, realistic deformation stresses could be explained 
only by introducing dislocations. 

Plastic deformation can be described in terms of dislocation motion, multipli- 
cation, annihilation, and storage. However, the multiplicity of the microscopic 
mechanisms involved, and the rather long-range elastic interactions between 
neighboring dislocations, make a complete calculation of collective dislocation
behavior almost impossible. Fortunately, the mechanical properties are usually
dominated by a few mechanisms, so that reasonable approximations
can be made which yield surprisingly good results.

Dislocations can be introduced in any material by the Volterra process de- 
scribed in Fig. 6.1. The solid is cut along the surface S ending along the curved 
line d. The two lips are then displaced by a uniform translation b, and pasted to- 
gether again. Note that the latter process generally requires the addition (or the 
removal) of atomic layers to maintain the continuity of the structure across the 
cut. All the elastic distortions are then concentrated around the line d, which is 
called a dislocation with Burgers vector b. In crystals, the cut surface is perfectly 
healed, and the corresponding dislocation is perfect, if b is a translation vector of 
the lattice. If b is not a translation vector a fault is created along the cut surface 
and the dislocation is imperfect. Figure 6.1 also shows that the motion of the dis- 
location increases the size of the cut surface, which increases the total amount of 
plastic deformation, $\varepsilon$. Planar dislocation motion involving only shear
displacements (b parallel to the cut plane, which is also the plane of motion) is called
Fig. 6.1 Creation of a dislocation d by the Volterra process.


“glide,” whereas motion involving addition or removal of matter is called “climb” 
(b perpendicular to the plane of motion) or “mixed climb” (intermediate situa- 
tions). 

The plastic strain rate, projected in the plane of dislocation motion and along
the Burgers vector direction, $\dot\varepsilon$, can then be expressed by the Orowan law [Eq.
(1)], where $\rho$ is the dislocation density (in m$^{-2}$), $b$ the Burgers vector, and $v$ the
average velocity of the moving dislocations.


$$\dot\varepsilon = \rho b v \qquad (1)$$


The determination of dislocation velocities is the key point of the study of plastic- 
ity mechanisms. Dislocation velocity is controlled by two types of obstacles. 
• Long-range elastic interactions with extended obstacles, such
as elastic stress fields of other dislocations, precipitates etc.:
generally these are alternately attractive and repulsive, in the
form of an oscillating stress $\tau_i(x)$ of amplitude $\tau_i$ (Fig. 6.2a).
$\tau_i$ is called the “internal stress.”
• Short-range interactions with local energy barriers: if $\tau$ is the
applied stress,¹⁾ the effective stress available to overcome
these barriers is $\tau_{\mathrm{eff}}(x) = \tau - \tau_i(x)$. Where the resistance
against dislocation movement exerted by the local stress is a
maximum, a minimum velocity results. Since the average
dislocation velocity is only a few times larger than its
minimum value,²⁾ we can assume that Eq. (2) holds.

$$v = v(\tau_{\mathrm{eff}}) \quad \text{with} \quad \tau_{\mathrm{eff}} = \tau - \tau_i \qquad (2)$$


Fig. 6.2 Schematic description of the local stress as a function of displacement.

Short-range interactions, which are largely responsible for the dependence of the 
deformation stress on temperature and strain-rate, are the subject of Section 6.2. 
Long-range interactions, which are the result of the history of the sample (ther- 
mal treatments and deformation-induced dislocation structure) are described in 
Section 6.3. The latter are especially important in fatigue, discussed in Section 
6.4.2. Both types of interactions generally tend to decrease at increasing tempera- 
ture, which accounts for the usual loss of mechanical strength at increasing tem- 
perature. However, the reverse behavior, called “stress anomaly,” is sometimes 
observed in ordered intermetallic alloys. This will be discussed in Section 6.4.1. 
Other properties, such as small-size effects (Section 6.4.3) are also directly related 
to long-range and short-range interactions. 


6.2 
Thermally Activated Mechanisms 


6.2.1 
Introduction to Thermal Activation 


Short-range interaction of dislocations with energy barriers takes place in such a
small volume (a few hundred atoms only) that it is strongly influenced by
thermal vibrations. Such “thermal activation” helps dislocations to overcome
barriers of height ranging between a fraction of an electronvolt and a few
electronvolts. This results in a reduction of the effective stress $\tau_{\mathrm{eff}}$ necessary to reach
a given dislocation velocity $v(\tau_{\mathrm{eff}})$ and average strain rate $\dot\varepsilon = \rho b v$, as the
temperature increases. These short-range thermally activated processes govern almost all
the temperature-dependent mechanical properties of materials. The reader can
refer to Schöck (1980) for a complete description of the theory of thermal
activation, and to Caillard and Martin (2003) for a description of the thermally activated
dislocation mechanisms in crystals.


1) All stresses are expressed in terms of local or average shear stresses. The
applied shear stress $\tau$ can be deduced from the normal applied stress by
multiplication with the Schmid factor, which is usually between 0.2 and 0.5.

2) The proportionality coefficient is the total displacement distance divided by
the distance at which the local stress is close to its minimum value.

Although plastic deformation is an irreversible process, because a dislocated
sample is not in equilibrium, the microscopic processes of dislocation motion
can be analyzed in terms of Eyring thermodynamics of viscous flow (Eyring
1936). If $\nu$ is the vibration frequency of a dislocation segment held up by the
stress against an energy barrier, the probability per unit time of jumping over
the barrier can be expressed using a Boltzmann factor as Eq. (3), where $k_B$ is the
Boltzmann constant, $T$ is the absolute temperature, $\Delta G$ is the variation of Gibbs
energy during the process (commonly called the activation energy), and $\nu$ is the
attempt frequency.


$$P = \nu \exp\left(-\frac{\Delta G}{k_B T}\right) \qquad (3)$$


Taking $\nu = \nu_D\, b/l$, where $\nu_D$ is the Debye frequency ($\nu_D \approx 10^{13}$ s$^{-1}$) and $l$ is the
wavelength of the vibrating dislocation, and considering that the distance
between obstacles in the direction of motion is $y$, the average dislocation velocity
can be written as Eq. (4).


$$v = \nu_D \frac{b}{l}\, y \exp\left(-\frac{\Delta G}{k_B T}\right) \qquad (4)$$

The increase in velocity with increasing stress lies in the decrease of $\Delta G$ with
increasing effective stress $\tau_{\mathrm{eff}}$, defined in Section 6.1 [Eq. (2)].

Defining the activation volume $V$ as in Eq. (5), the activation energy can indeed
be written as Eq. (6), where $\Delta G_0$ is the energy barrier at zero stress, and $\tau_{\mathrm{eff}} V$ is
the work done by the effective stress during the crossing event. $V$ represents $b$
times the area swept by the dislocation during this event.


$$V = -\left(\frac{\partial \Delta G}{\partial \tau_{\mathrm{eff}}}\right)_T \qquad (5)$$


$$\Delta G = \Delta G_0 - \tau_{\mathrm{eff}} V \qquad (6)$$
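
To get a feeling for how strongly the velocity responds to stress and temperature, the velocity law and the linear stress dependence of the activation energy can be combined numerically. This is a minimal sketch: it assumes the velocity law in the form "Debye frequency times b/l times y times a Boltzmann factor", and every numerical value (barrier height, activation volume, lengths, dislocation density) is an assumed illustrative input rather than data from the text.

```python
import math

K_B = 1.380649e-23    # Boltzmann constant, J/K
EV = 1.602176634e-19  # one electronvolt in joules

def activation_energy(dg0, tau_eff, volume):
    """Linear stress dependence of the barrier: dG = dG0 - tau_eff * V."""
    return dg0 - tau_eff * volume

def dislocation_velocity(tau_eff, temperature, dg0, volume,
                         b=2.5e-10, wavelength=2.5e-8,
                         obstacle_distance=2.5e-8, nu_debye=1.0e13):
    """Thermally activated velocity, v = nu_D (b / l) y exp(-dG / k_B T)."""
    dg = activation_energy(dg0, tau_eff, volume)
    prefactor = nu_debye * (b / wavelength) * obstacle_distance
    return prefactor * math.exp(-dg / (K_B * temperature))

def strain_rate(rho, b, v):
    """Orowan law: strain rate = rho * b * v."""
    return rho * b * v

b = 2.5e-10         # assumed Burgers vector, m
dg0 = 1.0 * EV      # assumed zero-stress barrier
vol = 100 * b ** 3  # assumed activation volume of 100 b^3

for tau in (20e6, 40e6, 60e6):  # effective stress in Pa
    v = dislocation_velocity(tau, 300.0, dg0, vol, b=b)
    print(f"tau_eff = {tau / 1e6:4.0f} MPa   v = {v:.3e} m/s   "
          f"strain rate = {strain_rate(1e12, b, v):.3e} 1/s")
```

Because the stress enters through the exponent, a modest change in effective stress shifts the velocity by orders of magnitude when the activation volume is large; this sensitivity is what makes the activation volume such a discriminating quantity.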





The activation energy is deduced from the slope of Arrhenius plots,
$\ln v = f(1/T)$. The slope yields the activation enthalpy, $\Delta H$, which
is related to $\Delta G$ by Eq. (7), where $\Delta S = -\left(\partial \Delta G/\partial T\right)_{\tau_{\mathrm{eff}}}$ is the activation entropy.


$$\Delta H = \Delta G + T \Delta S \qquad (7)$$


The estimation of $\Delta S$ is a key step toward the determination of $\Delta G$ from $\Delta H$. It is
generally considered that the main contribution to $\Delta S$ is the variation of the elastic
shear modulus $\mu$ with temperature. Under such conditions, Schöck (1980) has
derived the following relation to deduce $\Delta G$ from $\Delta H$:


$$\Delta G = \frac{\Delta H + \dfrac{T}{\mu}\dfrac{d\mu}{dT}\,\tau_{\mathrm{eff}} V}{1 - \dfrac{T}{\mu}\dfrac{d\mu}{dT}} \qquad (8)$$

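
A relation of this kind can be motivated in a few lines. The sketch below assumes, for illustration only, that the barrier scales with the shear modulus as $\Delta G = \mu(T)\, g(\tau_{\mathrm{eff}}/\mu)$ with an unspecified function $g$; this scaling form is our assumption and may differ in detail from the original derivation.

```latex
\begin{align*}
\Delta G &= \mu(T)\, g\!\left(\tau_{\mathrm{eff}}/\mu\right), \qquad
V = -\left(\frac{\partial \Delta G}{\partial \tau_{\mathrm{eff}}}\right)_T = -g', \\
\Delta S &= -\left(\frac{\partial \Delta G}{\partial T}\right)_{\tau_{\mathrm{eff}}}
  = -\frac{1}{\mu}\frac{d\mu}{dT}\left(\Delta G + \tau_{\mathrm{eff}} V\right), \\
\Delta H &= \Delta G + T\,\Delta S
  = \Delta G\left(1 - \frac{T}{\mu}\frac{d\mu}{dT}\right)
  - \frac{T}{\mu}\frac{d\mu}{dT}\,\tau_{\mathrm{eff}} V .
\end{align*}
```

Solving the last line for $\Delta G$ gives the enthalpy plus the modulus correction term in the numerator, divided by one minus $(T/\mu)\,d\mu/dT$. Since the modulus normally decreases with temperature, the denominator is larger than one and $\Delta G$ comes out smaller than $\Delta H$.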
Dislocation mechanisms can in principle be identified by activation energy
measurements. However, writing $\Delta G$ in the inverse form [Eq. (9)] deduced from Eqs.
(1) and (4), and considering that the logarithm term ranges between 20 and 30 in
the usual experiments, one can see that $\Delta G$ adjusts itself to a value depending
essentially on $T$ [Eq. (9)], whatever the controlling mechanisms.


$$\Delta G = k_B T \ln\left(\frac{\rho b^2 \nu_D\, y}{l\, \dot\varepsilon}\right) \qquad (9)$$


It is thus usually impossible to identify dislocation mechanisms through the de- 
termination of AH and AG only. A more suitable parameter is the activation vol- 
ume V, which can vary by several orders of magnitude for different controlling 
mechanisms. 
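
The size of the logarithm can be checked with typical laboratory numbers. The sketch below assumes the logarithm argument to be the dislocation density times the squared Burgers vector times the Debye frequency times the ratio y/l, divided by the strain rate; all input values are assumed order-of-magnitude figures, not data from the text.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K

def log_term(rho, b, nu_debye, y_over_l, strain_rate):
    """Logarithm ln(rho b^2 nu_D (y / l) / strain_rate)."""
    return math.log(rho * b ** 2 * nu_debye * y_over_l / strain_rate)

def activation_energy_ev(temperature, rho, b, nu_debye, y_over_l, strain_rate):
    """Activation energy in eV selected by the imposed strain rate."""
    return K_B_EV * temperature * log_term(rho, b, nu_debye, y_over_l, strain_rate)

# Assumed typical values: rho = 1e12 m^-2, b = 0.25 nm, nu_D = 1e13 1/s,
# y / l = 1, imposed strain rate = 1e-4 1/s.
params = dict(rho=1e12, b=2.5e-10, nu_debye=1e13, y_over_l=1.0, strain_rate=1e-4)

print(f"logarithm term: {log_term(**params):.1f}")
for temperature in (100.0, 300.0, 600.0):
    dg = activation_energy_ev(temperature, **params)
    print(f"T = {temperature:5.0f} K   activation energy = {dg:.2f} eV")
```

Changing the density or the strain rate by a factor of ten shifts the logarithm by only about 2.3, so the activation energy probed in an experiment is set almost entirely by the test temperature.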

The most important thermally activated mechanisms are now described in
more detail. In this section (6.2), which is devoted to thermally activated
mechanisms, the effective stress $\tau_{\mathrm{eff}}$ is often considered equal to the total stress $\tau$, which
is justified at small deformation where the internal stress $\tau_i$ is small.


6.2.2 
Interactions with Solute Atoms 


6.2.2.1 General Aspects 

Four domains can generally be identified in the stress-temperature curves of di- 
lute alloys, as shown in Fig. 6.3. Domain 1, which involves inertial effects due to 
very high dislocation velocities, has been described by Granato (1971). Domain 2 
corresponds to dislocation interaction with fixed solute atoms and will be de- 
scribed in Section 6.2.2.2. Domain 3 is characterized by stress instabilities (Sec- 
tion 6.2.2.3) and domain 4 is controlled by the dragging of solute atmospheres 
surrounding dislocations (Section 6.2.2.4). 

The main contribution to dislocation-solute interaction is the difference in
atomic volume between solvent and solute, also called the size effect. In polar
coordinates, the radial elastic interaction force between a pure edge dislocation and
a solute atom can be written as Eq. (10) (Haasen, 1979), where $\Delta\Omega$ is the local
change of atomic volume due to a solute atom, $\theta = 0$ in the direction of the
Burgers vector, $\theta = \pi/2$ in the direction perpendicular to the slip plane, and $r$ is the
distance between the solute atom and the dislocation.


$$F_{\mathrm{int}} = \frac{\mu b}{\pi}\, \Delta\Omega\, \frac{\sin\theta}{r^2} \qquad (10)$$
Fig. 6.3 Deformation shear stress of CuAl alloys as a function of
temperature (from a review by Neuhauser and Schwink, 1993). 


The maximum value of this interaction force at one interatomic distance is
given by Eq. (11), where $\Omega$ is the atomic volume.


$$F_{\mathrm{int}}^{\max} = \frac{\mu b^2}{\pi}\, \frac{\Delta\Omega}{\Omega} \qquad (11)$$


6.2.2.2 Low Temperatures (Domain 2, Interaction with Fixed Solute Atoms) 
When a dislocation is pinned, the force exerted on the pinning point in the iso- 
tropic approximation (Fig. 6.4) is given by Eq. (12), where T is the dislocation-line 
tension, equal to the increase in dislocation energy per increase of length. 


$$F = 2T \sin\alpha \qquad (12)$$

The line tension is not exactly equal to the dislocation energy per unit length, for
several reasons discussed by Friedel (1964) and Kocks et al. (1975). Its order of
magnitude is $T \approx \tfrac{1}{2}\mu b^2$.


Fig. 6.4 Force exerted by a dislocation on a pinning point.
In the case of a low density of weak obstacles (small $\alpha$), this force can also be
written as Eq. (13), where $L_F$ is the distance between obstacles along the
dislocation line.


$$F = \tau b L_F \qquad (13)$$


In this approximation by Fleischer (1961), the net force acting on $L_F$ is
transferred to the obstacle point. Combining Eqs. (12) and (13) yields Eq. (14).


$$\tau = \frac{\mu b \sin\alpha}{L_F} \qquad (14)$$


$L_F$ is stress-dependent because more strongly curved dislocations interact with a
higher density of obstacles. It has been estimated by Friedel (1964) as given by
Eq. (15), where $d$ is an in-plane average distance between obstacles and $c_{\mathrm{ob}}$ is their
atomic concentration (number of obstacles per atom).


$$L_F = \left(\frac{\mu b d^2}{\tau}\right)^{1/3} = b \left(\frac{\mu}{\tau c_{\mathrm{ob}}}\right)^{1/3} \qquad (15)$$


Note that $c_{\mathrm{ob}}$ can be smaller than the average concentration of solute atoms, if
obstacles are clusters. Combining Eqs. (12), (14), and (15) yields Eq. (16).


$$\tau = \mu \sqrt{c_{\mathrm{ob}}} \left(\frac{F}{2T}\right)^{3/2} \qquad (16)$$


Another expression [Eq. (17)] has been proposed by Mott and Labusch (Labusch 
1970; Neuhauser and Schwink 1993) for obstacles of width w > b. 


PESO ” 


Several phenomenological expressions can be taken for the energy necessary to
cross the small-size obstacles as a function of the force $F$ (Kocks et al. 1975;
Caillard and Martin 2003). They can be expressed as Eq. (18), where $\Delta G_{\max}$ is the
maximum energy at $F = 0$, and $F_{\max}$ is the maximum force with which the obstacle
opposes the dislocation motion.


$$\Delta G(F) = \Delta G_{\max} \left[1 - \left(\frac{F}{F_{\max}}\right)^{a}\right]^{b} \qquad (18)$$


$F_{\max}$ is given approximately by Eq. (11). The exponents $a$ and $b$ are of the order of
unity, e.g., $a = 1$ and $b = 1$ for a rectangular force-distance profile; $a = 1$ and
$b = 3/2$ for a parabolic force-distance profile; and $a = 0.69$ and $b = 3/2$ for the
Cottrell-Bilby potential which correctly describes dislocation-solute atom
interactions (Cottrell and Bilby 1949; Wille et al. 1987).

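Combining Eqs. (16) and (18) yields Eq. (19), which is the expression proposed
by Kocks et al. (1975) with $0 < 2a/3 < 1$ and $1 < b < 2$.


$$\Delta G(\tau) = \Delta G_{\max} \left[1 - \left(\frac{\tau}{\tau_{\max}}\right)^{2a/3}\right]^{b} \qquad (19)$$


Taking Eq. (20) from Eq. (9), with Eq. (21) where $T_0$ is the temperature above
which the obstacles are crossed under a negligible stress (i.e., the “athermal”
temperature above which the deformation stress no longer decreases), yields Eq.
(22), which becomes Eq. (23) for the Cottrell-Bilby potential.


$$\Delta G(\tau) = k_B T \ln\frac{\dot\varepsilon_0}{\dot\varepsilon} \qquad (20)$$


$$\Delta G_{\max} = k_B T_0 \ln\frac{\dot\varepsilon_0}{\dot\varepsilon} \qquad (21)$$


$$\frac{\tau}{\tau_{\max}} = \left[1 - \left(\frac{T}{T_0}\right)^{1/b}\right]^{3/2a} \qquad (22)$$


$$\frac{\tau}{\tau_{\max}} = \left[1 - \left(\frac{T}{T_0}\right)^{2/3}\right]^{2.17} \qquad (23)$$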


This law has been verified experimentally in CuMn alloys by Wille et al. (1987).
The Cu-Mn system has been selected because the tendency to form short-range
order is thought to be negligible below 5 at.% Mn (see Flor et al. 2003), the
stacking-fault energy is almost independent of the Mn concentration, and the
size effect at the origin of the dislocation-solute interaction is very large. Figure
6.5 shows the critical resolved shear stresses (CRSS), in (a) as a function of
temperature, and in (b) in a suitable form to check the validity of Eq. (23). The
effective stress $\tau$ in (b) has been deduced from the total stress in (a) by subtracting the
internal stress $\tau_i$, assumed to be the value of the CRSS at the athermal temperature
($T_0$ ranges between 530 and 610 K, depending on the solute concentration;
see Fig. 6.5). $\tau_{\max}$ is the value of $\tau$ at 0 K. The results are in excellent agreement
with Eq. (23). $\Delta G_{\max}$ is estimated using Eq. (21), $F_{\max}$ is deduced from $\Delta G_{\max}$
using the approximation $F_{\max} \approx \Delta G_{\max}/w$, where $w \approx 2.5b$ ($w$ is the width of the
obstacle), and $c_{\mathrm{ob}} \propto \left(\tau_{\max}/F_{\max}^{3/2}\right)^2$ is deduced from Eq. (16) using the experimental
values of $\tau_{\max}$ and $\Delta G_{\max}$. The results indicate that the concentration of obstacles
$c_{\mathrm{ob}}$ is 20 times smaller than the average solute concentration. In addition, the
experimental value $\Delta G_{\max} = 1.3$ eV is at least three times that expected from interactions
with individual solute atoms. Therefore the authors conclude that the obstacles
consist of doublets or triplets of solute atoms.
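
The temperature dependence being tested can be tabulated in a few lines. The sketch below assumes the reduced-stress law in the form "one minus (T/T0) to the power 1/b, all raised to the power 3/(2a)", with a = 0.69 and b = 3/2 for the Cottrell-Bilby case; the athermal temperature of 550 K is an assumed value inside the range quoted above, and the function name is ours.

```python
def reduced_stress(temperature, t0, a=0.69, b=1.5):
    """Reduced effective stress tau / tau_max as a function of temperature:
    [1 - (T / T0)**(1 / b)]**(3 / (2 a)); zero at and above the athermal
    temperature T0."""
    if temperature >= t0:
        return 0.0
    return (1.0 - (temperature / t0) ** (1.0 / b)) ** (3.0 / (2.0 * a))

t0 = 550.0  # assumed athermal temperature in K (illustrative)

for temperature in (0.0, 77.0, 150.0, 300.0, 450.0, 550.0):
    print(f"T = {temperature:5.0f} K   tau / tau_max = "
          f"{reduced_stress(temperature, t0):.3f}")
```

The strong initial drop followed by a long, flat approach to zero is characteristic of an outer exponent larger than two. Plotting the reduced stress raised to the power 1/2.17 against the reduced temperature raised to the power 2/3 turns this curve into a straight line, which is one convenient way to test the law against data.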
Fig. 6.5 Deformation shear stress in CuMn alloys plotted (a) as a
function of temperature, and (b) to check the validity of Eq. (23). 
From Wille et al. (1987). 


6.2.2.3 Intermediate Temperatures (Domain 3, Stress Instabilities) 
As the temperature rises, solute atoms become sufficiently mobile to diffuse
toward dislocations. Immobile dislocations are then surrounded by an
“atmosphere” of solute atoms with local concentration $c$ given by Eq. (24), where $U_{\mathrm{int}}$
is given by the integration of Eq. (10), i.e., $U_{\mathrm{int}} = \dfrac{1}{\pi}\,\mu b\, \Delta\Omega\, \dfrac{\sin\theta}{r}$, and $c_0$ is the
average solute concentration.


$$c = c_0 \exp\left(-\frac{U_{\mathrm{int}}}{k_B T}\right) \qquad (24)$$
Fig. 6.6 Interaction between a moving edge dislocation and solute
atoms. (a) Computed solute atom density in a plane located one 
interatomic distance from the slip plane (the dislocation moves to the 
right). (b) Corresponding friction stress. From Yoshinaga and 
Morozumi (1971). 


The concentration $c$ can be either larger or smaller than $c_0$ according to the sign
of $\sin\theta$.

Moving dislocations tend to drag their atmosphere, and a dynamic equilibrium 
is established which depends on temperature and dislocation velocity. This equi- 
librium has been computed and the results are shown in Fig. 6.6(a). Figure 6.6(b) 
shows that the corresponding friction stress increases with dislocation velocity as 
long as the concentration profile remains unchanged. It decreases, however, 
when the density of the atmosphere decreases with increasing dislocation veloc- 
ity. It increases again when the dislocation velocity is so high that solute diffusion 
becomes negligible (which is the case in Section 6.2.2.2). 

This behavior is at the origin of stress instabilities. Figure 6.7 reproduces the 
variation of the velocity dependence of the friction stress plotted in Fig. 6.6(b). 
Let us consider a sample containing a density of mobile dislocations p, deformed 


Fig. 6.7 Schematic description of the origin of stress instabilities at 
intermediate temperatures (domain 3). 


at an imposed strain-rate $\dot\varepsilon$ ranging between $\rho b v_1$ and $\rho b v_2$. Figure 6.7 shows that
when the stress increases to the critical value $\tau_1$, dislocations suddenly accelerate
from $v_1$ to $v(\tau_1) > v_2$. Since $\rho b v(\tau_1) > \dot\varepsilon$, the tensile machine relaxes and the
applied stress decreases to the second critical value $\tau_2$. The dislocation velocity then
instantaneously decreases to $v(\tau_2) < v_1$. Since $\rho b v(\tau_2) < \dot\varepsilon$, the applied stress
increases again and another cycle starts. This behavior leads to stress instabilities,
also called the Portevin–Le Chatelier (PLC) effect. Increasing the applied strain-rate
increases the total time spent by the dislocations in the high-velocity regime
but the average flow stress keeps oscillating between $\tau_1$ and $\tau_2$. This implies that
the strain-rate dependence of the average flow stress must be zero.
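
The cycle described above can be traced on a toy friction law. The sketch below uses a piecewise-linear N-shaped curve purely as a stand-in for the computed friction stress; the shape, the slope of the fast branch and all numbers are assumptions chosen for illustration. It locates the two velocities between which the dislocations jump.

```python
def friction_stress(v, v1, v2, tau1, tau2, k_fast):
    """Piecewise-linear N-shaped friction stress (illustrative shape only):
    rising to tau1 at v1, falling to tau2 at v2, rising again beyond v2."""
    if v <= v1:
        return tau1 * v / v1
    if v <= v2:
        return tau1 + (tau2 - tau1) * (v - v1) / (v2 - v1)
    return tau2 + k_fast * (v - v2)

def jump_velocities(v1, v2, tau1, tau2, k_fast):
    """Velocities reached after the two jumps of one instability cycle."""
    v_high = v2 + (tau1 - tau2) / k_fast  # fast branch reached at stress tau1
    v_low = v1 * tau2 / tau1              # slow branch reached at stress tau2
    return v_high, v_low

# Assumed dimensionless parameters for the sketch.
v1, v2, tau1, tau2, k_fast = 1.0, 10.0, 2.0, 1.0, 0.1

v_high, v_low = jump_velocities(v1, v2, tau1, tau2, k_fast)
print(f"jump up at tau1:   v goes from {v1} to {v_high}")
print(f"jump down at tau2: v goes from {v2} to {v_low}")
```

Any imposed mean velocity between v1 and v2 falls on the descending, unstable branch, so the system can only satisfy it on average by alternating between the slow and the fast branch.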

An anomalous increase in average deformation stress with increasing temperature
can even be obtained when $v_1 \approx 0$. As a matter of fact, the unlocking stress $\tau_1$
increases with locking time (because of a more intense diffusion process), i.e.,
with decreasing strain-rate. This effect is called dynamic strain aging. More
details on stress instabilities can be found in several reviews (see, e.g., MacCormick
1972; Van den Beukel 1975; Mulford and Kocks 1979; Estrin and Kubin 1989; 
Kubin and Estrin 1990). 


6.2.2.4 High Temperatures (Domain 4, Diffusion-Controlled Glide) 
In this temperature range, dislocation glide is controlled by the diffusion of solute 
atoms present in their core. 

Two different approximations can be made, corresponding to $|U_{int}| > k_B T$ and
$|U_{int}| < k_B T$.

For $|U_{int}| > k_B T$, Eq. (24) shows that the solute cloud around the dislocation is
highly asymmetrical, with a high concentration of solute atoms in the region
corresponding to $U_{int} < 0$, and a weak depletion in the opposite one ($U_{int} > 0$).
According to Friedel (1964, p. 410), the solute atoms are then all very close to the
dislocation core, forming a row of pinning points with average distance $\lambda$
(Fig. 6.8). The work done by the applied stress during the diffusion of one solute
atom over one interatomic distance is $\tau b^2 \lambda$.

Fig. 6.8 Elementary dislocation motion controlled by the diffusion of a
solute atom over the distance b.

The frequency of this event is

$$\frac{1}{2}\,\nu_D\,\frac{b}{\lambda}\,\exp\left(-\frac{U_d - \tau b^2\lambda}{k_B T}\right)$$

in the forward direction and

$$\frac{1}{2}\,\nu_D\,\frac{b}{\lambda}\,\exp\left(-\frac{U_d + \tau b^2\lambda}{k_B T}\right)$$

in the backward one, where $U_d$ is the activation energy of solute diffusion. The
dislocation velocity, accordingly, is given by

$$v = \frac{1}{2}\,\nu_D\,\frac{b^2}{\lambda}\left[\exp\left(-\frac{U_d - \tau b^2\lambda}{k_B T}\right) - \exp\left(-\frac{U_d + \tau b^2\lambda}{k_B T}\right)\right]$$

or, assuming that $\tau b^2\lambda \ll k_B T$, by Eq. (25a).

$$v = \nu_D\,\frac{\tau b^4}{k_B T}\,\exp\left(-\frac{U_d}{k_B T}\right) \tag{25a}$$

This yields Eq. (25b), where $D = b^2 \nu_D \exp\left(-\frac{U_d}{k_B T}\right)$ is the
diffusion coefficient of solute atoms.

$$v = \frac{D b^2}{k_B T}\,\tau \tag{25b}$$
The friction force appears to be independent of the solute concentration in the 
dislocation core. 

For $|U_{int}| < k_B T$, the calculations are more complex. As shown by Caillard and
Martin (2003), the resulting friction force is however close to the value given by
Eq. (25).

Equation (1), where the velocity is given by Eq. (25a), and where the mobile
dislocation density is taken proportional to $\tau^2$ (Taylor law, cf. Section 6.3.2), is
considered to account for the creep properties of "class I" alloys. Class I alloys are
indeed characterized by a creep rate proportional to the third power of stress, an
activation energy equal to that of solute diffusion, and viscous dislocation motion
(see, e.g., Takeuchi and Argon 1976). In many cases, however, the creep properties
are considered too complex to be explained so easily (see, e.g., Poirier 1976).
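
The third-power stress dependence quoted above can be recovered in a few lines. The sketch below assumes that Eq. (1) is the Orowan relation $\dot\varepsilon = \rho_m b v$ and that the Taylor law takes the form $\tau = \alpha \mu b \sqrt{\rho_m}$; neither equation is reproduced in this section, and the symbols $\rho_m$ (mobile dislocation density) and $\alpha$ (a dimensionless constant) are introduced here only for illustration. The velocity is the linearized diffusion-controlled form of Eqs. (25a,b), written as $v = D b^2 \tau / k_B T$.

```latex
\begin{align*}
\rho_m &= \left(\frac{\tau}{\alpha \mu b}\right)^{2}
  && \text{(Taylor law, assumed form)} \\
v &= \frac{D b^{2}}{k_B T}\,\tau
  && \text{(Eqs. 25a,b)} \\
\dot\varepsilon &= \rho_m\, b\, v
  = \frac{D\, b}{\alpha^{2} \mu^{2} k_B T}\,\tau^{3}
  && \text{(Orowan relation, assumed form of Eq. 1)}
\end{align*}
```

Under these assumptions the temperature dependence enters mainly through $D$, which is why the apparent activation energy is that of solute diffusion.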


6.2.3 
Forest Mechanism 


The interaction between dislocations and other dislocations intersecting their slip
plane is called the "forest" mechanism, because each intersecting dislocation
behaves like a "tree". The intersection of two dislocations with Burgers vectors at
right angles, which only requires the formation of a pair of jogs, is similar to the
interaction between dislocations and fixed solute atoms, treated in Section 6.2.2.2,
and the same equations hold. The activation energy is then given by Eq. (18),
where $\Delta G_{max}$ is twice the jog energy $U_j$, and the activation volume is given by
Eq. (5). Taking a = b = 1 (rectangular force-distance profile), the activation volume
is $V = L_F b w$, where $w = \Delta G_{max}/F_{max}$ is the obstacle width, of the order of
the dislocation dissociation width, and the activation energy is
$\Delta G(\tau) = \Delta G_0 - \tau_F L_F b w$.
At a given strain rate and temperature, $\Delta G$ and thus the product $\tau_F L_F$ must
remain constant along deformation curves. According to Eq. (15), $\tau_F$ is then
inversely proportional to the average dislocation distance, $d$. In other words, $\tau_F$
can be written in the form of the Taylor law, i.e., $\tau_F = \alpha_{eff}\,\mu b \sqrt{\rho}$
(cf. Section 6.3.3.2), where $\alpha_{eff}$ depends on temperature and strain rate, but is
constant along a deformation curve.
The ratio $\Delta G/T$ must also remain constant, according to Eq. (9), which implies
that $\alpha_{eff}$ at constant strain rate must decrease linearly with increasing temperature.

Experimental evidence for the forest mechanism comes from the "Cottrell-Stokes"
law (Cottrell and Stokes 1955), which states that the effective and internal
stresses both obey the Taylor law and, as a result, remain proportional to
each other for various amounts of strain. These experiments and the corresponding
interpretations will be described in Section 6.3.3.


6.2.4 
Peierls-Type Friction Forces 


6.2.4.1 The Kink-Pair Mechanism 

Peierls (1940) and Nabarro (1947) were the first to remark that the dislocation
energy per unit length, $E$, necessarily has the same periodicity as the crystal lattice.
This variation can be described by a periodic "Peierls" potential with hills and
valleys of amplitude $\Delta E$ and wavelength $h$. The dislocation core structure is
determined by local atomic displacements which depend on the anisotropy of the
lattice and corresponding atomic bonds. It tends to extend along the directions of
easier shear displacements.

When the core is spread in the slip plane, it smooths the lattice periodicity
and the friction stress is low (small $\Delta E$). This is the case for the
$\frac{1}{2}[110]$ dislocations in the {111} planes of face-centered cubic (fcc) metals
and alloys, and for the $\frac{1}{3}[11\bar{2}0]$ dislocations in the basal plane of some
hexagonal close-packed (hcp) metals and alloys. When the core is extended out of the
slip plane, dislocation motion requires either very energetic collective atomic
displacements, or periodic transformations into a more compact and more glissile
core, which results in a high Peierls hill and high friction stress (large $\Delta E$).

Dislocations which are locked in the bottom of a Peierls valley ($y = 0$) can
move to the neighboring one ($y = h$) by the nucleation and propagation of a
kink-pair (Fig. 6.9).

Fig. 6.9 Two steps of the nucleation of a kink-pair on a dislocation.


According to Eqs. (3) and (4), the nucleation rate of kink-pairs per unit dislocation
length can be written as Eq. (26) (see Guyot and Dorn 1967), where
$\Delta G_{kp}(\tau)$ is the energy of the threshold position, $\nu_D b/l$ is the dislocation
vibration frequency of wavelength $l$, and $1/l$ is the corresponding density of
antinodes of vibration (i.e., the density of possible kink-pair nucleation sites).

$$P_{kp} = \nu_D\,\frac{b}{l^2}\,\exp\left(-\frac{\Delta G_{kp}(\tau)}{k_B T}\right) \tag{26}$$

The wavelength $l$ is usually taken equal to the critical value of the kink separation,
$\Delta x_c$, at the threshold position. The threshold energy (or activation energy)
$\Delta G_{kp}(\tau)$ decreases with increasing shear stress $\tau$. The corresponding
dislocation velocity $v_{kp}$ is given by Eq. (27), where $L$ is the dislocation length.
It is proportional to $L$ because the longer the dislocation, the larger the total
number of possible nucleation sites.

$$v_{kp} = \nu_D\,\frac{bL}{l^2}\,h\,\exp\left(-\frac{\Delta G_{kp}(\tau)}{k_B T}\right) \tag{27}$$

The difficult point is to estimate the activation energy $\Delta G_{kp}(\tau)$. Two
different approximations can be made.
(a) When the two kinks are well separated at their threshold
position (Fig. 6.9, case 2), the kink-pair energy $\Delta G_{kp}(\tau)$ is the
energy of two isolated kinks, $2G_k$, minus their elastic
interaction energy, and minus the work done by the stress
over the area swept by the kinks [Eq. (28)], where $\Delta x$ is the kink separation.

$$\Delta G_{kp}(\tau) = 2G_k - \frac{\mu h^2 b^2}{8\pi\,\Delta x} - h b \tau\,\Delta x \tag{28}$$

Elastic calculations yield Eq. (29), where $E_0$ is the average
dislocation-line energy per unit length.

$$G_k \approx h\,(E_0\,\Delta E)^{1/2} \tag{29}$$

The energy of Eq. (28) goes through a maximum value (the activation
energy) as a function of $\Delta x$ [Eqs. (30) and (31)].

$$\Delta G_{kp}(\tau) = 2G_k - (hb)^{3/2}\left(\frac{\mu\tau}{2\pi}\right)^{1/2} \tag{30}$$

for

$$\Delta x_c = \left(\frac{\mu h b}{8\pi\tau}\right)^{1/2} \tag{31}$$

The decrease in $\Delta G_{kp}(\tau)$ with increasing stress $\tau$ [Eq. (30)] is
at the origin of the stress dependence of the dislocation
velocity [Eq. (27)].

This approximation is valid until $\Delta x_c$ decreases below the
width of a single kink, i.e., for $\tau < \frac{\Delta E}{2\pi h b}$.


(b) When the critical kink-pair separation is small (Fig. 6.9, case
1), which happens at large stresses according to Eq. (31), the
two kinks are not well separated and the threshold
configuration is like a bowed-out dislocation arc. Several
approximations have been proposed, in the framework of
elastic calculations by Seeger (1956) and others (see
Caillard and Martin 2003). Atomistic calculations have also
been carried out by Duesberry (1983), which confirm that
$\Delta G_{kp}(\tau)$ decreases with increasing $\tau$.
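
The well-separated-kink approximation of case (a) can be checked numerically. The following is a minimal sketch, assuming the forms $\Delta G_{kp} = 2G_k - \mu h^2 b^2/(8\pi\Delta x) - hb\tau\Delta x$ for Eq. (28), $\Delta x_c = (\mu h b/8\pi\tau)^{1/2}$ for Eq. (31) and $\Delta G_{kp} = 2G_k - (hb)^{3/2}(\mu\tau/2\pi)^{1/2}$ for Eq. (30). The function names and the reduced-unit values are illustrative assumptions, not data from the text.

```python
import math


def kink_pair_energy(x, tau, g_k, mu, h, b):
    """Energy of a kink pair of separation x under shear stress tau (form of Eq. 28)."""
    elastic_attraction = mu * h**2 * b**2 / (8.0 * math.pi * x)
    stress_work = h * b * tau * x
    return 2.0 * g_k - elastic_attraction - stress_work


def critical_separation(tau, mu, h, b):
    """Separation at which the energy above is maximum (form of Eq. 31)."""
    return math.sqrt(mu * h * b / (8.0 * math.pi * tau))


def activation_energy(tau, g_k, mu, h, b):
    """Maximum of the kink-pair energy over x (form of Eq. 30)."""
    return 2.0 * g_k - (h * b) ** 1.5 * math.sqrt(mu * tau / (2.0 * math.pi))


# Illustrative reduced units (assumed values): mu = h = b = 1.
MU, H, B, G_K = 1.0, 1.0, 1.0, 0.05
TAU = 1.0e-3

x_c = critical_separation(TAU, MU, H, B)
dg_closed = activation_energy(TAU, G_K, MU, H, B)

# Brute-force scan of the separation between 0.2 x_c and 4.2 x_c.
separations = [x_c * (0.2 + 0.001 * i) for i in range(4001)]
dg_scan = max(kink_pair_energy(x, TAU, G_K, MU, H, B) for x in separations)

print(f"critical separation: {x_c:.3f} b")
print(f"activation energy (closed form): {dg_closed:.6f}")
print(f"activation energy (scan):        {dg_scan:.6f}")
```

The scan and the closed form agree, and doubling the stress lowers the barrier, which is the stress dependence that enters the velocity of Eq. (27).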


6.2.4.2 Locking-Unlocking Mechanism

Vitek (1966) was the first to remark that if the kink-pair formation involves the 
thermally activated transformation of a sessile dislocation (in the bottom of a 
Peierls valley) into a more glissile one (close to the hill of the Peierls valley), the 
reverse transformation may also be hindered by an energy barrier, and be ther- 
mally activated. In contrast to the situation described in Fig. 6.9, it cannot be con- 
sidered that dislocations extracted from a Peierls valley fall spontaneously into the 
neighboring one. In other words, they can keep a glissile configuration for a sub- 
stantial length of time, sufficient to move over a large distance. The reverse trans- 
formation takes place subsequently and locks the dislocation (Fig. 6.10). 

In this case, the motion can be treated as a series of locking and unlocking
events with respective probabilities $P_l$ and $P_{ul}$, per unit time and dislocation
length. If the dislocation velocity in the glissile configuration is $v_g$, the average
velocity is then given by Eq. (32).

$$v = v_g\,\frac{P_{ul}}{P_l} \tag{32}$$

Fig. 6.10 Schematic description of the locking-unlocking mechanism.


Both probabilities can be written in the same way as Eq. (26), which yields Eqs.
(33a) and (33b), where $l_l$ and $l_{ul}$ are the corresponding wavelengths.

$$P_l = \nu_D\,\frac{b}{l_l^2}\,\exp\left(-\frac{\Delta G_l}{k_B T}\right) \tag{33a}$$

$$P_{ul} = \nu_D\,\frac{b}{l_{ul}^2}\,\exp\left(-\frac{\Delta G_{ul}}{k_B T}\right) \tag{33b}$$

The corresponding dislocation velocity is given by Eq. (34).

$$v = v_g\,\frac{l_l^2}{l_{ul}^2}\,\exp\left(-\frac{\Delta G_{ul} - \Delta G_l}{k_B T}\right) \tag{34}$$

Since $\Delta G_l$ is expected to be fairly small, of the order of the energy of a
constriction on the glissile configuration, the dislocation velocity has the positive
activation energy $\Delta G_{ul} - \Delta G_l \approx \Delta G_{ul}$. Note that the
pre-exponential term does not have the usual form of Eqs. (4), (25a), or (27), because
it does not contain the Debye vibration frequency, $\nu_D$. This results from the fact
that thermal vibrations have two canceling effects, one increasing the unlocking
probability, $P_{ul}$, the other increasing the locking probability, $P_l$ [Eqs. (32) and (33)].
The activation energy of unlocking, $\Delta G_{ul}$, is estimated below.

The critical configuration for unlocking has the shape of an arc of a circle of
extension $y_c$, which can be larger than the distance $h$ between Peierls valleys. If
$y_c$ is large enough, elastic calculations can be used. The bulge energy is then,
according to Fig. 6.11:

$$G_b = 2R\sin\theta\,\Delta E + (E_0 + \Delta E)(2R\theta - 2R\sin\theta) - \tau b R^2(\theta - \sin\theta\cos\theta)$$

where $R$ is the radius of curvature of the bowing dislocation.

Fig. 6.11 Dislocation bowing-out of a Peierls valley.


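After a Taylor expansion of $\sin\theta$ and $\cos\theta$, and taking $\tau b = T/R$, where
$T$ is the dislocation-line tension [see Eq. (12)], $G_b$ goes through a maximum for the
critical angle

$$\theta_c = \left(\frac{2\Delta E}{2T - E_0}\right)^{1/2} \approx \left(\frac{2\Delta E}{E_0}\right)^{1/2}$$

This maximum of $G_b$ (i.e., the activation energy of unlocking) is given by
Eq. (35a), and the corresponding activation volume by Eq. (35b).

$$\Delta G_{ul} = \frac{2^{5/2}}{3}\,\frac{E_0^{1/2}\,\Delta E^{3/2}}{\tau b} \tag{35a}$$

$$V = \frac{2^{5/2}}{3}\,\frac{E_0^{1/2}\,\Delta E^{3/2}}{\tau^2 b} \tag{35b}$$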

6.2.4.3 Transition between Kink-Pair and Locking-Unlocking Mechanisms

The mean dislocation jump length of the locking-unlocking mechanism [Eq.
(36)] decreases with increasing temperature, owing to more efficient locking
[larger $P_l$, in agreement with Eq. (33a)].

$$y_g = \frac{v_g}{P_l L} \propto \exp\left(\frac{\Delta G_l}{k_B T}\right) \tag{36}$$

Therefore the locking-unlocking mechanism can take place only at temperatures
low enough to have $y_g > h$. The decrease of $y_g$ tends to slow down dislocations,
but is largely compensated by the increase of the unlocking probability $P_{ul}$ [Eq.
(33b)]. As a result, the dislocation velocity increases with temperature, in accordance
with $\Delta G_{ul} - \Delta G_l > 0$. When the temperature is sufficiently high for $y_g$ to
decrease to $h$, dislocations move by the kink-pair mechanism. The latter thus appears
as the high-temperature limit of the locking-unlocking mechanism. Details on the
transition mechanism have been published by Farenc et al. (1995) and Caillard and
Martin (2003). The expected variations of yield stress and activation volume as a
function of temperature are shown schematically in Fig. 6.12.

Fig. 6.12 Transition between kink-pair and locking-unlocking
mechanisms: (a) CRSS as a function of temperature; (b) activation
volume as a function of stress.


6.2.4.4 Observations of Peierls-Type Mechanisms 
The deepest Peierls valleys are expected to lie along the densest atomic rows of
the structure. They are also expected to be especially deep along screw dislocations
that are surrounded by several directions of easier shear displacement, on
account of their high degree of symmetry. Dislocations are expected to remain
straight along Peierls valley directions, and to move either smoothly (kink-pair
mechanism) or by series of small jumps (locking-unlocking mechanism).
Peierls-type friction stresses control the motion of dislocations in many metals
and alloys with various structures: bcc ones at low temperatures, hcp ones also at
low temperature (in prismatic slip), intermetallic alloys with various structures
($L1_2$, $L1_0$, B2, $D0_{19}$, etc.). In fact, easy glide in close-packed planes of fcc
metals appears to be the exception rather than the rule, although fcc alloys such as
steel are the most familiar to us.
(a) Body-centered cubic metals, such as Fe, Nb, Mo, and their
alloys, were the first ones in which the kink-pair mechanism
was clearly identified. The CRSS of pure Fe with two
orientations, C-doped Fe, and Mo with three orientations are
shown in Fig. 6.13. The strong decrease in stress at
increasing temperature and the small corresponding
activation volumes (of the order of a few tens of $b^3$) are
typical of this mechanism. Alloys with the same structure
exhibit similar properties.

Discontinuities in the behavior of stress and activation
volume with temperature, however, are still unexplained.

Fig. 6.13 Deformation shear stress versus temperature in bcc metals:
(a) pure iron with two orientations (open symbols) and carbon-doped 
iron (full symbols), from Kuramoto et al. (1979); (b) molybdenum with 
various orientations (from Aono et al. 1983). 


Strong orientation effects are observed in Fig. 6.13, in
agreement with a complex stress orientation dependence of
the core structure (Duesberry 1989; Duesberry and Vitek
1998). In-situ experiments show the corresponding steady
and viscous motion of straight screw dislocations in iron
alloys (Furubayashi 1969) and in Nb (Ikeno and Furubayashi
1972; Louchet and Kubin 1979). The nonplanar core
structure of screw dislocations at the origin of Peierls-type
friction forces has been computed by several authors, e.g.,
Duesberry and Vitek (1998) (Fig. 6.14).

Fig. 6.14 Computed core structure of screw dislocations: (a) non-degenerate
core in niobium; (b) degenerate core in molybdenum.
The relative atomic displacements between adjacent atoms are
perpendicular to the figure, and proportional to the length of the arrows
connecting them. From Duesberry and Vitek (1998).

(b) Hexagonal close-packed metals and alloys deformed in 
prismatic slip also exhibit important Peierls friction stresses 
up to room temperature. In Mg, in-situ experiments reveal 
straight screw dislocations moving steadily and viscously at 
room temperature in agreement with a kink-pair mechanism. 
The dislocation velocity is proportional to length in 
agreement with Eq. (27) (Fig. 6.15). The motion becomes 
jerkier at lower temperatures, in agreement with a transition 
toward the locking-unlocking mechanism (Couret and 
Caillard 1989). The latter has been extensively studied in Be, 
for which histograms of waiting times and jump lengths are 
fully consistent with Eqs. (33a,b) (Couret and Caillard 1989). 
The out-of-plane core structure at the origin of the Peierls- 
type friction stress in the prismatic planes of Mg and Be is 
an extension (or dissociation into Shockley partials) in the 
intersecting basal plane. It is more complex in Ti (Legrand 
1985). The transition between kink-pair and locking- 
unlocking mechanisms has been studied by means of in-situ 
experiments in Ti by Farenc et al. (1995). Figure 6.16 shows
that the activation volume varies with temperature as
expected from Fig. 6.12(b), and that the
discontinuity corresponds exactly to the transition between
locking-unlocking (large $y_g$) and kink-pair (very small $y_g$)
mechanisms. One can see from Fig. 6.17 that activation
volumes at low temperature vary as expected from Eq. (35b).
Similar results have been obtained in Ti-5 at.% Al alloys, Zr,
and Zircaloy polycrystals (Caillard and Martin 2003).

Fig. 6.15 Motion of a screw dislocation (denoted by x) and expansion
of a screw dipole (denoted by £a), and corresponding screw dislocation
velocity as a function of length. In-situ experiment in magnesium, 300 K.
From Couret and Caillard (1985).

(c) Intermetallic alloys exhibit extensive jerky dislocation motion
controlled by the locking-unlocking mechanism, e.g.,
ordinary dislocations in TiAl (Couret 1999), superdislocations
in cube planes of Ni3Al (Molénat and Caillard
1992), prismatic and basal planes of Ti3Al (Legros et al.
1996), {110} planes of Fe3Al (Molénat et al. 1998). Atomistic
calculations in TiAl reveal two possible core configurations
for ordinary dislocations, one sessile and one glissile, which
may account for the locking-unlocking mechanism observed
experimentally (Simmons et al. 1997) (Fig. 6.18).

Fig. 6.16 Average jump distance $y_g$ as a function of temperature
(in-situ experiments in titanium deformed in prismatic slip, from Farenc
et al. 1995), and comparison with the corresponding activation volume
V, measured in a conventional deformation test by Naka et al. (1988).

Fig. 6.17 Stress dependence of the activation area of prismatic slip in
titanium at T < 300 K. Open symbols are from Biget and Saada (1989);
full symbols are from Levine (1966).

Fig. 6.18 Two possible core structures for a screw ordinary dislocation
in TiAl: (a) planar and glissile in the {111} plane; (b) nonplanar and
subjected to a high friction force. Arrows have the same meaning as in
Fig. 6.14. From Simmons et al. (1997).


(d) In fcc metals, dislocations glide mostly in close-packed {111}
planes where their cores are extended (or dissociated in the
case of a low stacking fault energy), and where the Peierls-type
friction stress is accordingly very weak. However, the
same dislocations can also glide in the non-close-packed
planes, where the Peierls-type frictional forces are much
higher. Close-packed and non-close-packed glide can thus be
compared, respectively, with basal and prismatic glide in hcp
Mg or Be. Extensive glide in non-close-packed {110} and
{100} planes has been observed in various fcc pure metals,
for specific orientations of the straining axis (Fig. 6.19). It is
characterized by coarse and wavy slip lines, at variance with
the usual fine slip lines in {111} planes. Figure 6.19(a) shows
that the reduced temperature above which non-close-packed
slip becomes important varies linearly with the inverse of the
dissociation width, $\mu b/\gamma$ (where $\gamma$ is the stacking fault
energy). The critical stress for {100} slip measured by Carrard
and Martin (1988), shown in Fig. 6.19(b), has the same
general properties as those shown in Fig. 6.12(a). The
replacement of the a priori easier {111} slip by
non-close-packed slip is probably due to easier cross-slip, which
enhances dislocation multiplication.

Glide in non-close-packed planes is generally ignored in
plasticity models of fcc metals and alloys. Recent results,
however, indicate that the mechanical properties of
precipitation-hardened aluminum alloys are related to {100}
non-close-packed slip (Majimel et al. 2004).

Fig. 6.19 Properties of glide in non-close-packed planes of fcc metals:
(a) onset of {110} glide as a function of temperature and stacking fault
energy (from Le Hazif et al. 1973); (b) CRSS of {001} glide in aluminum
as a function of temperature, for different strain rates (from Carrard
and Martin 1988).


6.2.5 
Cross-Slip in fcc Metals and Alloys 


Cross-slip is a thermally activated change of dislocation slip plane, which plays an 
important role in by-passing of obstacles, source operation, and recovery. When 
dislocation motion is controlled by Peierls-type friction forces as in bcc alloys in 
the low-temperature regime, cross-slip occurs via a change of plane in which 
bulges or kink-pairs are nucleated, which does not require any supplementary 
activation energy. Cross-slip is thus especially important in fcc metals and alloys 
where dislocations glide freely in {111} planes, and where any change of plane is 
thermally activated. Under such conditions, it may determine the onset of defor- 
mation stage III, where hardening becomes dependent on temperature and 
strain-rate (Section 6.3.3.2). A review of cross-slip has been published recently by
Püschl (2002).


6.2.5.1 Elastic Calculations 

Cross-slip is generally considered to take place by the Friedel-Escaig mechanism
described in Fig. 6.20. Elastic calculations have been proposed to estimate the
constriction energy (Stroh 1954), and the activation energy and activation volume
of cross-slip (Escaig 1968). The results predict that the activation volume at low
stress should be of the order of $V_{cs}$ in Eq. (37), where $d$ is the dissociation width,
$\tau$ is the applied shear stress, $\tau_d$ and $\tau'_d$ are respectively the components of the
applied stress which tend to constrict the dislocation in the primary plane and to
widen it in the cross-slip plane, and $\alpha$ ranges between 0.5 and 1.




Fig. 6.20 Schematic description of the Friedel-Escaig cross-slip mechanism. 
Fig. 6.21 Tension-compression asymmetry of the different stresses
involved in cross-slip. The dissociation width in the primary and cross- 
slip planes changes according to the direction of the applied stress 
(three zones in the stereographic triangle), in tension and compression. 


$ 
Va = aapea (37) 


Equation (37) shows that the stress dependence of the cross-slip probability is
complex, and Fig. 6.21 shows that important tension-compression asymmetries
are expected, in domain B. Taking $\tau_d \approx \tau'_d \approx \tau$ and $\alpha \approx 1$ yields Eq. (38).

Fig. 6.22 Activation parameters of cross-slip in copper (from Escaig 1968).

Figure 6.22 shows the variations of the cross-slip activation energy, $\Delta G_{cs}$, and
activation volume, $V_{cs}$, over a large range of stress, still assuming
$\tau_d \approx \tau'_d \approx \tau$. $G_{constr}$ is the constriction energy calculated by Stroh (1954).


6.2.5.2 Atomistic Calculations 

Rao et al. (1999) simulated the core structure of cross-slipping dislocations in the
framework of the embedded-atom method. The results show that the constriction
is extended in the primary and cross-slip planes, as opposed to the point constriction
considered so far (Fig. 6.23). Vegge and Jacobsen (2002) used molecular dynamics
calculations and the "nudged elastic band" method. The resulting cross-slip
process is of the Escaig type as shown in Fig. 6.24. Activation volumes and
energies are of the same order of magnitude as those estimated in the elastic
calculations above.


6.2.5.3 Experimental Results 

The tension-compression asymmetry predicted in Fig. 6.21 has been verified
experimentally by Bonneville and Escaig (1979) (Fig. 6.25). By a suitable
predeformation and change of straining axis of their samples, the authors were able
to induce a large amount of cross-slip at yield, identified by slip line observations
and activation volume measurements. The corresponding stresses are
$\tau_d > \tau_{d,c}$ in tension, and $\tau_d < \tau_{d,c}$ in compression, as expected from
Fig. 6.21. Bonneville et al. (1988) measured $V_{cs} = 280b^3 \pm 65b^3$ in copper, a value
close to the prediction of Eqs. (37) and (38) ($V_{cs} = 294b^3$ with $d/b = 7$). They also
deduced from their experiments an activation enthalpy $\Delta H = 0.47\,\mathrm{eV} \pm 0.16\,\mathrm{eV}$,
which yields the activation energy $\Delta G_0 = 1.15\,\mathrm{eV} \pm 0.37\,\mathrm{eV}$ when extrapolated
to 0 K. This value corresponds to $\Delta G_{cs} - G_{constr}$ at 0 K given by Fig. 6.22 (case of a
pre-existing constriction). Similar results have been obtained in aluminum
(Bonneville and Vanderschaeve 1985).

Fig. 6.23 Series of cross-sections through a screw dislocation
constriction. Atomistic calculations by Rao et al. (1999).

Fig. 6.24 Successive core configurations during cross-slip. The primary
dissociation plane is in the plane of the figure, whereas only the trace of
the cross-slip plane can be seen. From Vegge and Jacobsen (2002).

Fig. 6.25 Tension-compression asymmetry of the yield stress of
predeformed copper, at 300 K: (a) tension; (b) compression. Curves A-C
refer to the sample orientations of Fig. 6.21. From Bonneville and
Escaig (1979).


6.2.6 
Dislocation Climb 


Dislocations climb when their motion involves the diffusion of atoms over
long distances, i.e., when their Burgers vector lies out of their plane of motion
(see Section 6.1). Climb plays a fundamental role in the mechanical properties
of metals and alloys at high temperature. It is generally coupled with glide but
sometimes can account for the whole plastic deformation, as discussed in Section
6.2.6.6. Since it is the only process able to annihilate edge dislocations and to
build regular sub-boundary networks, it also plays a fundamental role in
high-temperature recovery and creep. In spite of its importance, however, climb
remains poorly understood, mainly because of a lack of clear experimental data.


6.2.6.1 Emission of Vacancies at Jogs 

Climb takes place by emission or absorption of vacancies or interstitial atoms at 
jogs. Only vacancy-mediated climb is treated here, because of its lower activation 
energy and higher velocity. The results can be transposed easily to the case of in- 
terstitials, however (Fig. 6.26). 


Fig. 6.26 Growth of (a) interstitial and (b) vacancy loops by exchange 
of vacancies (open symbol) or interstitials (full symbol). 
According to Eqs. (3)-(6) (where y and h are taken equal to one interatomic
distance, $a$), the frequency of jog motion by vacancy emission in the direction
favored by the stress is given by Eq. (39a), where $n$ is the number of first nearest
neighbors in the lattice where the vacancy can go ($n = 12$ in fcc metals and
alloys), $\Delta G_f^{(v)}$ and $\Delta G_d^{(v)}$ are respectively the formation and diffusion
energies of vacancies, $D^{(sd)} = a^2 \nu_D \exp\left(-\frac{\Delta G^{(sd)}}{k_B T}\right)$ is the
self-diffusion coefficient, $\Delta G^{(sd)} = \Delta G_f^{(v)} + \Delta G_d^{(v)}$ is the activation
energy of self-diffusion, $\Omega$ is the atomic volume ($\Omega \approx a^3$), and $\tau\Omega$ is the
work done by the stress during the jog motion.

$$P_j^{+} = n\,\nu_D \exp\left(-\frac{\Delta G_f^{(v)} + \Delta G_d^{(v)} - \tau\Omega}{k_B T}\right) = \frac{n D^{(sd)}}{a^2}\,\exp\left(\frac{\tau\Omega}{k_B T}\right) \tag{39a}$$

The probability per unit time of the reverse motion by vacancy absorption is given
by Eq. (39b), where $c_v^{(0)} = \exp\left(-\frac{\Delta G_f^{(v)}}{k_B T}\right)$ is the equilibrium vacancy
concentration, and $c_v^{(j)}$ is the local vacancy concentration near the jog.

$$P_j^{-} = n\,\nu_D\, c_v^{(j)} \exp\left(-\frac{\Delta G_d^{(v)}}{k_B T}\right) = \frac{n D^{(sd)}}{a^2}\,\frac{c_v^{(j)}}{c_v^{(0)}} \tag{39b}$$

The dislocation velocity can then be expressed as Eq. (40), where $x$ is the average
distance between jogs; $c_v^{(j)}$ and $x$ are estimated in Sections 6.2.6.2 and 6.2.6.3
below.
6.2.6.2 Diffusion of Vacancies from Jogs

The vacancies, which have a local concentration $c_v^{(j)}$ around the jogs, tend to
diffuse away at the average velocity given by Eq. (41) (compare Chapter 5), where $D_v$
is the diffusion coefficient of vacancies, defined in Eq. (42).

$$v_v = -\frac{D_v}{c_v}\,\frac{\partial c_v}{\partial r} \tag{41}$$

$$D_v = a^2 \nu_D \exp\left(-\frac{\Delta G_d^{(v)}}{k_B T}\right) \tag{42}$$

Considering a low jog density and a spherical diffusion around each jog
(Fig. 6.27), the diffusion flux away from the jog is
$\Phi_v = \frac{c_v(r)}{\Omega}\,4\pi r^2 v_v$. Using Eq. (41), the steady-state condition
$\frac{\partial \Phi_v}{\partial r} = 0$ yields the vacancy concentration at the
distance $r$ from the jog, $c_v(r) = c_v^{(0)} + \left(c_v^{(j)} - c_v^{(0)}\right)\frac{a}{r}$, and the
total diffusion flux away from the jog,
$\Phi_v = 4\pi D_v\,\frac{a}{\Omega}\left(c_v^{(j)} - c_v^{(0)}\right)$.

Fig. 6.27 Diffusion of vacancies from climbing jogs.

This flux must be equal to $P_j^{+} - P_j^{-}$ given by Eqs. (39a,b), which allows one to
eliminate $c_v^{(j)}$. The corresponding dislocation velocity is then given by Eq. (43a) or,
assuming $n \approx 4\pi$, $\Omega \approx a^3$, $b \approx a$, and $\tau\Omega \ll k_B T$, by Eq. (43b).

$$v = \frac{4\pi n}{n + 4\pi}\,\frac{a}{b x}\,D^{(sd)}\left[\exp\left(\frac{\tau\Omega}{k_B T}\right) - 1\right] \tag{43a}$$

$$v = 2\pi\,\frac{D^{(sd)}}{x}\,\frac{\tau\Omega}{k_B T} \tag{43b}$$

This method, proposed by Edelin (1971), yields a dislocation climb velocity
substantially different from that often found in the literature [Eq. (43c)].

$$v = 2\pi\,\frac{D^{(sd)}}{a}\,\frac{\tau\Omega}{k_B T} \tag{43c}$$

The latter expression is justified only when $x \approx a$, which is difficult to satisfy in
practice. The average jog spacing $x$ is estimated in Section 6.2.6.3.
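
The elimination of the local vacancy concentration can be verified numerically. The sketch below is a hedged illustration, assuming $\Omega = a^3$, $D^{(sd)} = c_v^{(0)} D_v$, a spherical sink of radius $a$ around the jog, and a climb velocity equal to the net jog rate multiplied by $\Omega/(bx)$; the last relation stands in for Eq. (40), which is not reproduced in this section. Function names and numerical values are illustrative only; the stress enters through the reduced variable $s = \tau\Omega/k_B T$.

```python
import math


def concentration_ratio(n, s):
    """c_j/c_0 obtained by equating the net emission rate to the diffusion flux.

    Balance (with Omega = a**3 and D_sd = c_0*D_v, both assumptions):
        n*(c_0*exp(s) - c_j) = 4*pi*(c_j - c_0)
    """
    return (n * math.exp(s) + 4.0 * math.pi) / (n + 4.0 * math.pi)


def net_jog_rate(n, s, d_sd=1.0, a=1.0):
    """P_j^+ - P_j^- with the steady-state local concentration inserted."""
    return n * d_sd / a**2 * (math.exp(s) - concentration_ratio(n, s))


def climb_velocity_43a(n, s, x, d_sd=1.0, a=1.0, b=1.0):
    """Climb velocity in the form of Eq. (43a)."""
    prefactor = 4.0 * math.pi * n / (n + 4.0 * math.pi)
    return prefactor * a / (b * x) * d_sd * (math.exp(s) - 1.0)


def climb_velocity_43b(s, x, d_sd=1.0):
    """Low-stress form of Eq. (43b), valid for n ~ 4*pi, b ~ a and s << 1."""
    return 2.0 * math.pi * d_sd / x * s


N_FCC = 12       # nearest neighbours in fcc
S = 0.01         # reduced stress tau*Omega/(k_B T), assumed value
X = 50.0         # jog spacing in units of a, assumed value

# Velocity built directly from the rate balance (a = b = 1, Omega = a**3).
v_balance = (1.0**3 / (1.0 * X)) * net_jog_rate(N_FCC, S)
v_43a = climb_velocity_43a(N_FCC, S, X)
v_43b = climb_velocity_43b(S, X)

print(f"rate balance: {v_balance:.6e}")
print(f"Eq. (43a):    {v_43a:.6e}")
print(f"Eq. (43b):    {v_43b:.6e}")
```

With $n = 12$ the prefactor $4\pi n/(n + 4\pi)$ is about 6.14 instead of $2\pi \approx 6.28$, so replacing $n$ by $4\pi$ changes the velocity by only a few percent.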


6.2.6.3 Jog Density and Jog-Pair Mechanism 
When the climb velocity is very low, the jog concentration a/x remains close to 
thermal equilibrium, according to Eq. (44) where G; is the energy of an isolated jog. 
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Fig. 6.28 Dislocations containing jogs: (a) thermal jogs on a 
dislocation at rest; (b) jog-pair nucleation and jog motion under stress. 


a Gj 
Í = exp - —Ż 44 
T exp — (44) 


This is generally not the case, however, because jogs are rapidly swept by the 
stress along dislocation lines, and jog-pair nucleation must be considered (Fig. 
6.28b). Jog-pair nucleation is different from the kink-pair nucleation discussed 
in Section 6.2.3.1, because jog motion is controlled by diffusion, whereas kinks 
can glide freely. Jog-pair nucleation can be better compared with the kink-pair nu- 
cleation in semiconductors, where the motion of kinks is slowed down by the cut- 
ting of covalent bonds. In fact, as pointed out by Hirth and Lothe (1982), the 
same equations can be used. This yields the mean free path of jogs issued from 
a pair between two jog-pair nucleations (Fig. 6.28b), Eq. (45) where AG;,(r) is 
given by Eq. (30) containing Gj instead of Gr. 


AG; 
x = a exp su (45) 


This value can be inserted in Eq. (43), which yields (for a dislocation length larger 
than x, as in Fig. 6.27) the climb velocity from Eq. (46a) or, at sufficiently low 
stresses, Eq. (46b). 


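$$v = 2\pi\,\frac{D^{(sd)}}{a}\left[\exp\left(\frac{\tau\Omega}{k_BT}\right) - 1\right]\exp\left(-\frac{\Delta G_{jp}(\tau)}{2k_BT}\right) \tag{46a}$$

$$v = 2\pi\,\frac{D^{(sd)}}{a}\,\frac{\tau\Omega}{k_BT}\exp\left(-\frac{\Delta G_{jp}(\tau)}{2k_BT}\right) \tag{46b}$$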


The corresponding activation energy is $\Delta G^{(sd)} + \tfrac{1}{2}\Delta G_{jp}(\tau)$, which is thus larger than the self-diffusion energy, $\Delta G^{(sd)}$.

Fast diffusion in the core of climbing dislocations may increase the dislocation velocity at low temperature (see, e.g., Hirth and Lothe 1982).




6.2.6.4 Effect of Over- (Under-) Saturations of Vacancies: Chemical Force 

The diffusion of vacancies discussed in Section 6.2.6.2 is modified when for some reason the average vacancy concentration is not the thermal equilibrium one. Then, $c^{(0)}$ must be replaced by a higher or lower value, $c^{(s)}$, which results in a modified dislocation climb velocity [Eq. (47a)]; $x$ is given by Eq. (44) or (45).











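$$v = 2\pi\,\frac{D^{(sd)}}{x}\left[\exp\left(\frac{\tau\Omega}{k_BT}\right) - \frac{c^{(s)}}{c^{(0)}}\right] \tag{47a}$$

Defining the chemical force as $F_c = \frac{k_BT}{\Omega}\ln\frac{c^{(s)}}{c^{(0)}}$, Eq. (47a) can be written in the more symmetrical form of Eq. (47b) or, at sufficiently low stresses, as Eq. (47c).

$$v = 2\pi\,\frac{D^{(sd)}}{x}\left[\exp\left(\frac{\tau\Omega}{k_BT}\right) - \exp\left(\frac{F_c\Omega}{k_BT}\right)\right] \tag{47b}$$

$$v = 2\pi\,\frac{D^{(sd)}}{x}\,\frac{\Omega}{k_BT}\left(\tau - F_c\right) \tag{47c}$$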


Equation (47) reduces to Eq. (43) when $F_c = 0$.
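
As a rough numerical illustration of Eqs. (47b) and (47c), the following sketch evaluates the climb velocity with and without a chemical force. It assumes the forms $v = 2\pi (D^{(sd)}/x)[\exp(\tau\Omega/k_BT) - \exp(F_c\Omega/k_BT)]$ and its low-stress linearization $v = 2\pi (D^{(sd)}/x)(\Omega/k_BT)(\tau - F_c)$. The function names and all numerical values (diffusivity, jog spacing, atomic volume, temperature) are hypothetical, chosen only to give plausible orders of magnitude.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K


def climb_velocity(tau, f_c, d_sd, x, omega, temperature):
    """Eq. (47b): climb velocity under stress tau and chemical force f_c (both in Pa)."""
    kt = K_B * temperature
    prefactor = 2.0 * math.pi * d_sd / x
    return prefactor * (math.exp(tau * omega / kt) - math.exp(f_c * omega / kt))


def climb_velocity_linear(tau, f_c, d_sd, x, omega, temperature):
    """Eq. (47c): low-stress linearization of Eq. (47b)."""
    kt = K_B * temperature
    return 2.0 * math.pi * d_sd / x * omega / kt * (tau - f_c)


# Hypothetical, illustrative parameters (not taken from the text):
# self-diffusion coefficient (m^2/s), jog spacing (m), atomic volume (m^3), temperature (K).
params = dict(d_sd=1e-18, x=1e-8, omega=2.7e-29, temperature=700.0)
tau = 5e6  # applied stress, Pa

v_no_force = climb_velocity(tau, 0.0, **params)      # F_c = 0: Eq. (43) is recovered
v_balanced = climb_velocity(tau, tau, **params)      # F_c = tau: climb stops
v_linear = climb_velocity_linear(tau, 0.0, **params)
```

With these values $\tau\Omega/k_BT$ is about 0.014, so the linearized and exponential forms differ by less than one percent; the difference grows quickly at higher stress or lower temperature.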

Chemical forces can originate from quenching or from the climb mechanism itself. As a matter of fact, pure climb deformation by emission of vacancies from system 1 (Fig. 6.29) strongly increases the average vacancy concentration. This induces a high chemical force, opposing the first system, but activating the second one, which is not activated by the applied stress. The second system thus contributes to absorbing the vacancies emitted by the first one. A steady state is rapidly established, for which the number of emitted vacancies, which is proportional to $\dot\varepsilon = \rho_1 b_1 v_1$, is equal to the number of absorbed ones, which is proportional to $\rho_2 b_2 v_2$. If $\rho_1 \approx \rho_2$ and $b_1 = b_2$, the two velocities are equal, which according to Eq. (47c) implies that $\tau - F_c = F_c$, i.e., $F_c = \tau/2$.





Fig. 6.29 Pure climb tensile deformation, by exchange of vacancies 
between system 1 activated by the applied stress, and system 2 
activated by a chemical stress. 

6.2.6.5 Stress Dependence of the Dislocation Climb Velocity
The dislocation climb velocity is usually considered to be proportional to stress, in agreement with Eq. (43b) or (43c), where $x$ is assumed to be constant. Using Eq. (46b), the stress dependence of the dislocation velocity, $n = \Delta\ln v/\Delta\ln\tau$, can more precisely be expressed as in Eq. (48), and is higher than unity.

$$n = \frac{\Delta\ln v}{\Delta\ln\tau} = 1 - \frac{\tau}{2k_BT}\,\frac{\partial\Delta G_{jp}(\tau)}{\partial\tau} \tag{48}$$
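
The relation between Eqs. (46b) and (48) can be checked numerically. The sketch below compares the exponent $n = 1 - (\tau/2k_BT)\,\partial\Delta G_{jp}/\partial\tau$ with a direct finite-difference slope of $\ln v$ versus $\ln\tau$ computed from the stress dependence of Eq. (46b), $v \propto \tau\exp(-\Delta G_{jp}(\tau)/2k_BT)$. Since Eq. (30) is not reproduced here, `jog_pair_energy` is a hypothetical stand-in of the form $2G_j - c\sqrt{\tau}$; its coefficients and the temperature are illustrative assumptions only.

```python
import math

K_B_EV = 8.617333e-5  # Boltzmann constant, eV/K


def jog_pair_energy(tau, g_j=0.5, c=0.055):
    """Hypothetical stand-in for Eq. (30): 2*G_j - c*sqrt(tau), in eV (tau in MPa)."""
    return 2.0 * g_j - c * math.sqrt(tau)


def log_velocity(tau, temperature):
    """ln v from Eq. (46b), up to an additive constant that does not depend on stress."""
    return math.log(tau) - jog_pair_energy(tau) / (2.0 * K_B_EV * temperature)


def stress_exponent_eq48(tau, temperature, h=1e-6):
    """Eq. (48): n = 1 - tau / (2 k_B T) * d(Delta G_jp)/d(tau)."""
    slope = (jog_pair_energy(tau + h) - jog_pair_energy(tau - h)) / (2.0 * h)
    return 1.0 - tau / (2.0 * K_B_EV * temperature) * slope


def stress_exponent_direct(tau, temperature, h=1e-4):
    """n = d ln v / d ln tau, by central differences in ln(tau)."""
    up = log_velocity(tau * math.exp(h), temperature)
    down = log_velocity(tau * math.exp(-h), temperature)
    return (up - down) / (2.0 * h)


n_eq48 = stress_exponent_eq48(10.0, 700.0)      # tau = 10 MPa, T = 700 K (assumed)
n_direct = stress_exponent_direct(10.0, 700.0)
```

Because the jog-pair energy decreases with stress, both estimates exceed unity, and they agree to within the finite-difference error.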


6.2.6.6 Experimental Results 

Quantitative experimental studies of dislocation climb are rather scarce, because 
climb is difficult to isolate. Available results concern the growth and shrinkage 
of prismatic loops during annealing, and pure climb of c dislocations in hcp 
metals. 

Experiments have been carried out on quenched metals (essentially Al) where 
thermal vacancies coalesce to form vacancy-type perfect and faulted prismatic 
loops. The main results have been reviewed by Washburn (1972) and Smallman 
and Westmacott (1972). 

In thin foils, where vacancies can easily be created or eliminated at the surfaces, the chemical stress remains close to zero. The shrinking velocity is then given by Eq. (43b), where $x \approx b$ (very high jog density) and where the stress originates from the line tension and the surface energy of the inner fault, as expressed in Eq. (49), where $R$ is the loop radius, $T$ is the line tension, and $\gamma$ is the stacking fault energy.

$$\frac{dR}{dt} = -2\pi\,\frac{D^{(sd)}\Omega}{b\,k_BT}\left(\frac{T}{bR} + \frac{\gamma}{b}\right) \tag{49}$$

For perfect loops ($\gamma = 0$), this expression can be integrated to yield Eq. (50), where $R_0$ is the initial loop radius and $t_0$ is a constant.

$$R = R_0\left(1 - \frac{t}{t_0}\right)^{1/2} \tag{50}$$


This variation has been observed in experiments, as shown in Fig. 6.30, which 
demonstrates that the climb velocity is actually proportional to stress when the 
jog density is high. 
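
The square-root shrinkage law of Eq. (50) can be recovered by integrating the perfect-loop case of Eq. (49) numerically. In the sketch below the rate equation is written as $dR/dt = -c/R$, where the constant `c` lumps together the diffusivity, atomic volume, line tension, and temperature factors (this grouping, the function names, and the dimensionless values are assumptions made for illustration). The closed-form result is then $R = R_0(1 - t/t_0)^{1/2}$ with $t_0 = R_0^2/2c$.

```python
import math


def shrink_radius_numeric(r0, c, t_end, steps=20000):
    """Integrate dR/dt = -c/R (Eq. (49) with gamma = 0) by the explicit Euler method."""
    dt = t_end / steps
    r = r0
    for _ in range(steps):
        r -= c / r * dt
    return r


def shrink_radius_analytic(r0, c, t):
    """Eq. (50): R = R0 * sqrt(1 - t/t0), with t0 = R0**2 / (2*c)."""
    t0 = r0 ** 2 / (2.0 * c)
    return r0 * math.sqrt(1.0 - t / t0)


r0, c = 1.0, 0.5   # dimensionless, illustrative values; here t0 = 1
t = 0.75           # three quarters of the loop lifetime
r_num = shrink_radius_numeric(r0, c, t)
r_exact = shrink_radius_analytic(r0, c, t)
```

The shrinkage accelerates as the loop gets smaller, because the line-tension stress varies as $1/R$.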

The most comprehensive study of climb in deformed materials has been carried out in hcp magnesium by Edelin and Poirier (1973). When compressed at high temperature along the c axis, deformation takes place by pure climb of dislocations with Burgers vector parallel to the c axis. The density of dislocations has been measured as a function of strain, and the corresponding velocity has been deduced from the applied strain-rate using the Orowan law [Eq. (1)]. The results shown in Fig. 6.31 yield an activation energy larger than that of self-diffusion (1.80 eV instead of 1.43 eV) and $n = \Delta\ln v/\Delta\ln\tau \approx 2.8$. These parameters are not consistent with Eq. (43). They may however be explained by Eq. (46b), provided $\tfrac{1}{2}\Delta G_{jp}(\tau)$ is of the order of 0.37 eV, and by Eq. (48), provided $-\partial\Delta G_{jp}(\tau)/\partial\tau$ is large enough. The latter value can be estimated from Eq. (30). Using Eqs. (30) and (48), and taking $\mu = 1.75 \times 10^4$ MPa, $\tau = 10$ MPa, $b = 0.55$ nm, and $h \approx b$, we find $n = 1.8$, which is definitely greater than 1 although smaller than the experimental value ($n = 2.8$).

Fig. 6.30 Kinetics of shrinking of perfect prismatic loops in aluminum. The curve corresponds to Eq. (50). Annealing experiment in TEM by Silcox and Whelan (1960).

Fig. 6.31 Dislocation climb velocity in magnesium: (a) as a function of temperature; (b) as a function of stress. From Edelin and Poirier (1973).

Even if the theoretical estimates do not compare perfectly with the experimen- 
tal results, these experiments confirm that the activation energy of climb can be 
larger than the self-diffusion one, and that the stress dependence of the corre- 
sponding dislocation velocity, n, can be greater than unity, at least at high 
stresses. 

Dislocation motion by pure climb also takes place in intermetallic alloys, such as NiAl (Fraser et al. 1973a,b; Srinivasan et al. 1997), and in the $\gamma'$ phase of superalloys (Louchet and Ignat 1986; Eggeler and Dlouhy 1997; Epishin and Link 2004). However, these results have never been incorporated in any model of plastic deformation or creep at high temperature.

Other examples of pure climb dislocation motion with similar properties are 
described in Section 6.4.5, on quasicrystals. 


6.2.7 
Conclusions on Thermally Activated Mechanisms 


This review of the principal thermally activated dislocation mechanisms in metals 
and alloys shows that the temperature and strain-rate dependence of mechanical 
properties can result from various origins. Peierls-type stresses and interactions 
with fixed solute atoms are generally important at low temperature, dynamic in- 
teractions with mobile solute atoms and cross-slip become rate-controlling at in- 
termediate temperatures, and diffusion-controlled glide and climb are restricted 
to high temperatures. Other thermally activated mechanisms such as grain 
boundary sliding, dislocation emission at free surfaces and crack tips, etc. are 
less well documented and will not be described here. The identification of these 
mechanisms requires a fine analysis of the experimental activation parameters 
(especially the activation volume) as well as detailed TEM analyses of the disloca- 
tion substructure and in-situ strain experiments. 


6.3 
Hardening and Recovery 


The mechanisms described in this section are related to the long-range elastic in- 
teractions between dislocations, which lead to the formation of complex disloca- 
tion structures. Internal stresses appear progressively as a result of the accumula- 
tion of back-products of dislocation motion: stored dislocations, debris, tangled 
zones, etc. It is thus important to make the distinction between mobile disloca- 
tions which account for the strain rate, and stored ones which are obstacles. Sev- 
eral aspects of hardening and recovery processes are summarized below. 




6.3.1 
Dislocation Multiplication versus Exhaustion 


As already discussed in Section 6.1, plastic deformation can be expressed by the Orowan law:

$$\varepsilon = \rho_m b x$$

Here $\varepsilon$ is the plastic strain, $\rho_m$ is the density of mobile dislocations (here assumed to be only a fraction of the total), $b$ is their Burgers vector, and $x$ is their traveling distance. The plastic strain-rate can then be expressed as Eq. (51a), where $v$ is the dislocation velocity.

$$\dot\varepsilon = \rho_m b v + \dot\rho_m b x \tag{51a}$$

Equation (51a) yields Eq. (1) when the second term of the right-hand side is neglected. The exact meaning of this second term can be understood better in the more explicit form of Eq. (51b), where $\Delta\rho_m = \frac{\partial\rho_m}{\partial x}x$ is the variation of the original density $\rho_m$ after the displacement $x$.

$$\dot\varepsilon = \rho_m b v + \frac{\partial\rho_m}{\partial x}\frac{\partial x}{\partial t}\,b x = (\rho_m + \Delta\rho_m)\,b v \tag{51b}$$


The value of $\Delta\rho_m$ can be either positive, as a result of dislocation multiplication, or negative, as a result of dislocation exhaustion. At constant strain rate $\dot\varepsilon$, any increase in $\rho_m$ results in a decrease in the dislocation velocity $v$, and a decrease in the corresponding effective applied stress. This accounts for the high yield drop observed in materials such as semiconductors (Fig. 6.32). On the contrary, any decrease in $\rho_m$ must be compensated by the activation of new sources, which requires an increase in stress, called exhaustion hardening. The difference between exhaustion hardening and classical internal stress hardening will be discussed in Section 6.3.2.

Fig. 6.32 Decrease in stress after yield (“yield drop”) as a result of intensive dislocation multiplication in germanium. (From Dupas et al. 2002.)
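
The yield-drop argument can be made concrete with a minimal sketch. At constant strain rate the Orowan law fixes the velocity, $v = \dot\varepsilon/(\rho_m b)$, and a velocity law then fixes the effective stress. The power law $v = v_0(\tau/\tau_0)^m$, the linear growth of $\rho_m$ with strain, and every numerical value below are assumptions introduced for illustration; they are not taken from the text, and elastic loading before yield is ignored.

```python
def effective_stress(rho_m, strain_rate, b, v0, tau0, m):
    """Stress needed to sustain the imposed strain rate.

    The velocity follows from the Orowan law, v = strain_rate / (rho_m * b),
    and is converted to a stress with the assumed power law v = v0 * (tau / tau0)**m.
    """
    v = strain_rate / (rho_m * b)
    return tau0 * (v / v0) ** (1.0 / m)


RHO_0 = 1e8         # initial mobile density, m^-2 (hypothetical)
GAIN = 1e13         # multiplication rate, m^-2 per unit strain (hypothetical)
STRAIN_RATE = 1e-4  # s^-1
B = 4e-10           # Burgers vector, m
V0, TAU0, M = 1e-4, 10.0, 1.5  # velocity-law parameters (m/s, MPa, exponent), hypothetical

strains = [0.002 * i for i in range(11)]
stresses = [
    effective_stress(RHO_0 + GAIN * eps, STRAIN_RATE, B, V0, TAU0, M)
    for eps in strains
]
```

The stress falls monotonically as the mobile density multiplies, and the total drop scales as $(\rho_0/\rho_{end})^{1/m}$, so materials with a low velocity exponent $m$, such as semiconductors, show the largest yield drops.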


6.3.1.1 Dislocation Sources 
Various types of dislocation sources have been observed by means of TEM in-situ strain experiments.

• Spiral sources consisting of one dislocation revolving around a single anchoring point have been observed in most cases (Fig. 6.33): these occur more frequently than the classical Frank–Read source, which requires the presence of two strong anchoring points at a suitable distance.
• Open and closed loops forming on mobile screw dislocations: the principles of this mechanism are as follows. A screw dislocation subjected to extensive cross-slip meets a weak obstacle, such as a small cluster or a solute atom. A jog is formed, provided cross-slip takes place independently on both sides of the obstacle (Fig. 6.34); then, the two arms can rotate around the jog in two parallel planes. The lifetime of such sources is generally very short because the two arms can cross-slip back in their original slip plane after one revolution. The same process can lead to the formation of a dipole (or closed loop) which subsequently expands as shown in Fig. 6.15(a)-(d) (where the closed loop fa has been nucleated by the dislocation x). Multiplication at closed and open loops is restricted to materials where straight screw dislocations subjected to Peierls-type forces exhibit frequent cross-slip (bcc and hcp metals and alloys at low temperatures).

Fig. 6.33 Dislocation source rotating around a single anchoring point (arrowed). Note the elongation of the emitted loops in the screw orientation. In-situ experiment in magnesium at 300 K by Couret and Caillard (1985).

Fig. 6.34 Dislocation multiplication, by formation and expansion of an open loop. Magnesium deformed in situ in prismatic slip at 300 K. From Couret and Caillard (1985).


Several multiplication laws can be proposed:

• Increment of dislocation density $\delta\rho$ proportional to the area swept by the dislocations: this corresponds to the open- and closed-loop mechanism described above. This yields $\delta\rho \propto \rho v\,dt$, or, using Eq. (1) (the Orowan law), yields Eq. (52).

$$\delta\rho \propto \dot\varepsilon\,dt \tag{52}$$

• Natural loop expansion: the length increment of an individual dislocation loop is proportional to its traveling distance. This yields $\delta\rho \propto N v\,dt \propto \frac{\dot\varepsilon}{R}\,dt$, where $N \propto \rho/R$ is the number of loops per unit volume and $R$ is their average radius. If the geometry of glide is such that $R$ remains constant but inversely proportional to stress, the expression proposed by Alexander and Haasen (1968) for semiconductors is obtained, i.e., Eq. (53).

$$\delta\rho \propto \dot\varepsilon\,\tau\,dt \tag{53}$$

• Multiplication at Frank–Read sources: if dislocations move at a constant velocity, and if $N$ is the number of dislocations emitted by the source, with respective radii of curvature $R_1 \propto 1/\tau$, $R_2 = 2R_1$, $R_3 = 3R_1$, etc., the dislocation density can be expressed as $\rho \propto R_1 + R_2 + \cdots \propto R_1 N^2/2$, whence $N = (2\rho/R_1)^{1/2}$. Then, the same calculation as above yields the expression already proposed by Moulin et al. (1999), i.e.: $\delta\rho \propto N v\,dt \propto \frac{N}{\rho}\,\dot\varepsilon\,dt$, and Eq. (54).

$$\delta\rho \propto \dot\varepsilon\left(\frac{\tau}{\rho}\right)^{1/2} dt \tag{54}$$

Many more expressions could be proposed as well, depending on the actual situ- 
ation observed in the electron microscope. 
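
The step leading to the Frank–Read multiplication law, Eq. (54), can be written out explicitly. The derivation below only rearranges the relations stated above (radii $R_k = kR_1$ with $R_1 \propto 1/\tau$, and the Orowan law $\dot\varepsilon = \rho b v$); the large-$N$ approximation of the sum is the only added assumption.

```latex
\begin{align*}
\rho &\propto \sum_{k=1}^{N} R_k = R_1 \sum_{k=1}^{N} k = R_1\,\frac{N(N+1)}{2}
      \approx R_1\,\frac{N^2}{2}
      \quad\Longrightarrow\quad
      N \approx \left(\frac{2\rho}{R_1}\right)^{1/2}, \\
\delta\rho &\propto N v\,dt = N\,\frac{\dot\varepsilon}{\rho b}\,dt
      \propto \left(\frac{2}{R_1\,\rho}\right)^{1/2} \dot\varepsilon\,dt
      \propto \left(\frac{\tau}{\rho}\right)^{1/2} \dot\varepsilon\,dt .
\end{align*}
```

The multiplication rate per unit strain therefore decreases as the density builds up, in contrast to Eqs. (52) and (53).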


6.3.1.2 Dislocation Exhaustion and Annihilation 

Dislocation storage results in a decrease of mobile dislocation density, $\rho_m$, also called exhaustion. Exhaustion tends to increase the effective stress $\tau_{eff}$ (i.e., the thermally activated component of the total stress defined in Section 6.1), because any decrease of $\rho_m$ must be compensated by an increase of dislocation velocity $v$, in order to keep the applied strain rate $\dot\varepsilon$ constant [Eq. (1)]. At low strain and low dislocation density, exhaustion can also be compensated by the activation of new sources, which requires an additional increase in applied stress (see the early part of Section 6.3.1). This effect, and the variation of internal stress $\tau_i$ (which will be studied in Section 6.3.2), contribute to strain hardening.

Dislocation storage can result from interactions with extrinsic obstacles such as 
other dislocations or cell walls, from cross-slip (in the case of Ni3Al; cf. Section 
6.4.1.2), or from solute diffusion (Section 6.2.2.3). As discussed in Section 6.4.1, 
the two latter processes, which are thermally activated, can be at the origin of 
stress anomalies. 

Several empirical exhaustion laws can be used, for example:

$$\delta\rho \propto -\rho\,dt$$

(constant probability of locking per unit time), or:

$$\delta\rho \propto -\rho v\,dt \propto -\dot\varepsilon\,dt$$

(probability of dislocation locking proportional to the traveling distance).

The first expression can be used to describe thermally activated dislocation storage by cross-slip in Ni3Al (Section 6.4.1.2), and the second corresponds to dislocation locking on fixed obstacles, as in stage II work-hardening in fcc metals and alloys (Section 6.3.3).

On the other hand, dislocation annihilation can take place by cross-slip or climb, at a rate given by:

$$\delta\rho \propto -\dot\varepsilon\sqrt{\rho}\,dt$$

(probability of dislocation annihilation proportional to the traveling distance, with additional effect of the annihilation stress between neighboring dislocations of opposite signs).

Here again, it is seen that different laws can be used, depending on the exact 
mechanisms involved. 
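
To see how such laws compete, the sketch below combines one multiplication law, $\delta\rho \propto \dot\varepsilon\,dt$ [Eq. (52)], with the annihilation law $\delta\rho \propto -\dot\varepsilon\sqrt{\rho}\,dt$, and integrates the density as a function of strain. Combining precisely these two laws, applying them to a single density, and the rate constants `K_MULT` and `K_ANN` are illustrative assumptions, not a model proposed in the text. Under these assumptions the density saturates at $\rho_s = (k_{mult}/k_{ann})^2$.

```python
import math


def evolve_density(rho0, k_mult, k_ann, strain_end, steps=2000):
    """Euler integration of d(rho)/d(strain) = k_mult - k_ann * sqrt(rho)."""
    d_eps = strain_end / steps
    rho = rho0
    for _ in range(steps):
        rho += (k_mult - k_ann * math.sqrt(rho)) * d_eps
    return rho


K_MULT = 1e15  # multiplication constant, m^-2 per unit strain (hypothetical)
K_ANN = 1e8    # annihilation constant, m^-1 per unit strain (hypothetical)

rho_saturation = (K_MULT / K_ANN) ** 2  # density at which both terms cancel
rho_final = evolve_density(1e12, K_MULT, K_ANN, strain_end=2.0)
```

Other pairings of the laws listed above give different evolution curves, which is why the choice must be guided by the mechanisms actually observed.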


6.3.2 
Dislocation-Dislocation Interaction and Internal Stress: the Taylor Law 


At high strain, dislocation multiplication and storage increase the internal stress $\tau_i$ according to the Taylor law, Eq. (55a), where $\rho$ is the total dislocation density ($1/\sqrt{\rho} = d$, average dislocation length or spacing), and $\alpha_i$ ranges between 0.2 and 0.5.

$$\tau_i = \alpha_i\,\mu b\sqrt{\rho} \tag{55a}$$

This law originates from the long-range interactions between moving dislocations and all other (mobile or stored) ones. These interactions are of two kinds:

• When two parallel dislocations are close to each other, they are subjected to an elastic interaction stress of the order of $\mu b/2\pi d$ (Eq. (55b); see Fig. 6.35a).

$$\tau_i \approx \frac{\mu b}{2\pi d} = \frac{1}{2\pi}\,\mu b\sqrt{\rho} \tag{55b}$$

• When gliding dislocations intersect “forest” dislocations, they are often pinned by the formation of attractive junctions (Fig. 6.35b). They can bow out and escape when their radius of curvature, $R$, is small enough to reach the breaking angle, $\theta$, necessary to recombine the junction. For $\theta = 0$, the corresponding internal stress is $\tau_i \approx T/bR$, with $R \approx d/2$ and $T$ (line tension) $\approx \tfrac{1}{2}\mu b^2$, whence Eq. (55c) results. Smaller values of $\tau_i$ are obtained for more realistic values of $\theta$.

$$\tau_i \approx \frac{\mu b}{d} = \mu b\sqrt{\rho} \tag{55c}$$

Fig. 6.35 Schematic description of the origin of internal stresses: (a) long-range elastic interaction; (b) forest mechanism.


These various interactions have been calculated in detail by Saada (1960) and Schöck and Frydman (1972) using elasticity theory. More recently, all the possible situations have been simulated at the mesoscopic scale by Wickham et al. (1999), Shenoy et al. (2000), and Madek et al. (2000). They confirm the validity of the Taylor law and show that the stability of junctions (which determines the internal stress) is independent of the core structure of dislocations. Madek et al.’s calculations also confirm that $\tau_i$ depends essentially on the average dislocation density, not on its spatial arrangement. As a matter of fact, the proportionality coefficient $\alpha$ is found to be only 10% lower when dislocations are rearranged from a uniform distribution to the inhomogeneous structure of cells and walls observed in experiments. The origin of this surprising property will be discussed in Section 6.3.3.1.
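
For orders of magnitude, the Taylor law $\tau_i = \alpha_i\mu b\sqrt{\rho}$ can be evaluated directly. The sketch below uses roughly copper-like values for the shear modulus and Burgers vector and a coefficient $\alpha_i = 0.3$ inside the quoted range of 0.2 to 0.5; these numbers and the function name are illustrative assumptions, not data from the text.

```python
import math


def taylor_stress(rho, mu, b, alpha):
    """Taylor law, Eq. (55a): internal stress (Pa) for a dislocation density rho (m^-2)."""
    return alpha * mu * b * math.sqrt(rho)


MU = 42e9      # shear modulus, Pa (approximate value for copper, assumed)
B = 0.256e-9   # Burgers vector, m (approximate value for copper, assumed)
ALPHA = 0.3    # within the 0.2-0.5 range quoted for alpha_i

# Internal stress in MPa for three dislocation densities
stresses_mpa = {
    rho: taylor_stress(rho, MU, B, ALPHA) / 1e6
    for rho in (1e12, 1e13, 1e14)
}
```

A hundredfold increase in density raises the internal stress tenfold, from a few MPa to a few tens of MPa with these values.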

At low strain and low average dislocation density, Eq. (55c) gives the stress necessary to activate a source as a function of its length, $d$. However, since the lengths of potential sources are generally not equal to the average value $1/\sqrt{\rho}$, the proportionality coefficient of the Taylor law adapted to this situation can be different. This describes “exhaustion hardening”, as opposed to “internal stress hardening” operating at higher strain, and considered above. An example of exhaustion hardening is the pre-yielding of bcc metals, where shorter and shorter edge segments trailing screw dipoles must be activated, at the expense of an increasing stress. Another example is the origin of yield stress anomalies, treated in Section 6.4.1. Exhaustion hardening and internal stress hardening are not fundamentally different processes, however, and a transition occurs between them when the dislocation structure is sufficiently homogeneous to have a critical source of size $d = 1/\sqrt{\rho}$.

Fig. 6.36 Experimental evidence for the Taylor law in copper at room temperature. Data collected by Mecking and Kocks (1981).

Sometimes, the total stress $\tau = \tau_{eff} + \tau_i$ can be written in the same form, i.e., as Eq. (55d), where $\alpha = \alpha_{eff} + \alpha_i$ (Fig. 6.36).

$$\tau = \alpha\,\mu b\sqrt{\rho} \tag{55d}$$

This form is justified in case of forest mechanisms where $\tau_{eff} = \alpha_{eff}\,\mu b\sqrt{\rho}$ (cf. Section 6.2.3).


6.3.3 
Hardening Stages in fcc Metals and Alloys 


Most studies of hardening have been carried out in fcc metals and alloys. However, the results should be transposable to bcc metals, which exhibit the same stages as fcc ones (see, e.g., Kuhlmann-Wilsdorf 2002). The dislocation density and corresponding deformation stress [given by Eq. (55a)] increase with increasing deformation, as a result of the dislocation multiplication and storage described in Sections 6.3.1.1 and 6.3.1.2, respectively. Hardening is defined by the coefficient $\theta = d\tau/d\varepsilon$, often plotted as a function of stress.

Reviews of strain hardening in fcc metals and alloys have been published by 
Basinski and Basinski (1979) and by Nabarro (1989), and more recently by several 
authors in Volume 11 of Dislocations in Solids devoted to work-hardening, and by 
Kocks and Mecking (2003). Some important conclusions on this complex topic 
are discussed below. 

Fig. 6.37 The different strain-hardening stages in fcc metals.

Hardening is usually divided into several stages, observed at increasing strain, and described schematically in Fig. 6.37. Stage I of “easy glide” is observed at the beginning of the deformation of single crystals in single slip. The corresponding value of hardening coefficient $\theta$ is fairly low, on account of the weak elastic interaction stress given by Eq. (55b). TEM observations reveal a homogeneous distribution of dipoles in this stage. Stage II is observed as soon as multiple slip forms a three-dimensional dislocation structure with junction reactions, building tangles that subsequently evolve to cell walls surrounding cells (see Fig. 6.40, below). The corresponding deformation stress is then described by Eq. (55c). It is characterized by a constant value of $\theta$, i.e., a linear increase of stress with strain (“linear hardening”). Stage III starts when $\theta$ decreases with increasing strain and stress (Fig. 6.37). The onset of stage III is strongly dependent on temperature and strain rate, in such a way that stage II can disappear at high temperatures. Lastly, stage IV corresponds to a stabilization of $\theta$ at a fairly low value. Stages II to IV are now described in some more detail.


6.3.3.1 Stage II (Linear Hardening)

The strain-hardening coefficient in stage II has a more or less constant value, $\theta_{II} \approx \mu/200$, which is almost independent of material, temperature, and strain rate (within a factor of 2). Figure 6.38 shows the relative variation of the Taylor factor $\alpha$, defined by Eq. (55d), as a function of temperature. The rapid decrease shows that the total flow stress involves a thermally activated component, $\tau_{eff}$, proportional to $\alpha_{eff}$, which depends on stacking fault energy. Since the Cottrell–Stokes law is verified by experiments, the ratio $\tau_{eff}/\tau_i = \alpha_{eff}/\alpha_i$ is constant along deformation curves. Then, the Taylor law which is valid for $\tau_i$ is also valid for $\tau_{eff}$, which demonstrates that the forest mechanism is active. This conclusion is corroborated by Fig. 6.39 showing the “Haasen plot” of the inverse of the activation volume (deduced from strain-rate jumps) as a function of the applied stress corresponding to various amounts of deformation. It is linear at small stresses, which shows that the activation volume varies as expected from the forest mechanism (Section 6.2.3).

Fig. 6.38 Effect of thermal activation on the proportionality coefficient of the Taylor law. From Kocks and Mecking (2003).

Stage II is observed as long as the Cottrell–Stokes law is verified, i.e., as long as $\tau_{eff}/\tau_i$ remains constant. The flow stress in stage II is thus thermally activated (and controlled by the forest mechanism), although the corresponding strain-hardening coefficient is independent of temperature and strain rate.

The formation and the evolution of the cell structure shown in Fig. 6.40 result from a very complex mixture of multiplications, reactions, and annihilations of dislocations belonging to several slip systems. It could be the result of statistical fluctuations in an out-of-equilibrium system, according to Aifantis (1986) and Hähner (1996). The heterogeneous structure could also be driven by the decrease of the total elastic energy alone, according to Hansen and Kuhlmann-Wilsdorf (1986) and Kuhlmann-Wilsdorf (2002). As a matter of fact, the total elastic energy stored in the deformed crystal must decrease upon dislocation tangling, because the dislocation-line energy is proportional to the logarithm of the dislocation distance.

Fig. 6.39 Haasen plot of the inverse of the activation volume V as a function of stress, in silver. From Mecking and Kocks (1981).

The simplest explanation for dislocation patterning is based on the composite model of Mughrabi (see, e.g., Mughrabi and Ungar 2002). The deformed crystal is treated as a two-phase material with hard zones (tangles or walls) and soft zones (cell interiors) which are subjected to different local stresses. Since these zones must deform plastically at the same rate, a heterogeneous internal stress must develop, positive in the hard zones (forward stress), and negative in the soft ones (backward stress). Assuming now that local stresses are related to local dislocation densities according to the Taylor law [Eq. (55a)], the applied stress can be written as Eq. (56), where $\Delta\rho$ is the fraction of the initial dislocation density which is transferred from the soft zones (volume fraction $1 - K$) to the hard ones (volume fraction $K$).

$$\tau = \alpha\mu b\left[K\left(\rho + \frac{\Delta\rho}{K}\right)^{1/2} + (1 - K)\left(\rho - \frac{\Delta\rho}{1 - K}\right)^{1/2}\right] \tag{56}$$

The dislocation density in cell walls, $\rho + \Delta\rho/K$, is hereafter denoted $\rho_w$, and the corresponding dislocation-line length is $d_w = 1/\sqrt{\rho_w}$.

Assuming that the cell interior contains a negligible dislocation density, i.e., that $\Delta\rho = (1 - K)\rho$, the expression of the flow stress reduces to Eq. (57) and the dislocation density in cell walls reduces to $\rho_w = \rho/K$, whence Eq. (58) follows.

$$\tau = \alpha\mu b K^{1/2}/d \tag{57}$$

$$\frac{d_w}{d} = \sqrt{K} \tag{58}$$


The crossing of the wall-network, of mesh size $d_w$, requires a local stress $\tau_w = \alpha\mu b/d_w$, which, using Eqs. (57) and (58), requires a stress concentration factor $\tau_w/\tau = 1/K$. This stress concentration can be provided by pile-ups of dislocations emitted by sources (Fig. 6.40).
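
The algebra of the composite model is easy to verify numerically. The sketch below evaluates the two-zone flow stress $\tau = \alpha\mu b[K(\rho + \Delta\rho/K)^{1/2} + (1-K)(\rho - \Delta\rho/(1-K))^{1/2}]$ in the limit of empty cell interiors, $\Delta\rho = (1-K)\rho$, and checks that it reduces to $\alpha\mu b K^{1/2}\sqrt{\rho}$ and that the wall-crossing stress exceeds it by a factor $1/K$. Stresses are expressed in units of $\alpha\mu b$; the density and the wall fraction $K = 0.25$ are arbitrary illustrative values.

```python
import math


def composite_stress(rho, d_rho, k, alpha_mu_b=1.0):
    """Eq. (56): flow stress of the wall/cell composite, in units of alpha*mu*b."""
    wall_density = rho + d_rho / k
    # Guard against a tiny negative value caused by floating-point round-off.
    cell_density = max(rho - d_rho / (1.0 - k), 0.0)
    return alpha_mu_b * (k * math.sqrt(wall_density)
                         + (1.0 - k) * math.sqrt(cell_density))


rho, k = 1e14, 0.25            # average density (m^-2) and wall volume fraction (assumed)
d_rho = (1.0 - k) * rho        # all cell-interior dislocations transferred to the walls

tau = composite_stress(rho, d_rho, k)
tau_eq57 = math.sqrt(k) * math.sqrt(rho)   # Eq. (57), with d = 1/sqrt(rho)
rho_wall = rho + d_rho / k                 # density inside the walls
tau_wall = math.sqrt(rho_wall)             # alpha*mu*b / d_w, with d_w = 1/sqrt(rho_wall)
concentration = tau_wall / tau             # required stress concentration factor
```

With $K = 0.25$ the walls carry four times the average density, and a fourfold stress concentration is needed to cross them.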
In the case of cubic cells of size $D$ and wall thickness $w$, the volume fraction of hard zones is given by Eq. (59).

$$K \approx 3w/D \tag{59}$$

(*) Note that under such conditions, the cell interior can deform under a zero stress.

Fig. 6.40 Dislocation emitted by a source S, and crossing through cell walls. Note the spreading $\delta$ of the slip planes after wall crossing. In-situ experiment in aluminum, at 80 °C.


According to Kocks and Mecking (2003), this coefficient is not much lower than unity, because the proportionality factor of the Taylor law for a uniform dislocation distribution in stage I ($\tau = \alpha\mu b\sqrt{\rho}$) is only 10% higher than for a heterogeneous dislocation distribution in stages II and III ($\tau = \alpha K^{1/2}\mu b\sqrt{\rho}$).

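Using Eq. (1), the strain-hardening coefficient in stage II can be expressed as Eq. (60), where the increase in the average dislocation density with strain is $d\rho/d\varepsilon = 2/(b\Lambda)$, and $\Lambda/2$ is the dislocation mean free path.

$$\theta = \frac{d\tau}{d\varepsilon} = \frac{d}{d\varepsilon}\left(\alpha\mu b K^{1/2}\sqrt{\rho}\right) = \frac{\alpha}{2}\,K^{1/2}\,\frac{\mu b}{\sqrt{\rho}}\,\frac{d\rho}{d\varepsilon} \tag{60}$$

Then the resulting strain-hardening coefficient takes the very simple form of Eq. (61a).

$$\frac{\theta}{\mu} = \alpha K^{1/2}\,\frac{d}{\Lambda} \tag{61a}$$

However, since it is difficult to estimate the ratio $d/\Lambda$ directly, it is interesting to express the strain-hardening coefficient as Eq. (61b), where $D$ is the cell size.

$$\frac{\theta}{\mu} = \alpha K^{1/2}\,\frac{d}{D}\,\frac{D}{\Lambda} \tag{61b}$$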

Fig. 6.41 Dislocation cell size D versus average dislocation spacing in copper polycrystals strained to 10% at various temperatures. From Kocks and Mecking (2003).

As a matter of fact, experiments show that $d/D$ is more or less constant and of the order of 1/15 in fcc pure metals (Fig. 6.41), and the slip-line length $\Lambda$ is a few times larger than the cell size, $D$ (cf. Fig. 6.42, Mughrabi and Ungar, 2002). This yields $\theta/\mu \approx K^{1/2}/130$, which, considering that $K^{1/2}$ is slightly lower than unity, is in reasonably good agreement with the average experimental value (1/200).
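
The sensitivity of this estimate to its inputs can be explored with a short calculation of $\theta/\mu = \alpha K^{1/2}(d/D)(D/\Lambda)$. The sketch below brackets the result using $d/D = 1/15$ together with assumed ranges: $\alpha$ between 0.2 and 0.5 (the range quoted for the Taylor coefficient), $\Lambda/D$ between 2 and 4 as a reading of "a few", and a hypothetical wall fraction $K = 0.8$. None of these specific combinations is taken from the text.

```python
def theta_over_mu(alpha, k, d_over_cell, slip_over_cell):
    """Eq. (61b): theta/mu = alpha * sqrt(K) * (d/D) * (D/Lambda)."""
    return alpha * k ** 0.5 * d_over_cell / slip_over_cell


K_WALLS = 0.8          # hypothetical value giving sqrt(K) slightly below unity
D_OVER_CELL = 1 / 15   # d/D, from the experimental ratio quoted above

# Bracketing cases: weakest and strongest hardening within the assumed ranges
low = theta_over_mu(0.2, K_WALLS, D_OVER_CELL, 4.0)
high = theta_over_mu(0.5, K_WALLS, D_OVER_CELL, 2.0)
```

The bracket runs from roughly 1/335 to 1/67, so the experimental value of 1/200 lies well inside what the model allows.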

Fig. 6.42 Average slip length $\Lambda$ and cell size $D$ as a function of stress, in a single crystal of copper. From Ambrosi and Schwink (1978).

A constant value of $D/d \approx 15$ has been explained tentatively by Nabarro (2000), Kuhlmann-Wilsdorf (2002), and Kocks and Mecking (2003), on the basis of geometrical and energy considerations.

The experimental finding that $\Lambda$ is in the region of a few $D$ is more surprising and difficult to justify, because only two alternative situations should be met a priori: either wall crossing is impossible, and $\Lambda/D$ should be equal to unity, or wall crossing is possible, and $\Lambda/D$ could take any large value. Figure 6.40, which shows that the relationship “$\Lambda \approx$ a few $D$” is reproduced in an in-situ experiment in Al, yields an answer to this problem. The dislocations emitted by the source S can indeed cross through neighboring walls, as long as the required stress concentration (by a factor of $1/K$) can be provided by pile-up effects. This is easily realized close to the source when dislocations move in the same glide plane. However, dislocation reaction and cross-slip at walls result in their spreading in parallel slip planes, which progressively inhibits pile-up formation and subsequent wall crossing.


6.3.3.2 Stage III

In stage III, $\theta$ versus $\tau$ curves exhibit a more or less linear decrease which can be described by the Voce law (Fig. 6.37) stated in Eq. (62), where $\theta_0$ is of the order of the initial constant strain-hardening coefficient $\theta_{II}$, and where the limit stress $\tau_l$ strongly depends on strain rate and temperature.

$$\theta = \theta_0\left(1 - \frac{\tau}{\tau_l}\right) \tag{62}$$

Equations (60) and (61) show that the decrease of the strain-hardening coefficient $\theta$ results from the decrease of $\alpha$, the proportionality coefficient of the Taylor law, and from an increase of the dislocation mean free path, $\Lambda$. The transition between stages II and III is described by the generic term “dynamic recovery”. It can be observed on the Haasen plot of Fig. 6.39 that the linear relationship breaks down, which shows that $\tau_{eff}$ no longer increases at the same rate as $\tau_i$. In other words, a new thermally activated mechanism, which can no longer be described by the classical equations of the forest mechanism, allows an easier forest crossing (Mecking and Kocks 1981).
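
A linear decrease of the hardening coefficient with stress, $\theta = d\tau/d\varepsilon = \theta_0(1 - \tau/\tau_l)$ as in Eq. (62), implies an exponential approach of the flow stress to the limit stress: $\tau(\varepsilon) = \tau_l - (\tau_l - \tau_0)\exp(-\theta_0\varepsilon/\tau_l)$. The sketch below checks this closed form against a direct numerical integration. The starting stress `TAU_0` and the parameter values are hypothetical, chosen only for illustration (with $\theta_0$ of the order of $\mu/200$ for a copper-like modulus).

```python
import math


def voce_numeric(tau0, theta0, tau_l, strain_end, steps=10000):
    """Euler integration of d(tau)/d(strain) = theta0 * (1 - tau / tau_l)."""
    d_eps = strain_end / steps
    tau = tau0
    for _ in range(steps):
        tau += theta0 * (1.0 - tau / tau_l) * d_eps
    return tau


def voce_analytic(tau0, theta0, tau_l, strain):
    """Closed-form solution of the same equation."""
    return tau_l - (tau_l - tau0) * math.exp(-theta0 * strain / tau_l)


THETA_0 = 210.0  # MPa, about mu/200 for mu = 42 GPa (assumed)
TAU_L = 100.0    # limit stress, MPa (hypothetical)
TAU_0 = 20.0     # stress at the onset of stage III, MPa (hypothetical)

tau_num = voce_numeric(TAU_0, THETA_0, TAU_L, strain_end=1.0)
tau_exact = voce_analytic(TAU_0, THETA_0, TAU_L, 1.0)
```

The characteristic strain of the saturation is $\tau_l/\theta_0$, so a lower limit stress (higher temperature or lower strain rate) means an earlier departure from linear hardening.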

The transition between stages II and III is traditionally attributed to the onset 
of recovery by cross-slip. The main arguments in favor of this interpretation are 
based on slip-line observations, and on the strong dependence of the onset of 
stage III on stacking fault energy. However, Basinski (1968) reported experimen- 
tal observations of cross-slip operating long before the onset of stage III, from 
halfway through stage II to halfway through stage III. In addition, the similarity 
of stage III in fcc and bcc metals is difficult to reconcile with the cross-slip de- 
scribed in Section 6.2.5 which is restricted to fcc metals (Nabarro 1986). Several 
cross-slip and climb mechanisms have been compared by Nabarro (1989). 

Most authors consider now that cross-slip is necessary but not sufficient to ac- 
count for the whole of stage III. Under such conditions, dynamic recovery could 
be helped by the glide of dislocations in non-close-packed planes (Section 6.2.4.4), 
which has been observed abundantly at the end of stage III in copper (Anongba 
et al. 1993 — see also Fig. 6.47, below), and which offers interesting possibilities of 
decreasing the strength of attractive junctions. Indeed, it can be shown that the 
only way to move three reacting dislocations in three glide planes intersecting 
along a common direction is to push one of them in a {100} plane. For instance, 
many reaction segments in the form of Lomer—Cottrell locks, which are formed 
in stage II (Basinski and Basinski 1979), have been shown to move in {100} 
planes by Karnthaler (1978). A similar mechanism has also been observed by 
Caillard and Martin (1982) in aluminum during creep. 


6.3.3.3 Stage IV 

Stage IV corresponds to the transformation of thick cell walls into thin regular dislocation networks called sub-boundaries. The deformation stress is stabilized at a value slightly higher than $\tau_l$, which more or less corresponds to the so-called stage II (or steady-state stage) of creep. Dislocation climb is expected to play an
important role, at least in the formation and evolution of sub-boundaries. Inten- 
sive glide in non-close-packed planes (Section 6.2.4.4) has also been observed at 
the onset of stage IV in copper (Anongba et al. 1993). 


6.3.3.4 Strain-Hardening in Intermetallic Alloys 

Several intermetallic alloys such as Ni3Al and TiAl exhibit the so-called yield 
stress anomaly, which will be discussed in Section 6.4.1 below. In these materials, 
strain hardening can be much higher than in fcc metals, and values as high as 
μ/10 to μ/2 have been reported. Viguier (2003) has shown that Eq. (55a) (the
Taylor law) remains valid, which means that high values of strain hardening θ
are related to short dislocation mean free paths Λ. At low strain, the proportionality
coefficient α of the Taylor law can be higher than expected. This corresponds
to the transition between exhaustion and internal stress hardening discussed in 
Section 6.3.2. 


6.4 
Complex Behavior 


Many materials exhibit a complex behavior which can be related to mixtures of 
thermally activated glide, dislocation multiplication, hardening, and recovery pro- 
cesses. In that sense, stage III of hardening could be considered as the first exam- 
ple of a complex behavior. We describe in this section a few other examples, the 
list of which is of course nonexhaustive. 


6.4.1 
Yield Stress Anomalies 


Many alloys and intermetallic alloys, in their intermediate temperature range, ex- 
hibit an anomalous increase of yield stress with increasing temperature, called 
yield stress anomaly (YSA) (Caillard 2001). This effect accounts for good mechanical
properties at high temperatures because it compensates for the natural de-
crease in yield stress at increasing temperature. YSAs originate from several ther- 
mally activated mechanisms of dislocation locking. In contrast to what occurs in 
the locking-unlocking mechanisms described in Section 6.2.4.2, unlocking is assumed
to be impossible or at least very difficult. This results in a small dislocation
mean free path Λ decreasing with increasing temperature. According to Eq.
(61), this accounts for a high strain-hardening coefficient θ increasing with increasing
temperature. Note, however, that the proportionality coefficient α of Eq.
(61) may be different from that expected from the Taylor law, because hardening
at low strain is of the “exhaustion type” (see Sections 6.3.2 and 6.3.3.4). Lastly,
high cumulative strain hardening results in an increase in flow stress at given
strain (0.2% for the yield stress). YSAs can be classified into two groups, with two
different origins for the thermally activated locking mechanism, which are described
below.


6.4.1.1 Dynamic Strain Aging 

Several alloys exhibit a small YSA associated with stress instabilities and negative
values of the strain-rate sensitivity of the stress. According to the mechanism described
in Section 6.2.2.3, there is a range of dislocation velocities where dislocations are
subjected to an increasing friction force at increasing temperature, because more
solute atoms can diffuse to them and hinder their motion (Fig. 6.6). This effect,
which accounts for stress instabilities, can also induce YSAs. An example of this
behavior in stainless steel is shown in Fig. 6.43. Other examples are CuAl alloys
(see Fig. 6.3), L1₂-stabilized titanium trialuminide alloys (see Caillard 2001; Caillard
and Couret 1996), and Fe3Al alloys (Molénat et al. 1998).

Fig. 6.43 Yield stress anomaly in stainless steel. The bar corresponds
to the observation of stress instabilities (or jerky flow). From Kashyap
et al. (1988).


6.4.1.2 Cross-Slip Locking 
The most remarkable example of YSA as a result of cross-slip locking is provided
by Ni3Al. In this intermetallic alloy with the L1₂ structure, superdislocations are
dissociated into two superpartials according to [110] = 1/2[110] + APB + 1/2[110],
where APB denotes a ribbon of antiphase boundary. These dislocations
glide freely in {111} planes. However, they rapidly lock themselves by
thermally activated cross-slip and change of APB plane from {111} to {001} where
the APB energy is lower. The locked configuration is called a complete Kear-Wilsdorf
lock when the APB is completely in the {001} plane, and an incomplete
Kear-Wilsdorf lock when the amount of APB in the {001} plane is very small.
The resulting high strain-hardening coefficients, which are at the origin of the
YSA, have been measured by several authors. They can be as high as θ/μ ≈ 1/2 at
small strain, according to Neveu (1991). Figure 6.44 shows the yield stress, and
the strain hardening, at a larger strain (3%), as a function of temperature. It can
be noted that θ increases up to the temperature T(θ_max) and decreases above it.
This shows that dislocation locking becomes less efficient for temperatures
higher than T(θ_max), and stresses higher than τ(θ_max). In other words, stresses
higher than τ(θ_max) can release a substantial amount of locked screw dislocations.
In that sense, the unlocking stress τ(θ_max) is similar to the yield stress in bcc
metals for which there is a transition between microyielding (locked screws) and
macroyielding (mobile screws). The unlocking stress is characteristic of the obstacle
involved. Measurements by Conforto et al. (2005) showed that the obstacles are
incomplete Kear-Wilsdorf locks, for which the width of the APB in the {001} plane
is of the order of one interatomic distance.

Fig. 6.44 Temperature dependence of the deformation stress (τ) and
work-hardening coefficient (θ) in Ni3(Al, 1.5% Hf). From Ezz and
Hirsch (1994).

The formation of incomplete Kear-Wilsdorf locks, and their breaking above 
T(θ_max) (which leads to the so-called “APB jumps”), have been observed in deformed
specimens and in experiments in situ (see Caillard 2001; Conforto et al.
2005). 


6.4.2 
Fatigue 


A fatigue test involves a series of small amounts of deformation by tension and 
compression over a very large number of cycles. Although the average final defor- 
mation is zero at the end of the test, a very large cumulative strain is at the origin 
of irreversible damage of the sample. A steady state with stress amplitude τ_s
(saturation stress) is reached after a certain number of cycles.

Figure 6.45 is the schematic representation of the variation of the saturation 
stress as a function of the strain amplitude. A homogeneous dislocation structure 
is built in region A, whereas deformation concentrates in the so-called “persistent 
slip bands” (PSBs), in region B. The density of PSBs increases with increasing 
strain amplitude in region B, until deformation is again homogeneous in region 
C. PSBs play a very important role in the lifetime of fatigued samples because they
are privileged sites for crack initiation. The main aspects of fatigue are developed
below. Many more details can be found in the review paper of Basinski and Basinski
(1992).

Fig. 6.45 The three stages of fatigue.


6.4.2.1 Microstructure of Fatigued Metals and Alloys 
Most observations have been carried out in the PSBs developing in region B of 
Fig. 6.45. PSBs are slices of material parallel to the primary glide plane. They 
are divided into channels bounded by walls in which screw dislocations move 
back and forth by glide and cross-slip (Fig. 6.46). 

One can find a very good correspondence in several materials between the saturation
stress, τ_s, and the corresponding channel width, D. It can be expressed by
τ_s = 1.5 to 2 τ_bend, where τ_bend = 2T/(bD) is the stress necessary to bend a screw
dislocation in a channel [the same equation as (55c)] (Basinski and Basinski 1992). Since
τ_s > τ_bend, screw dislocations have the possibility of overcoming obstacles between
the walls, in agreement with TEM observations of Mughrabi (1983) showing that
they do interact with many dipoles and debris.
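
As a rough numerical illustration of the relation between τ_s and τ_bend, the following sketch evaluates τ_bend = 2T/(bD) and the corresponding range 1.5 to 2 τ_bend. The line-tension estimate T ≈ μb²/2 and all numerical inputs (shear modulus, Burgers vector, channel width) are illustrative assumptions chosen to be copper-like; they are not values quoted in the text.

```python
def bend_stress(mu, b, channel_width, line_tension=None):
    """Stress (Pa) needed to bow a screw dislocation across a channel of width D.

    Uses tau_bend = 2*T/(b*D). If no line tension T is supplied, the common
    rough estimate T ~ mu*b**2/2 is assumed (an assumption, not from the text).
    """
    if line_tension is None:
        line_tension = 0.5 * mu * b**2
    return 2.0 * line_tension / (b * channel_width)


# Hypothetical copper-like inputs.
mu = 42e9        # shear modulus, Pa
b = 0.256e-9     # Burgers vector, m
D = 1.2e-6       # channel width, m

tau_bend = bend_stress(mu, b, D)
tau_s_range = (1.5 * tau_bend, 2.0 * tau_bend)
print(f"tau_bend = {tau_bend / 1e6:.2f} MPa")
print(f"expected tau_s between {tau_s_range[0] / 1e6:.2f} and {tau_s_range[1] / 1e6:.2f} MPa")
```

With the assumed line tension, τ_bend reduces to μb/D, so the estimate scales inversely with the channel width.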

Walls are made of dipoles of height h and average distance d. Although they
were too small to be observed by conventional TEM, the dipoles were first assumed
to be vacancy-type. Their density was estimated from resistivity measurements
(Basinski and Basinski 1992), and the stress 2T/bd necessary to cross
through the walls was accordingly considered to be a few times τ_s. More recent
measurements by weak-beam TEM have shown that (a) vacancy and interstitial
dipoles are both present in the walls, and (b) dipole sizes and densities vary only
weakly with temperature and stress (Tippelt et al. 1997). The minimum dipole
height seems to increase slightly with increasing temperature, which gives some
indication in favor of a climb-controlled recovery mechanism. However, all other
quantities, average and maximum dipole heights, and average dipole distances
(d ≈ 8-10h), vary too little to be related to the saturation stress τ_s.
Under such conditions, the stress necessary to cross through the walls is considered
to be larger than the saturation stress, but not proportional to it.

Fig. 6.46 The microstructure of fatigue.


6.4.2.2 Comparison with Stages II and III of Monotonic Strain Hardening

It has been noted that the saturation stress is very close to the stress τ_III at which
stage III begins in tension (Section 6.3.3.2) (Basinski and Basinski 1992). This 
fairly good correspondence is illustrated in Fig. 6.47. 

The activation energy varies between 0.2 eV at 77 K and 0.7 eV at 300 K in cop- 
per, which a priori excludes climb as a rate controlling process (Basinski and Ba- 
sinski 1992). Activation volumes have been measured by means of strain-rate 
jumps by Kaschner and Gibeling (2002). When inserted in a Haasen plot (Fig. 
6.48), they follow the same variation as in stage III of hardening, which corrobo- 
rates the link between the two mechanisms. 
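
The activation volumes mentioned above are deduced from the stress change that accompanies a strain-rate jump. The sketch below assumes the usual operational definition V = k_B T ln(rate_after/rate_before)/Δτ, which is not restated in this section; the jump parameters and the Burgers vector are purely hypothetical inputs.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K


def activation_volume(temperature, rate_before, rate_after, stress_before, stress_after):
    """Apparent activation volume (m^3) from a single strain-rate jump.

    Assumed definition: V = k_B * T * ln(rate_after / rate_before) / delta_tau,
    with shear stresses in Pa and temperature in K.
    """
    d_ln_rate = math.log(rate_after / rate_before)
    d_stress = stress_after - stress_before
    return K_B * temperature * d_ln_rate / d_stress


# Hypothetical jump: strain rate raised tenfold, flow stress rises by 0.5 MPa at 300 K.
b = 0.256e-9  # Burgers vector in m (illustrative copper-like value)
V = activation_volume(300.0, 1e-4, 1e-3, 30.0e6, 30.5e6)
print(f"V = {V:.3e} m^3 = {V / b**3:.0f} b^3")
```

Repeating this evaluation at several stress levels gives the set of points that would be entered in a Haasen-type plot.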


6.4.2.3 Intrusions, Extrusions and Fracture 

Many extrusions and intrusions are visible along the emerging PSBs (see the
schematic representation in Fig. 6.46). They are usually interpreted as the result
of the partly irreversible motion of screw segments subjected to cross-slip. The
role of the vacancies generated by the annihilation of the dipoles inside the walls
is unclear. Long-range diffusion of vacancies may account for extrusions being
more pronounced than intrusions. However, Basinski and Basinski (1992) concluded
that there is no direct evidence for the existence of single vacancies
formed as the result of fatigue. They also pointed out that intrusions are more
difficult to observe than extrusions, and that vacancies are not sufficiently mobile
to account for the formation of intrusions and extrusions at very low temperatures.

Fig. 6.47 Comparison between the saturation stress in fatigue, τ_s, and
the onset of stage III, in copper. Non-close-packed (NC) glide is
observed in the highest temperature range. From Basinski and Basinski
(1989) and Anongba et al. (1993).

Fig. 6.48 Haasen plot of the activation volume of fatigue in copper, and
comparison with data from monotonic tests. From Kaschner and
Gibeling (2002).

In any case, intrusions are responsible for crack initiation leading to failure. In
contrast, the homogeneous deformation taking place in region A of Fig. 6.45
inhibits sample fracture. More details can be found in the “Proceedings of Fundamentals
of Fracture” published in Phil. Mag. A 82(17-18) (2002).


6.4.2.4 Conclusions 

Fatigue appears to be controlled by the motion of screw dislocations in the chan- 
nels rather than by the interaction of the edge segments with the walls. The role 
of dipole recovery by climb and point defect diffusion is unclear, especially at low 
temperature. The strong connection between the saturation stress in fatigue and 
the onset of stage III in monotonic tension indicates that the cross-slip and/or 
forest mechanism should be rate-controlling. 


6.4.3 
Strength of Nanocrystalline Alloys and Thin Layers 


Nanocrystalline metals and alloys have been studied extensively in recent years 
because of their ultra-high deformation stress. The Hall-Petch law, which accounts
for the strength of polycrystals with grain sizes down to 20 nm, is introduced
in Section 6.4.3.1. Then, the possible origins of the Hall-Petch law breakdown,
which occurs at very small grain sizes, are discussed in Section 6.4.3.2. Results
are transposable to thin layers although they have a reduced size in one direction.


6.4.3.1 The Hall-Petch Law (Grain Size D > 20 nm)

It has been known for a long time that the yield stress of a polycrystal, τ, varies as a
function of grain size D according to the Hall-Petch law [Eq. (63), where k and τ_0
are constants for a given material].


\[ \tau(D) = \tau_0 + \frac{k}{\sqrt{D}} \tag{63} \]


Different processing routes (sintering, large deformation, etc.) generate different
types of grain boundaries (low-angle sub-boundaries, various types of high-angle
boundaries, twins, etc.) which offer different resistances to dislocation
movements and hence give rise to different values of k (see, e.g., Hansen 2004).
It has been shown that the plasticity results mostly from dislocation movements
(see, e.g., Kumar et al. 2003: in Ni with D as low as 30 nm). Under such
conditions, the Hall-Petch law was first explained in terms of dislocation pile-ups
emitted by sources and blocked against grain boundaries. However, since this 
model cannot describe large-grain materials where long dislocation pile-ups have 
never been observed, a more general explanation has been proposed on the basis 
of the strain-hardening laws discussed in Section 6.3.3 (Saada 2005; Lefebvre et 
al. 2005). The results are summarized below. 


The increase in dislocation density with increasing strain is given by
dρ/dε = 1/(bΛ) [Eqs. (60-61)], where Λ is the mean free path of mobile dislocations.
Considering that Λ at yield is determined by the grain size D, we obtain, after integration:


\[ \rho = \frac{\varepsilon}{bD} + \mathrm{const} \]


and, using the Taylor law [Eq. (55d)],

\[ \tau = \alpha\mu b\sqrt{\rho} + \mathrm{const} = \frac{\alpha\mu\sqrt{\varepsilon b}}{\sqrt{D}} + \mathrm{const} \]


This expression is similar to Eq. (63) with k = αμ√(εb), which, taking ε = 0.2%, has
the right order of magnitude. It should be noted that Λ can be slightly larger than
the grain size D in nanomaterials processed by heavy deformation, where grains
are often limited by walls and low-angle sub-boundaries (cf. Section 6.3.3.1). This
probably explains why the corresponding values of k can be smaller than expected.
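
The estimate of the Hall-Petch slope derived above can be evaluated numerically. The sketch below computes k = αμ√(εb) and the resulting τ(D) of Eq. (63); the values chosen for α, μ, b, and τ_0 are illustrative assumptions (copper-like), not data from the text, and only ε = 0.2% is taken from the discussion above.

```python
import math


def hall_petch_slope(alpha, mu, b, strain):
    """Hall-Petch coefficient k = alpha * mu * sqrt(strain * b), in Pa * m**0.5."""
    return alpha * mu * math.sqrt(strain * b)


def yield_stress(grain_size, tau_0, k):
    """Eq. (63): tau(D) = tau_0 + k / sqrt(D)."""
    return tau_0 + k / math.sqrt(grain_size)


# Illustrative (assumed) parameters.
alpha = 0.3      # Taylor coefficient
mu = 42e9        # shear modulus, Pa
b = 0.256e-9     # Burgers vector, m
strain = 0.002   # 0.2% offset strain defining the yield stress
tau_0 = 10e6     # friction stress, Pa (hypothetical)

k = hall_petch_slope(alpha, mu, b, strain)
for D in (10e-6, 1e-6, 100e-9, 30e-9):
    print(f"D = {D * 1e9:8.0f} nm  ->  tau = {yield_stress(D, tau_0, k) / 1e6:6.1f} MPa")
```

The same number is obtained by inserting ρ = ε/(bD) directly into the Taylor law, which is a convenient check on the algebra.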


6.4.3.2 Hall-Petch Law Breakdown (Grain Size D < 20 nm) 

Although the experimental results are not always fully convincing, the Hall- 
Petch law seems to break down below a critical grain size of the order of 20 nm 
(Takeuchi 2001; Zhao et al. 2003; Nieh and Wang 2005). A transition toward an 
inverse behavior is observed instead. Obviously, this results from the increasing 
fraction of atoms that are contained in grain boundaries. Considering that the 
grain boundary thickness is of the order of 0.5 nm, Saada (2005) indeed estimates 
this fraction as 10% for D = 30 nm. Assuming that grain boundaries behave like 
amorphous materials, Nieh and Wang (2005) interpret the properties of nanocrystalline
Be-B alloys (in particular their lack of ductility) by a progressive transition
toward a 100% amorphous material. 

Although some dislocation activity is still sometimes reported, it is clear that 
deformation arises mostly from grain boundary sliding (Markmann et al. 2003; 
Kumar et al. 2003). However, the term “sliding” should not be interpreted in the 
sense of a pure shear mechanism, because it is controlled by grain boundary self- 
diffusion. Considering that the grain boundary self-diffusion activation energy is 
smaller than the bulk one, substantial amounts of plasticity can be achieved by 
grain boundary sliding even at yield and at room temperature. Simulations taking 
all these effects into account have been carried out by Derlet et al. (2003). 


6.4.4 
Fracture 


The fracture behavior of alloys changes from brittle to ductile as the temperature 
rises. This is a key issue in many structural applications. Fracture is ductile as 
soon as dislocations can be emitted at the crack front, thus blunting the crack 
tip, releasing the stress concentration, and slowing down the crack propagation. 
Rice and Thomson (1974) made the first attempts to model dislocation
activity at the crack tip. They considered the different forces acting on a dislocation
emitted at the crack surface: the crack stress field, the surface tension force
corresponding to the creation of a step, and the image force repelling the dislocation.
The authors concluded that materials are the most ductile when they have a
small μb/γ (where γ is the surface energy) and a wide dislocation core. It is clear,
however, that this process is helped by thermal activation via easier dislocation
nucleation at the free surface and easier propagation away from the crack tip.





Fig. 6.49 Dislocation emission from a crack tip, in Ti3Al. In-situ 
experiment at 200 °C. From Legros et al. (2006). 
Dislocation emission is also enhanced at crack tip heterogeneities (Hirsch
et al. 1989). Dislocation emissions in complex but realistic situations have 
been calculated by Schöck (1996) and Xu and Argon (1997). Fracture is easily ini- 
tiated at surface heterogeneities such as extrusions created in fatigue (Section 
6.4.2.3), voids appearing in highly deformed polycrystals and multiphased materi- 
als, etc. Experiments in situ can yield information on the emission of dislocations 
at crack tips (Caillard et al. 1999; Legros et al. 2006). One example is shown in 
Fig. 6.49. This experiment shows that both dislocation emission from the crack 
tip and dislocation motion in the tangled area close to the crack tip are difficult 
processes. 


6.4.5 
Quasicrystals 


The mechanical properties of quasicrystals are different from those of ordinary 
alloys in several aspects. First, they are highly brittle up to three-quarters of their 
melting temperature and become very ductile above. Second, the stress-strain 
curves exhibit a pronounced yield drop followed by a stage of very low (sometimes
negative) strain hardening. Third, although it has been well known since the
work of Wollgarten et al. (1993) that deformation proceeds by dislocation motion,
specific mechanisms are expected on account of the lack of translational periodicity.

The structure of quasicrystals has been described in Chapter 2 (Section 2.6). Let 
us just recall what is necessary to understand the structure of dislocations. Figure 
6.50 shows a two-dimensional aperiodic tiling defined by the cut of a pile-up of 
cubes along an irrational plane close to {111}, where each tile is the projection 
of one face of an emerging cube. Important to note is the presence of wavy bands 
of constant thickness (one is highlighted on Fig. 6.50b) which correspond to the 
emergence of the {100} planes of the three-dimensional cubic structure. These 
bands are equivalent to the parallel (but not equidistant) dense planes of the qua- 
sicrystalline structure. The real icosahedral quasicrystalline structure is defined in 
the same way, by cut and projection of a six-dimensional hypercubic lattice. It 
contains corrugated “dense planes” which are equivalent to the wavy bands of 
Fig. 6.50(b). The space which is lost during the projection (i.e., the direction per- 
pendicular to the cut plane in Fig. 6.50) is called “perpendicular space”. 

Dislocations can be introduced by the Volterra process described in Fig. 6.1, via 
two distinct procedures which are described schematically in Fig. 6.50(c)-(f). 

• If the Volterra process is performed in the aperiodic two-
dimensional lattice (i.e., the cut plane in Fig. 6.50), for
instance by removing half a wavy band and pasting the
two lips together again, a partial dislocation is obtained
(Fig. 6.50c). As a matter of fact, a fault is created on account
of the lack of translational periodicity, which is seen in Fig.
6.50(d) as a step in the truncated three-dimensional pile-up
of cubes.
• If the Volterra process is performed in the periodic three-
dimensional lattice, by insertion of a supplementary half
{100} plane (dark in Fig. 6.50e), and if the dislocated
structure is subsequently cut and projected in the two-
dimensional plane, a perfect dislocation is obtained. In this
case, the dislocation is no longer connected to a fault.
However, since the supplementary {100} half-plane has
distorted the periodic cubic lattice along every direction, in
particular the direction perpendicular to the figure, the cut
plane does not intersect the very same cubes, which results
in local tiling changes all around the dislocation. These point
defects, which are typical of quasicrystals, are called phasons.

Fig. 6.50 Schematic representation of dislocations in quasicrystals:
(a,b) definition of a two-dimensional aperiodic tiling from a three-
dimensional periodic network; (c,d) creation of an imperfect dislocation
by the Volterra process in two-dimensional tiling; (e) creation of a
perfect dislocation by the Volterra process in the three-dimensional
network; (f) phason point defect created at dots in (e). From Mompiou
and Caillard (2005).


In summary, quasicrystals can contain partial dislocations connected to “phason 
walls” (which can be compared to partial dislocations connected to stacking faults 
in crystals). However, these partial dislocations can transform into perfect ones if 
their phason walls evaporate and evolve to a cloud of individual phasons sur- 
rounding the dislocations. These two kinds of dislocations have been observed 
abundantly in TEM (see, e.g., Mompiou and Caillard 2005). 

The mechanical properties of quasicrystals have been reviewed by several authors
(Feuerbacher et al. 1999; Takeuchi 1999). They are characterized by low activation
volumes and high activation energies which do not yield any clear indica-
tion on the rate-controlling process. 

Microscopic observations indicate that the back-stress due to the formation of 
phason walls in the wake of moving dislocations is a substantial but not prepon- 
derant fraction of the flow stress. The remaining part is due to internal stresses 
and to the dislocation-lattice interaction. The latter has been explained in a first 
step by obstacles opposing dislocation glide. More recent investigations, however, 
showed that dislocation motion takes place by climb (Fig. 6.51; see Caillard et al. 
2000; Mompiou et al. 2003). Conversely, glide appears almost impossible, proba- 
bly on account of the very high corrugation of dense planes, which is an intrinsic 
property of quasicrystals, as shown in Fig. 6.50b. 

Under such conditions, the low-temperature brittleness can easily be interpreted
by the absence of glide, which is the usual mode of deformation of
conventional alloys at low temperature. However, interpretations of the high-
temperature mechanical properties are facing the same difficulties as in Mg (Section
6.2.6.6), because the activation parameters are not those expected a priori
from the simplest theories of climb. As a matter of fact, the dislocation velocity
varies with stress as v ∝ τ^n, with n ranging between 4 and 8 (Mompiou 2004),
whereas Eq. (48) accounts for smaller values not exceeding 3. Further work is
thus necessary to reconcile the microscopic and macroscopic aspects of climb, in
crystals and in quasicrystals.

Fig. 6.51 Dislocations d1 and d2 moving by climb in an
AlPdMn quasicrystal. The contrast of the phason wall is
arrowed. From Mompiou et al. (2003).
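
The stress exponent n in v ∝ τ^n is simply the slope of ln v versus ln τ. A minimal sketch of how such an exponent could be extracted by least squares is given below; the velocity data are synthetic, generated with an assumed exponent of 5, and do not represent measurements from the cited work.

```python
import math


def stress_exponent(stresses, velocities):
    """Least-squares slope of ln(v) versus ln(tau), i.e. n in v ~ tau**n."""
    xs = [math.log(t) for t in stresses]
    ys = [math.log(v) for v in velocities]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    s_xy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    s_xx = sum((x - x_mean) ** 2 for x in xs)
    return s_xy / s_xx


# Synthetic data: hypothetical reference velocity and an assumed exponent of 5.
stresses = [100e6, 150e6, 200e6, 300e6]                 # Pa
velocities = [1e-9 * (t / 100e6) ** 5 for t in stresses]  # m/s

n_fit = stress_exponent(stresses, velocities)
print(f"fitted stress exponent n = {n_fit:.3f}")
```

Applied to real velocity measurements, an exponent between 4 and 8 would fall in the range quoted above, whereas values of 3 or less would be compatible with Eq. (48).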


6.5 
Outlook 


Although several aspects of dislocation plasticity may appear complex, major 
progress has been made toward rationalizing mechanical properties in terms of 
a few elementary processes. Indeed, the number of different thermally activated 
mechanisms controlling the plasticity of metals and alloys — even the most com- 
plex ones — appears fairly small, and the main tendencies of dislocation pattern- 
ing in monotonic deformation and in fatigue are now well understood. 

Since several elementary mechanisms are contributing, in parallel and/or in se- 
ries, to the mobility of each moving dislocation, the determination of the most 
important one — which will determine the macroscopic mechanical properties — 
is a delicate step. It can be done unambiguously only when there is a fairly good 
agreement between results of several complementary techniques, e.g., conventional
“post mortem” TEM, in-situ TEM, and transient deformation experiments.

Research into new materials is still very active in the direction of lower weight, 
higher strength (especially at high temperature), and higher ductility. In this re- 
spect, it is expected that small-grain and nanograin alloys and intermetallic alloys 
will be developed extensively in the near future. However, and without any doubt, 
further progress in this field requires a better understanding of almost all the 
fundamental properties described in this chapter, in particular diffusion- 
controlled climb processes, grain boundary sliding, and nucleation of dislocations 
at surfaces and interfaces. 


7
Phase Equilibria and Phase Transformations 


Brent Fultz and Jeffrey J. Hoyt 


Metallic alloys are composed of more than one chemical species of atom, and 
these atoms have preferred spatial arrangements at fixed temperatures and pres- 
sures. This chapter explains the thermodynamic origin of these spatial preferences,
using simple models and some advanced concepts.1) It comprises five
main sections. Section 7.1 is devoted to thermodynamic equilibrium, with 
emphasis on how phase diagrams can be understood with generic interatomic in- 
teractions. Section 7.2 covers departures from equilibrium owing to kinetic con- 
straints, and also how interatomic interactions bias the atom movements that 
lead to equilibrium structures. The last three sections (7.3-7.5) cover the major 
categories of phase transformations in alloys, with emphasis on microstructural 
evolution from a continuum viewpoint. 


7.1 
Alloy Phase Diagrams 


This section presents the alloy physics of phase diagrams. It begins with the de- 
velopment of phase diagrams of binary alloys as functions of temperature and 
composition, using concise arguments that should be understood by all students 
of materials science. The section continues with an overview of more advanced 
phenomena that are needed for a quantitative understanding of alloy phase dia- 
grams. Some of these topics are covered in greater depth in Chapters 4, 10, and 11. 


7.1.1 
Solid Solutions 


Consider a binary alloy with atom species A and B arranged randomly on the
sites of a crystal. The concentration of B atoms is c, and the concentration c (in
mole fractions) varies from 0 to 1. When c = 0 the alloy is pure element A, when
c = 1 the alloy is pure element B. The concentration of A atoms is 1 − c. The
crystal has N sites, and therefore it has cN B atoms and (1 − c)N A atoms. Each
site is surrounded by z sites as first-nearest-neighbors (1NN). For a random solid
solution, on average each A atom is surrounded by zc B atoms, and its other
(1 − c)z neighbors are A atoms. We start with a basic thermodynamic analysis
of a solid solution using two key assumptions:
• All the energy of atom configurations depends on first-
nearest-neighbor pairs of atoms.2) Each A-A pair has energy
e_AA, each A-B pair has energy e_AB, and each B-B pair has
energy e_BB. To obtain the total energy of an alloy
configuration, we count the number of pairs with each
energy.
• The alloy is truly random, so the species of atom on any
lattice site does not depend on the species of atoms
occupying its neighboring sites.

1) Some phenomena are described in more depth in Chapters 3, 4, 5, 8, 10, 11, and 12.

Consider an A atom on a lattice site. Its energy depends on the number of its
A atom neighbors, z(1 − c), and the number of its B atom neighbors, zc. The
energy for this average A atom, and likewise for the average B atom, is


\[ e_A = \frac{z}{2}(1 - c)\,e_{AA} + \frac{z}{2}\,c\,e_{AB} \tag{1} \]

\[ e_B = \frac{z}{2}(1 - c)\,e_{AB} + \frac{z}{2}\,c\,e_{BB} \tag{2} \]


The division by 2 corrects for the double counting that occurs when finding the 
average by summing over all the atoms in the crystal (because the bonds from 
the neighbor atoms back to our central atom are counted two times, in both 
directions). 

We can now write the partition function for a single site, Z_1site:


\[ Z_{1\mathrm{site}} = e^{-\beta e_A} + e^{-\beta e_B} \tag{3} \]


where β = 1/(k_B T). Using the second assumption above, all site occupancies are
independent, so the total partition function of the alloy with N sites is


\[ Z_N = \left(Z_{1\mathrm{site}}\right)^{N} \tag{4} \]


We can evaluate Z_N with the binomial expansion,


\[ Z_N = \sum_{n=0}^{N} \frac{N!}{(N-n)!\,n!}\,\left(e^{-\beta e_A}\right)^{N-n}\left(e^{-\beta e_B}\right)^{n} \tag{5} \]


When N is very large, Z_N becomes a sharply peaked function with a maximum at
n = cN. It is a standard practice in statistical mechanics to replace Z_N with its
maximum value (Kittel 1969),


\[ Z_N(n) = \frac{N!}{(N-n)!\,n!}\,\left(e^{-\beta e_A}\right)^{N-n}\left(e^{-\beta e_B}\right)^{n} \tag{6} \]


2) One way to justify this assumption is to argue that all the chemical bonding energy
is associated with neighboring atoms only.


Obtaining the Helmholtz free energy from the expression F = −k_B T ln Z, and
using the Stirling approximation ln(x!) ≈ x ln x − x, we obtain Eqs. (7-10)


\[ F(c) = (N-n)\,e_A + n\,e_B - k_B T\,[N\ln N - n\ln n - (N-n)\ln(N-n)] \tag{7} \]

\[ F(c) = [e_{AA} + 2c(e_{AB} - e_{AA}) + c^2(4V)]\,\frac{zN}{2} - k_B T\,[\ln N - c\ln n - (1-c)\ln(N-n)]\,N \tag{8} \]

\[ E_{\mathrm{conf}}(c) = [e_{AA} + 2c(e_{AB} - e_{AA}) + c^2(4V)]\,\frac{zN}{2} \tag{9} \]

\[ S_{\mathrm{conf}}(c) = k_B\,[-c\ln c - (1-c)\ln(1-c)]\,N \tag{10} \]


Here E_conf(c) and S_conf(c) are the configurational energy and configurational
entropy of the alloy. The word “configurational” refers to arrangements of atoms
on the crystal sites, and here E_conf(c) and S_conf(c) were derived for a random
arrangement.
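
The steps leading to Eqs. (9) and (10) can be checked numerically. The sketch below verifies that (N − n)e_A + n e_B, with e_A and e_B taken from Eqs. (1) and (2), reproduces E_conf(c) of Eq. (9), and that the Stirling form of the entropy in Eq. (10) agrees closely with the exact logarithm of the binomial coefficient for large N. The pair energies, coordination number, and system size are arbitrary illustrative assumptions, and energies are expressed in arbitrary units.

```python
import math


def site_energies(c, z, e_AA, e_AB, e_BB):
    """Average energies of an A site and a B site, Eqs. (1) and (2)."""
    e_A = 0.5 * z * ((1.0 - c) * e_AA + c * e_AB)
    e_B = 0.5 * z * ((1.0 - c) * e_AB + c * e_BB)
    return e_A, e_B


def configurational_energy(c, N, z, e_AA, e_AB, e_BB):
    """Eq. (9), with V defined as (e_AA + e_BB - 2 e_AB)/4."""
    V = (e_AA + e_BB - 2.0 * e_AB) / 4.0
    return (e_AA + 2.0 * c * (e_AB - e_AA) + c**2 * 4.0 * V) * z * N / 2.0


def entropy_over_kB(c, N):
    """Eq. (10) divided by k_B (Stirling approximation)."""
    return N * (-c * math.log(c) - (1.0 - c) * math.log(1.0 - c))


def exact_ln_multiplicity(n, N):
    """Exact ln[N! / ((N - n)! n!)] evaluated with log-gamma functions."""
    return math.lgamma(N + 1) - math.lgamma(N - n + 1) - math.lgamma(n + 1)


# Illustrative (assumed) parameters.
z, e_AA, e_AB, e_BB = 8, -1.0, -0.8, -1.2
N, n = 10**6, 300_000
c = n / N

e_A, e_B = site_energies(c, z, e_AA, e_AB, e_BB)
E_sum = (N - n) * e_A + n * e_B
E_eq9 = configurational_energy(c, N, z, e_AA, e_AB, e_BB)
S_stirling = entropy_over_kB(c, N)
S_exact = exact_ln_multiplicity(n, N)

print(f"E from site energies: {E_sum:.6e}   E from Eq. (9): {E_eq9:.6e}")
print(f"S/k_B (Stirling): {S_stirling:.2f}   S/k_B (exact): {S_exact:.2f}")
```

The residual difference between the two entropy values grows only logarithmically with N, which is why the Stirling form is adequate for a macroscopic crystal.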

In Eq. (8) we defined an important combination of pair potentials: 


\[ V = \frac{e_{AA} + e_{BB} - 2e_{AB}}{4} \tag{11} \]

This parameter V is sometimes called the “interchange energy.” Its physical
meaning is made clear when we consider the change in configuration of an A
and B atom pair as shown in Fig. 7.1. For the initial configuration of the central
A and B atoms in Fig. 7.1(a), the figure shows a total of five A-B pairs, one A-A
pair, and one B-B pair. After interchange of only the central A and B atoms,3)
we have three A-B pairs, two A-A pairs, and two B-B pairs (Fig. 7.1(b)). After
interchange, the change in pair energy is precisely 4V = e_AA + e_BB − 2e_AB (or an
integer multiple of 4V for different configurations).

Fig. 7.1 (a) Initial configuration around central A and B atoms.
(b) New configuration after exchange of the central A-B pair.
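
The pair counting described above can be reproduced with a few lines of code. The sketch assumes a square lattice (z = 4) in which the central A atom has outer neighbors A, B, B and the central B atom has outer neighbors B, A, A; this particular neighbor assignment is an assumption chosen to match the pair counts quoted in the text, and the pair energies are hypothetical.

```python
import math
from collections import Counter


def count_pairs(left, right, left_nbrs, right_nbrs):
    """Count A-A, A-B, and B-B bonds around two adjacent central sites."""
    bonds = [(left, right)]
    bonds += [(left, nbr) for nbr in left_nbrs]
    bonds += [(right, nbr) for nbr in right_nbrs]
    # Sorting each bond makes "AB" and "BA" the same key.
    return Counter("".join(sorted(bond)) for bond in bonds)


def pair_energy(pairs, e_AA, e_AB, e_BB):
    """Total energy of the counted bonds."""
    return pairs["AA"] * e_AA + pairs["AB"] * e_AB + pairs["BB"] * e_BB


# Assumed outer neighbors of the left and right central sites (fixed during the swap).
left_nbrs = ["A", "B", "B"]
right_nbrs = ["B", "A", "A"]

before = count_pairs("A", "B", left_nbrs, right_nbrs)   # configuration (a)
after = count_pairs("B", "A", left_nbrs, right_nbrs)    # configuration (b)

# Hypothetical pair energies in arbitrary units.
e_AA, e_AB, e_BB = -1.0, -0.8, -1.2
delta_E = pair_energy(after, e_AA, e_AB, e_BB) - pair_energy(before, e_AA, e_AB, e_BB)
four_V = e_AA + e_BB - 2.0 * e_AB

print("before:", dict(before))
print("after: ", dict(after))
print(f"energy change = {delta_E:.3f}, 4V = {four_V:.3f}")
```

Other choices of outer neighbors change the pair counts, and the energy change then becomes a different integer multiple of 4V, as noted in the text.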

Consider first the case where V < 0, meaning that the average of A-A and B-B 
pairs is more favorable than A-B pairs. This gives the alloy a tendency to unmix 
chemically. The opposite sign of V > 0 means that at low temperatures the alloy 
will tend to maximize the number of its A-B pairs. It does so by developing 
chemical order, defined by order parameters presented in Section 7.1.7. 

3) Note that this interchange conserves the total number of atoms of each type. Also
note that an interchange of like pairs, A-A or B-B, has no effect on the total number
of like or unlike pairs.

Fig. 7.2 (a) Configurational energy and entropy for V < 0 [Eqs. (9) and
(10)]. (b) Free energy for different values of zV/(k_B T) as labeled.

The energy and entropy terms of Eqs. (9) and (10) are plotted in Fig. 7.2(a).
When E_conf(c) is added to -TS_conf(c) to obtain the free energy curves in Fig.
7.2(b), the logarithmic singularity of the entropy always forces a downward slope
away from c = 0 and c = 1. The energy term, positive in this case with V < 0, can
dominate at intermediate concentrations, especially at low temperatures. For
small magnitudes of zV/k_B T (high temperatures), the minimum in free energy
is obtained for a random solid solution, which has a simple shape in Fig. 7.2(b)
when the ratio is -0.667. The free energy F_conf(c) develops a region of negative
curvature around c = 0.5 when the ratio is more negative than -1, or equivalently
for temperatures below T_c = -zV/k_B. (From Eq. (8), the second derivative of F
with respect to c at c = 0.5 is 4N(zV + k_B T), which changes sign at this
temperature.) This T_c proves to be the “critical temperature” for “spinodal
decomposition” at a concentration of c = 0.5. Further interpretation of these
curves, the subject of the next sections, leads to the unmixing phase diagram of
Fig. 7.6(a), below.
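The change in curvature of the free energy at c = 0.5 can be checked numerically. The sketch below evaluates the free energy per site from Eqs. (9) and (10) in units of k_B T, with terms linear in c dropped because they do not affect the curvature. The function names and the finite-difference step are choices made for this illustration only.

```python
import math


def free_energy_per_site(c, zV_over_kT):
    """F/(N k_B T) from Eqs. (9) and (10), without terms linear in c.

    The energy -2*(zV/kT)*c*(1-c) differs from the quadratic term
    2*(zV/kT)*c**2 of Eq. (9) only by a term linear in c.
    """
    energy = -2.0 * zV_over_kT * c * (1.0 - c)
    entropy = -(c * math.log(c) + (1.0 - c) * math.log(1.0 - c))
    return energy - entropy


def curvature(c, zV_over_kT, h=1e-4):
    """Second derivative with respect to c by central differences."""
    f = free_energy_per_site
    return (f(c + h, zV_over_kT) - 2.0 * f(c, zV_over_kT)
            + f(c - h, zV_over_kT)) / h**2


for ratio in (-0.667, -1.0, -2.0):
    print(f"zV/kT = {ratio:6.3f}: curvature at c = 0.5 is "
          f"{curvature(0.5, ratio):+.3f}")
```

The curvature at c = 0.5 is positive for a ratio of -0.667, vanishes at -1, and is negative at -2, which is the “hump” that drives unmixing.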


7.1.2 
Free Energy and the Lever Rule 


When working with alloy thermodynamics, the analysis of free energy versus 
composition curves is an essential skill because it is the basis for understanding 
equilibrium phase diagrams of temperature and composition. In what follows we 
deduce the basic forms of alloy T-c phase diagrams. Furthermore, modifications 
of this analysis can sometimes be used when equilibrium is constrained by kinetic 
processes. Some considerations for the Helmholtz free energy, F(c) = E(c) — 
TS(c), for each phase are listed below: 
• The phase of maximum entropy will dominate at the highest
  temperatures. For metallic systems this is a gas of isolated
  atoms, but at lower temperatures most metals form a liquid
  phase with continuous solubility of A and B atoms. There are
  cases of chemical unmixing in the liquid phase, however, for
  which the thermodynamics of Eq. (8) is again relevant.
• At the lowest temperatures, the equilibrium case for a
  general composition is a combination of crystalline phases
  differing in their chemical compositions.
• At low to intermediate temperatures, the equilibrium phases
  and their chemical compositions depend in detail on the free
  energy versus composition curves F_ξ(c) for each phase ξ.


A simple but important phase diagram is for a binary alloy with just two
phases: a liquid phase of continuous solubility, and one crystalline solid phase
of continuous solid solubility. The free energy versus composition curves, F_S(c)
and F_L(c), are similar to the lowest curve of Fig. 7.2(b), but they differ in detail.
Figure 7.3 shows a typical case. At the highest temperature, T_4, the liquid curve has
the lowest free energy for any composition. The alloy is a liquid at all compositions,
as expected, since the liquid has the higher entropy. The -TS part of the free
energy for the liquid, F_L(c), changes quickly with temperature, however, because S_L
is large. With decreasing temperature, the free energy curve for the liquid rises
relative to the solid curve (and may flatten somewhat). At the lowest temperature,
T_1, the crystalline solid curve has the lowest free energy, F, for any composition,
consistent with its lower energy, E. At the temperature T_1, the alloy is a solid at
all compositions. The intermediate temperatures T_2 and T_3 require further
analysis. It is not simply a matter of picking the lowest of the free energy curves at a
particular composition because the alloy can unmix chemically. The free energy is
minimized by selecting an optimal combination of liquid and solid phases of
different chemical compositions.

Fig. 7.3 Free energy versus composition curves for solid and liquid
phases at four temperatures T_4 > T_3 > T_2 > T_1.


The conservation of solute provides a constraint between the fractional
amounts of phases and the chemical compositions of these phases. Consider the
material depicted in Fig. 7.4, which has an overall composition of c_0 = 0.3. If it
decomposes into a mixture of phases having different compositions, conservation
of solute requires that one of the phases has more solute than c_0, and the other
less. The composition deviations from c_0, indicated by the gray zones in Fig. 7.4,
must average to zero. The condition is stated in Eqs. (12) and (13).

$$(c_\alpha - c_0)\,f_\alpha + (c_\beta - c_0)\,f_\beta = 0 \tag{12}$$

$$\frac{c_0 - c_\alpha}{f_\beta} = \frac{c_\beta - c_0}{f_\alpha} \tag{13}$$

The symbols α and β are used here as general names for phases, and f_α and f_β are
the mole fractions of the α and β phases. Because f_α + f_β = 1, Eqs. (14) and (15)
follow:

$$f_\alpha = \frac{c_\beta - c_0}{c_\beta - c_\alpha} \tag{14}$$

$$f_\beta = \frac{c_0 - c_\alpha}{c_\beta - c_\alpha} \tag{15}$$
Fig. 7.4 Composition profile, composition (c) versus position (x), of a two-phase
alloy with overall composition c_0 = 0.3. The area of the upper gray band is
(0.7 - 0.3) × 0.2 = 0.08, whereas the areas of the two lower gray bands are each
(0.2 - 0.3) × 0.4 = -0.04.


Recall the balancing of a lever, where the heavier mass (larger f) placed on the 
shorter arm of the lever (smaller Ac) balances a smaller mass on the longer lever 
arm. Equation (13) is therefore known as the “lever rule.” The lever rule is a con- 
sequence only of solute conservation, and is useful when working either with 
curves of free energy versus composition, or with T-c phase diagrams. 
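As a worked example of the lever rule, the short Python sketch below evaluates Eqs. (14) and (15) for the compositions of Fig. 7.4 (c_0 = 0.3, with phases at 0.2 and 0.7) and checks the solute balance of Eq. (12). The function name and its argument names are illustrative choices, not notation from the text.

```python
def lever_rule(c0, c_alpha, c_beta):
    """Return the phase fractions (f_alpha, f_beta) of Eqs. (14) and (15)."""
    if not min(c_alpha, c_beta) <= c0 <= max(c_alpha, c_beta):
        raise ValueError("c0 must lie between the two phase compositions")
    f_alpha = (c_beta - c0) / (c_beta - c_alpha)
    f_beta = (c0 - c_alpha) / (c_beta - c_alpha)
    return f_alpha, f_beta


# Compositions taken from Fig. 7.4.
f_alpha, f_beta = lever_rule(c0=0.3, c_alpha=0.2, c_beta=0.7)

# Solute balance of Eq. (12): the deviations from c0 must average to zero.
balance = (0.2 - 0.3) * f_alpha + (0.7 - 0.3) * f_beta
print(f"f_alpha = {f_alpha:.2f}, f_beta = {f_beta:.2f}, balance = {balance:.1e}")
```

The majority phase (f_α = 0.8) is the one whose composition is closer to c_0, as in the balancing of a lever.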


7.1.3 
Common Tangent Construction 


One more tool is needed to relate curves of free energy versus composition to
phase diagrams: the “common tangent construction.” It is based on the reasonable
assumption that the total free energy, F_T, is the sum of contributions from
the fractions of the α and β phases. In Eq. (18) below we transform this to a
dependence on composition [through Eq. (15)] to make it more applicable to curves
of free energy versus composition [Eqs. (16)-(19)]:

$$F_T(f_\alpha, c_\alpha, f_\beta, c_\beta) = f_\alpha F_\alpha(c_\alpha) + f_\beta F_\beta(c_\beta) \tag{16}$$

$$F_T(c_\alpha, c_\beta, f_\beta) = (1 - f_\beta)\,F_\alpha(c_\alpha) + f_\beta F_\beta(c_\beta) \tag{17}$$

$$F_T(c_\alpha, c_\beta, c_0) = \left(1 - \frac{c_0 - c_\alpha}{c_\beta - c_\alpha}\right) F_\alpha(c_\alpha) + \left(\frac{c_0 - c_\alpha}{c_\beta - c_\alpha}\right) F_\beta(c_\beta) \tag{18}$$

$$F_T(c_0) = F_\alpha(c_\alpha) + \left(\frac{c_0 - c_\alpha}{c_\beta - c_\alpha}\right)\left[F_\beta(c_\beta) - F_\alpha(c_\alpha)\right] \tag{19}$$

Fig. 7.5 Free energy versus composition curves for solid (S) and liquid
(L) phases at the temperature T_2. For a selected c_0, the total free energy
is obtained on the straight line between selected points on the two free
energy curves. The minimum free energy is found on the tangent
common to the two curves.


One way to interpret Eq. (19) is as the equation of a straight line of free energy
versus the composition c_0 in the range c_α < c_0 < c_β. The values of F_T at the
endpoints of the line are F_α(c_α) and F_β(c_β), and F_T remains on a straight line of F
versus c_0 at any intermediate composition. This linear relationship of Eq. (19)
can be used to optimize the chemical compositions of a two-phase mixture. One
of the curves of Fig. 7.3 (for temperature T_2) is redrawn in Fig. 7.5. This figure
includes three sloping straight lines between pairs of endpoints on the curves F_L
and F_S. A chemical composition of c_0 = 0.7 is picked for illustration.

For thermodynamic equilibrium, we minimize the total free energy F_T by moving
the endpoints of the sloping lines in Fig. 7.5, keeping them on the curves of
the two allowable phases. From the linear relationship of Eq. (19), we know that
F_T will be at the three intersections marked by the crosses, “X”. From Fig. 7.5
we see that the lowest possible position for the “X” is found on the “common
tangent” between the two curves of free energy versus composition. The common
tangent minimizes F_T. The equilibrium compositions of the solid and liquid
phases are therefore at the compositions of the common tangent, and are labeled
“c_L” and “c_S” in Fig. 7.5.
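The minimization described above can be imitated numerically. The following sketch uses two hypothetical parabolic free energy curves (they are not the curves of Fig. 7.5) and a brute-force search over endpoint compositions to minimize the total free energy of Eq. (19). For these particular parabolas the common tangent can also be found by hand: it has slope 1 and touches the curves at c = 0.35 and c = 0.75.

```python
def F_S(c):
    """Hypothetical solid free energy curve (assumed for illustration)."""
    return 10.0 * (c - 0.3) ** 2


def F_L(c):
    """Hypothetical liquid free energy curve (assumed for illustration)."""
    return 10.0 * (c - 0.7) ** 2 + 0.4


def total_free_energy(c0, c_a, c_b, F_a, F_b):
    """Two-phase free energy of Eq. (19) for endpoints c_a < c0 < c_b."""
    return F_a(c_a) + (c0 - c_a) / (c_b - c_a) * (F_b(c_b) - F_a(c_a))


def common_tangent(c0, F_a, F_b, n=200):
    """Grid search for the endpoints that minimize Eq. (19) at fixed c0."""
    grid = [i / n for i in range(n + 1)]
    best = None
    for c_a in grid:
        if c_a >= c0:
            break
        for c_b in grid:
            if c_b <= c0:
                continue
            F = total_free_energy(c0, c_a, c_b, F_a, F_b)
            if best is None or F < best[0]:
                best = (F, c_a, c_b)
    return best


F_min, c_S, c_L = common_tangent(0.5, F_S, F_L)
print(f"c_S = {c_S:.3f}, c_L = {c_L:.3f}, F_T = {F_min:.4f}")
print(f"single-phase values: F_S = {F_S(0.5):.4f}, F_L = {F_L(0.5):.4f}")
```

The two-phase mixture found by the search has a lower free energy than either single phase at c_0 = 0.5. A grid search is used here only for transparency; a root finder applied to the equal-slope conditions would be more efficient.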


7.1.4 
Unmixing and Continuous Solid Solubility Phase Diagrams 


We now construct phase diagrams for alloys with the free energy versus composition
curves of Figs. 7.2 and 7.3. First note from Fig. 7.5 that the compositions c_L
and c_S from the common tangent construction serve to minimize the free energy
of a mixture of solid and liquid phases for any intermediate composition (here
from 0.4 < c_0 < 0.84). With F_ξ(c) curves at various temperatures for each phase
ξ, it is a straightforward exercise to construct common tangents and identify the
compositions of the phase boundaries. Results are presented in Fig. 7.6.

Fig. 7.6 (a) Unmixing phase diagram derived from curves of Fig. 7.2.
Arrow marks composition limits obtained by common tangent
construction from top curve of Fig. 7.2(b). (b) Continuous solid
solubility phase diagram derived from curves of Fig. 7.3. Dots mark
compositions obtained by common tangent constructions.

The unmixing phase diagram in Fig. 7.6(a) shows a solid solution at high
temperatures, denoted the “α-phase.” At low temperatures this solution becomes
unstable to chemical unmixing, and forms an A-rich “α′-phase” and a B-rich
“α″-phase.” Beneath the arching curve in Fig. 7.6(a), the chemical compositions
of the two phases can be read directly from the curve itself for any temperature.4)
The horizontal arrow points to these two compositions for a normalized temperature
of 0.5 (which is consistent with the common tangent applied to the top free
energy curve of Fig. 7.2(b)). In this unmixing phase diagram of Fig. 7.6(a), the
composition segregation is greater at lower temperatures, and is infinitesimally
small at c = 0.5 when at the critical temperature T_c = z|V|/k_B.
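The arching phase boundary can be estimated from the free energy of Eqs. (9) and (10). Terms linear in c do not affect a common tangent, and the remaining curve is symmetric about c = 0.5, so the tangent points are the two minima of the symmetrized curve. Writing m = 1 - 2c, and measuring temperature in units of T_c (taken here as the temperature at which the curvature at c = 0.5 changes sign), the condition dF/dc = 0 reduces to m = tanh(m T_c/T). The sketch below solves this by bisection. The function name and tolerances are illustrative assumptions.

```python
import math


def unmixing_compositions(t, tol=1e-12):
    """Compositions of the two phases at reduced temperature t = T/T_c.

    Solves m = tanh(m / t) for the positive root m by bisection and
    returns (c_low, c_high) = ((1 - m)/2, (1 + m)/2).
    """
    if t >= 1.0:
        return 0.5, 0.5          # single phase above the critical temperature

    def g(m):
        return m - math.tanh(m / t)

    lo, hi = 1e-9, 1.0           # g(lo) < 0 and g(hi) >= 0 when t < 1
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if g(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    m = 0.5 * (lo + hi)
    return 0.5 * (1.0 - m), 0.5 * (1.0 + m)


for t in (0.25, 0.5, 0.75, 0.95):
    c_low, c_high = unmixing_compositions(t)
    print(f"T/T_c = {t:4.2f}: c' = {c_low:.4f}, c'' = {c_high:.4f}")
```

The separation between the two compositions shrinks as T approaches T_c, in line with the description of Fig. 7.6(a).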

The phase diagram shown in Fig. 7.6(b) is for the case of continuous solid
solubility. The dots on this diagram were obtained directly from the common
tangents of the free energy versus composition curves of Figs. 7.3 and 7.5. Some
discretion was exercised in interpolating between the points. Between the
temperatures T_3 and T_4, perhaps halfway between them, Fig. 7.3 shows that the last of
the A-rich solid phase has melted. The pure element A has a single melting point,
of course, and this is also true for the element B. The phase diagram shows that
an A-B alloy does not have a single melting temperature, however. In equilibrium
at temperatures just above the “solidus” (the lower curve in this phase diagram),
the alloy tends to form a little liquid that is enriched in the element B. Just below
the “liquidus” (the upper curve in this phase diagram), the alloy is mostly liquid
with a small amount of solid phase enriched in the element A.

4) The fractional amounts of the phases are set by the lever rule or Eqs. (14) and (15).


7.1.5 
Eutectic and Peritectic Phase Diagrams 


It is possible to have equilibrium between three phases in a binary A-B alloy at a 
fixed temperature and pressure. The canonical examples involve a liquid phase in 
equilibrium with two solid phases. For example, Pb dissolves in Sb in the liquid 
phase, but at low temperatures the Pb-rich phase is fcc, whereas the Sb-rich 
phase is hexagonal. A separate free energy versus composition curve is needed 
for each crystalline phase. 

There are two possible forms of phase diagrams involving one high-temperature
phase and two low-temperature phases: “eutectic” and “peritectic”
phase diagrams. The form depends on the ordering in composition of the minima
of the three F_ξ(c) curves. The eutectic case occurs when the F_L(c) curve of
the liquid phase lies between the curves for the two solid phases as shown in
Fig. 7.7(a). This figure shows four liquid free energy curves for four temperatures.
For simplicity, the solid phase curves are assumed constant with temperature,
since they have the lower entropy. With increasing temperature, the liquid
free energy curve falls relative to the solid curves owing to the large -TS_L term
in the liquid free energy. Three common tangent constructions are shown in the
figure. The most interesting one is at the temperature T_2, where the common
tangent touches all three phases, with the liquid in the middle. This temperature
T_2 is the “eutectic temperature.” These points of contact are shown in Fig. 7.7(b),
which is a complete eutectic phase diagram.

Fig. 7.7 (a) Free energy curves for two solid phases, α and β, and
a liquid phase drawn for four temperatures T_1 < T_2 < T_3 < T_4.
Three common tangent constructions are shown, from which seven
compositions are found. (b) Eutectic phase diagram derived for
curves of part (a). Dots mark compositions from common tangent
constructions.

Fig. 7.8 (a) Free energy curves for two solid phases, α and β, and a
liquid phase drawn for one temperature. (b) Peritectic phase diagram
estimated from curves of part (a).

The other possible ordering in composition for three free energy curves occurs
when the liquid curve has its minimum on one side of the minima of the two
curves for the solids. This case is depicted schematically in Fig. 7.8(a), with the
corresponding “peritectic” phase diagram in Fig. 7.8(b). This phase diagram can
be constructed by the same common tangent exercise as was used for the eutectic
case in Fig. 7.7: lowering with temperature the F_L(c) curve relative to F_α(c) and
F_β(c). Peritectic phase diagrams are common when one crystalline phase, here
the β-phase, has a much lower melting temperature than the α-phase. Note also
that if the liquid free energy curve were to lie to the left of the two solid curves,
the common tangent construction would produce a peritectic phase diagram with
the higher melting temperatures to the right.


7.1.6 
More Complex Phase Diagrams 


Eutectic, peritectic, and continuous solid solubility phase diagrams can be
assembled into more complex diagrams with multiple phases and two-phase
regions (see Okamoto 2000). In various combinations, however, the forms of Figs.
7.6, 7.7(b), and 7.8(b), plus the ordering diagrams discussed below, account for
almost all features found in real binary alloy phase diagrams. For example, two
eutectic phase diagrams over composition ranges 0 < c < 0.5 and 0.5 < c < 1.0
could be attached together at the composition c = 0.5, where a third solid phase,
perhaps an ordered compound, is stable.5) Alternatively, a peritectic phase
diagram could be attached to one side of a eutectic diagram. In another variation,
the liquid phase in a eutectic diagram could be replaced by a random solid
solution, perhaps the disordered solid solution in the phase diagram of Fig. 7.6(b).
(When a solid solution replaces the liquid region in Figs. 7.7(b) or 7.8(b), the
diagram is called a “eutectoid” or “peritectoid” diagram, respectively.)

Fig. 7.9 (a) Ternary phase diagram of elements A, B, and C (to back).
(b) Sections removed to show that the ternary eutectic composition has
a lower melting temperature than any of the three binaries. The
triangular grid at the bottom depicts the alloy composition.

The same principles apply to “ternary” alloys containing three elements. In- 
stead of one independent composition variable as for a binary alloy, a ternary alloy 
has two, so a two-dimensional T-c diagram is no longer sufficient. Figure 7.9 is 
an example of a compound eutectic diagram for a ternary alloy. Here the temper- 
ature is plotted vertically, and the concentration of the third element, C, increases 
with distance from the front to the back of the structure. This ternary diagram is 
a special case where each pair of elements, A-B, B-C, and A-C, has a eutectic 
phase diagram. The regions of the A-B eutectic diagram are labeled in Fig. 
7.9(a). The Gibbs phase rule (Gibbs 1876) states that adding a component to the 
system (the element C in the ternary alloy) gives an additional degree of freedom 
in the intensive variables of temperature, pressure or mole fractions. By the 
Gibbs phase rule, the two-phase areas of the binary T-c eutectic diagram at 
constant pressure are now volumes in the ternary diagram. Two of these volumes 
are removed in Fig. 7.9(b), showing more clearly the solidus surfaces and the low- 
temperature ternary eutectic point of equilibrium between the liquid and three 
solid phases. 
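The bookkeeping of the Gibbs phase rule is simple enough to tabulate. The sketch below uses the standard form F = C - P + 2 for C components and P phases, reduced by one when the pressure is held constant. Here F counts the independent intensive variables of the coexisting phases, which is not the same as the dimension of a region drawn with overall composition as an axis. The function name and its keyword argument are illustrative.

```python
def degrees_of_freedom(components, phases, fixed_pressure=True):
    """Gibbs phase rule, F = C - P + 2, minus one if pressure is fixed."""
    f = components - phases + 2
    if fixed_pressure:
        f -= 1
    if f < 0:
        raise ValueError("too many coexisting phases for this system")
    return f


cases = [
    ("binary, two phases", 2, 2),
    ("ternary, two phases", 3, 2),
    ("binary eutectic (liquid + 2 solids)", 2, 3),
    ("ternary eutectic (liquid + 3 solids)", 3, 4),
]
for label, c, p in cases:
    print(f"{label}: F = {degrees_of_freedom(c, p)}")
```

Adding the third component raises F by one for the two-phase equilibrium, and the invariant point (F = 0) moves from three coexisting phases in the binary to four in the ternary, consistent with the ternary eutectic point described above.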

Handbooks of ternary phase diagrams usually do not show three-dimensional
structures as in Fig. 7.9, but instead present various cuts through the structure.
A cut with a vertical plane includes all temperatures, but only a constrained set
of compositions. The intersections of the phase boundaries on such vertical cuts
are called “pseudo-binary” diagrams. More commonly, horizontal cuts are used to
depict isothermal sections of the ternary diagram. These are shown with a
triangular composition boundary, with the pure elements at each corner. A stack of
these diagrams is required to depict the ternary diagram at multiple temperatures.
A subtlety of ternary isothermal sections concerns the compositions in
equilibrium across two-phase regions. The lever rule is still valid, but for an alloy
composition in a two-phase region it is necessary to know the “tie-lines” that
identify the compositions of the two equilibrium phases. Ternary alloys allow for
phases and microstructures beyond those possible for binary alloys, but today our
knowledge of ternary phase diagrams is much less complete. Many compilations
are missing phases, temperatures, or tie-lines, and it remains a challenge to
obtain this important information by experiment and theory.

5) This double eutectic diagram could be generated with the free energy versus
composition curves of Fig. 7.17.


7.1.7 
Atomic Ordering 


We return [Eq. (20)] to the issue of atom interaction preferences expressed in Eq.
(11):

$$V = \frac{e_{AA} + e_{BB} - 2e_{AB}}{4} \tag{20}$$





An alloy with unmixing tendencies has V < 0, so A-A and B-B pairs are prefer- 
able to A-B pairs. An alloy with the opposite tendency, V > 0, tends to maximize 
the number of A-B pairs at low temperatures. At high temperatures both types of 
alloys are expected to become random solid solutions if the configurational en- 
tropy is dominant, but their phases at low temperatures are fundamentally differ- 
ent because of their different signs of V. 

For illustration of an ordering transformation we choose the square lattice
shown in Fig. 7.10, and an equiatomic A-B alloy. For V > 0, we expect the
structure of Fig. 7.10(a) at T = 0. Figure 7.10 identifies sublattices for the two species
of atoms; for a crystal with N sites, each of these sublattices contains N/2 sites.

Fig. 7.10 (a) Ordered square lattice with two species {A, B} and two
interpenetrating square sublattices {α, β}. (b) Misplaced atoms A and B
(antisites) on generally ordered sublattices. The double arrow indicates
how a duo of A and B atoms can be exchanged to eliminate antisites.
Finite temperatures favor some disorder in the structure: putting a few atoms
on the wrong sublattice gives a big increase in the configurational entropy,6)
driving some mixing of A atoms onto the β-sublattice. By conservation of atoms and
conservation of sublattice sites, an equal number of B atoms must move onto the
α-sublattice. The number of these “wrong” atoms (or “antisites”) on each
sublattice is W, and the number of “right” atoms is R. The sublattice concentrations
are used to define a “long-range order” (LRO) parameter, L, for the alloy (Warren
1990), given by Eq. (21).

$$L \equiv \frac{R - W}{N/2} \tag{21}$$

This L can range from -1 to +1. The case of L = 1 corresponds to Fig. 7.10(a),
where R = N/2 and W = 0. The disordered alloy has as many right as wrong
atoms on each sublattice, i.e., R = W, so L = 0 for a disordered alloy.

We now calculate the temperature dependence of the order parameter, L(T), 
again by counting the number of A-A, A-B, and B-B pairs of atoms as in Section 
7.1.1. As we move atoms between sublattices, we must be sure that we conserve 
the number of each species. We ensure this by considering two atoms at a time — 
we transform between a “right duo” which comprises an A atom and a B atom on 
their proper sublattices, and a “wrong duo” which comprises an A atom and a B 
atom on the wrong sublattices (antisites). This corresponds to swapping the two 
atoms at the ends of the double arrow in Fig. 7.10. To calculate the equilibrium 
value of the LRO parameter, we seek the ratio of right and wrong duos as a func- 
tion of temperature. The energy of the two atoms in the wrong duo is given by 
Eq. (22). 


$$e_w = e_{AA}\,\frac{zR}{N/2} + e_{BB}\,\frac{zR}{N/2} + 2e_{AB}\,\frac{zW}{N/2} \tag{22}$$

term:      first      second     third
site:      wrong A    wrong B    wrong A (B)
neighbor:  right A    right B    wrong B (A)


To understand Eq. (22), remember that the neighbors of an atom are on the other
sublattice, so the first term is from the wrong A which has right A atoms as
neighbors, the second term is from the wrong B which has right B atoms as
neighbors, and the third term is from the other neighbors of the wrong A which
are wrong B atoms (plus the wrong A neighbors of the wrong B). Exchanging the
positions of the wrong A and B atoms (as indicated by the double arrow in Fig.
7.10) puts the two atoms on their proper sublattices. The analogous expression
for the energy of the “right” duo is Eq. (23), where the A neighbors of the right
A are wrong, the B neighbors of the right B are wrong, but the B (A) neighbors of
the right A (B) are right.

$$e_r = e_{AA}\,\frac{zW}{N/2} + e_{BB}\,\frac{zW}{N/2} + 2e_{AB}\,\frac{zR}{N/2} \tag{23}$$

6) In fact the entropy has a logarithmic singularity in sublattice concentrations
analogous to the concentration dependence of Eq. (10) near c = 0.
On the average, the energy of a single atom will be half that of Eq. (22) or (23).
We calculate the ratio of right to wrong duos (equal to R/W) as the ratio of their
Boltzmann factors, Eqs. (24) and (25).

$$\frac{R}{W} = \frac{e^{-e_r/(2k_BT)}}{e^{-e_w/(2k_BT)}} \tag{24}$$

$$\frac{R}{W} = \exp\!\left[-\frac{1}{k_BT}\,\frac{z}{2}\,\frac{(R-W)(2e_{AB} - e_{AA} - e_{BB})}{N/2}\right] \tag{25}$$


With Eq. (21) and noting that R + W = N/2, we find that R = (1 + L)N/4 and W =
(1 - L)N/4, so Eq. (25) becomes Eq. (26), where again V = (e_AA + e_BB - 2e_AB)/4.

$$\frac{1+L}{1-L} = \exp\!\left(\frac{2VzL}{k_BT}\right) \tag{26}$$
Equation (26) cannot be reduced further into a simple analytical expression for
L(T). It can be solved numerically, and the result for L(T) is shown in Fig. 7.11.
The critical temperature is found analytically by taking L as infinitesimally small,
for which Eq. (26) becomes Eq. (29) after rearrangement via Eqs. (27) and (28).

$$\ln(1 + 2L) = \frac{2LVz}{k_BT_c} \tag{27}$$

$$2L = \frac{2LVz}{k_BT_c} \tag{28}$$

$$T_c = \frac{Vz}{k_B} \tag{29}$$

Fig. 7.11 Long-range order versus temperature in the Bragg-Williams approximation.
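A numerical solution of Eq. (26) takes only a few lines. Using T_c = Vz/k_B from Eq. (29), Eq. (26) reads (1 + L)/(1 - L) = exp(2L T_c/T), which is algebraically the same as L = tanh(L T_c/T). The sketch below solves this by fixed-point iteration starting from full order. The function name, the iteration limit, and the tolerance are assumptions made for this illustration, and other root-finding methods would serve equally well.

```python
import math


def long_range_order(t, tol=1e-12, max_iter=100000):
    """Bragg-Williams LRO parameter L at reduced temperature t = T/T_c.

    Iterates L <- tanh(L / t), which is Eq. (26) rewritten with
    T_c = Vz/k_B, starting from the fully ordered state L = 1.
    """
    if t >= 1.0:
        return 0.0               # only the disordered solution remains
    L = 1.0
    for _ in range(max_iter):
        L_new = math.tanh(L / t)
        if abs(L_new - L) < tol:
            return L_new
        L = L_new
    return L


for t in (0.2, 0.5, 0.8, 0.95, 1.0):
    print(f"T/T_c = {t:4.2f}: L = {long_range_order(t):.4f}")
```

The iteration converges quickly at low temperatures and more slowly near T_c, where L falls continuously to zero.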


The theory presented in this section is sometimes called the “Bragg-Williams”
or “mean field” theory of ordering (Bragg and Williams 1934, 1935). Equation
(22) makes a simplifying assumption about counting pairs: if we pick an A-atom,
its neighbors on the adjacent sublattice sites will be determined only by
the overall concentration on the sublattice. There is no provision for any
preference for the neighbors of a wrong A-atom to be B-atoms, for example. To describe
the atom configurations, the theory uses only a simple sublattice concentration at
each site (or “point”) in the crystal. The Bragg-Williams approach is a theory in
the “point approximation.” Because it keeps track only of the average sublattice
concentrations, there is no spatial scale. The sublattices are infinite in size. It is
therefore appropriate to call L a long-range order parameter. A better
thermodynamic theory is possible by defining “pair variables” to account for the
short-range preference of neighboring atoms (Bethe 1935). This is discussed further
in Sections 7.1.9 and 7.2.6, and Chapter 10.


7.1.8 
Beyond Simple Models 


The models presented above in Section 7.1 give a valuable conceptual under- 
standing, but any quantitative understanding requires additional physical con- 
cepts and complexity. This section presents an overview of “advanced” concepts 
and methods which are essential for understanding phase diagrams of real alloys. 
The most reliable concept is the oldest - minimizing the free energy of multi- 
phase mixtures is the way to obtain equilibrium alloy phase diagrams. The more 
advanced concepts are needed for obtaining accurate free energy functions for 
these minimizations. 
• Enthalpy: At low temperatures the enthalpy originates with
  static structures of the nuclei and electrons, which are the
  essential components of matter. Both the bonding energy and
  the effects of pressure can be obtained from the electronic
  structure of the alloy. Most quantitative electronic structure
  calculations of today are based on the local density
  approximation for the electron exchange and correlation
  energy. The reliability of these methods is now known, and is
  often satisfactory for many predictions. Improvements are
  expected over the next decade, especially for calculations on
  magnetic systems and rare earth alloys.
• Entropy: The configurational entropy can be calculated with
  more accuracy than in Eq. (10). The problem with Eq. (10) is
  that at intermediate temperatures it is not correct to assume
  that the occupancy of a crystal site is independent of the
  occupancy of neighboring sites. There are short-range
  correlations between atom positions, and these reduce the
  configurational entropy. (Likewise, these correlations alter the
  configurational enthalpy.)
  Configurational entropy is not the only source of entropy,
  and in some systems it is only the minor contribution. For
  crystals of pure elements there is no configurational entropy,
  and for shear transformations there is no change in the
  configurational entropy. Dynamical sources of entropy from
  atom vibrations must always be considered, and motions of
  magnetic spins and electronic excitations can be important,
  too. Metals have a modest electronic entropy that is calculated
  from the density of electron states near the Fermi level.
• High temperatures and quasiparticle interactions: The
  configurational entropy has no intrinsic temperature
  dependence - it depends only on atom positions. On the
  other hand, the dynamical sources of entropy have several
  sources of temperature dependence. From the quasiparticle
  viewpoint, there are more interactions between phonon and
  electron excitations when more of them are present at
  elevated temperatures. These anharmonic sources of entropy
  can be large, and are currently an active area of research.


The topic of electronic structure is covered in Chapter 11, and will not be 
discussed further in the present chapter. Further discussion of the statistical 
mechanics of atom configurations is presented in Chapter 10, so only a brief over- 
view is presented here. The subject of lattice dynamics is covered in Chapter 4, 
but here we present the thermodynamic consequences of phonons. Analogous 
arguments pertain to magnons. 


7.1.9 
Entropy of Configurations 


Section 7.1.7 presented a thermodynamic analysis in the point approximation,
which assumed that all atoms on a sublattice were distributed randomly. This
assumption is best in situations when (a) the temperature is very high, so the atoms
are indeed randomly distributed on the sublattice; (b) the temperature is very low,
and only a few antisite atoms are present; or (c) in a hypothetical case when the
coordination number of the lattice goes to infinity. For more interesting
temperatures around the critical temperature, for example, it is possible to improve upon
this analysis by systematically allowing for short-range correlations between the
positions of atoms. In the pair approximation, the state of order is parameterized
in more detail by using independent variables to count the numbers of pairs of
atoms of the different types. In a solid solution these can express the local
tendency for preferences of like or unlike neighbor pairs, which is “short-range
order” (SRO). In an ordered alloy the SRO can change with some independence
of the long-range order (LRO). In fact, considerable SRO exists in alloys above the
critical temperature where the LRO is zero.

A pair variable p_ij is the probability that an arbitrarily chosen 1NN (first-
nearest-neighbor) pair of sites contains an i and j atom. In the pair approximation,
the free energy of a disordered solid solution is formulated with an energy
of pairs according to Eq. (30), and the configurational entropy in the pair
approximation for an alloy without the sublattices of LRO is given by Eq. (31) (Sato
1970; Kikuchi 1974):

$$E_{\rm conf2} = N\,\frac{z}{2} \sum_i \sum_j e_{ij}\,p_{ij} \tag{30}$$

$$S_{\rm conf2} = k_B N \left[ (z-1) \sum_i \left( p_i \ln p_i - p_i \right) - \frac{z}{2} \sum_i \sum_j \left( p_{ij} \ln p_{ij} - p_{ij} \right) \right] \tag{31}$$
The double sum in the entropy expression is expected from the combinatorics of
arranging pairs of atoms on pairs of lattice sites, assuming they are placed
randomly without consideration of the neighboring pairs. The double sum comes
from a random mixing of pairs, not individual atoms (points) as in Eq. (6).7) The
p_ij are the independent variables in the pair approximation. This leads to a
difficulty with the point variables, p_i, which are derived as sums over pair variables as
p_i = Σ_j p_ij. Each site in a square lattice belongs to four pairs, however. There is
an overcounting problem that needs to be addressed to ensure that the numbers
of atoms are conserved as order evolves.8) This is done with the “Fowler-Guggenheim”
correction factor to the combinatorics (Guggenheim 1935; Fowler
and Guggenheim 1940), and with the Stirling approximation it completes the
expression of Eq. (31). (Note the opposite sign of the first term with point variables
in Eq. (31).) With this correction, Eq. (31) for a truly random solid solution
without local atomic preferences is the same as the entropy of mixing of Eq. (10).
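The reduction of the pair entropy to the point entropy for a random solution can be verified numerically. The sketch below assumes the pair-approximation entropy per site in the form (z - 1) Σ p_i ln p_i - (z/2) Σ p_ij ln p_ij, in units of k_B. The linear Stirling terms are omitted because they sum to constants when the probabilities are normalized. The function names and the sample value of p_AB are illustrative choices.

```python
import math


def xlogx(x):
    """x ln x, with the limiting value 0 at x = 0."""
    return x * math.log(x) if x > 0.0 else 0.0


def point_entropy(c):
    """Entropy of mixing per site, Eq. (10), in units of k_B."""
    return -(xlogx(c) + xlogx(1.0 - c))


def pair_entropy(c, p_AB, z):
    """Pair-approximation entropy per site in units of k_B.

    c is the B concentration and p_AB = p_BA is the A-B pair variable.
    Conservation of atoms fixes p_AA = (1 - c) - p_AB, p_BB = c - p_AB.
    """
    p_AA = (1.0 - c) - p_AB
    p_BB = c - p_AB
    points = xlogx(1.0 - c) + xlogx(c)
    pairs = xlogx(p_AA) + xlogx(p_BB) + 2.0 * xlogx(p_AB)
    return (z - 1.0) * points - (z / 2.0) * pairs


c, z = 0.5, 4
random_value = pair_entropy(c, c * (1.0 - c), z)   # no local preferences
sro_value = pair_entropy(c, 0.30, z)               # excess of A-B pairs
print(f"point: {point_entropy(c):.4f}  pair (random): {random_value:.4f}  "
      f"pair (SRO): {sro_value:.4f}")
```

For p_AB = c(1 - c) the pair expression reproduces the point entropy, while a preference for unlike neighbors (p_AB = 0.30 at c = 0.5) lowers it, as expected when short-range correlations are present.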

7) The double sum in Eq. (31) is obtained analogously to the entropy expression
of Eq. (10), which involved the logarithm of the factorials in Eq. (6), and the
Stirling approximation for ln(x!).

8) Counting pairs is not equivalent to counting points. Consider the replacement
of one B-atom with an A-atom in an equiatomic alloy. In a disordered alloy this
will create z/2 new A-A pairs, but in an ordered alloy z A-A pairs will be
created.

For further accuracy, it is natural to extend this hierarchy of approximations
from point to pair to larger clusters such as squares on the square lattice, for
example. A systematic “cluster variation method” to do this was developed by
Kikuchi (1951), and has been in widespread use since the late 1970s (Van Baal 1973;
de Fontaine 1979). Variables for the populations of larger clusters, such as first- 
nearest-neighbor square variables, p_ijkl, are placed randomly on the square lattice,
for example, consistently with their concentrations. In this case the pair variables 
are overcounted, and the handling of this problem is the key to the method. In 
essence, it is necessary to account for the overcounting of all subclusters within 
the largest cluster, using weights that depend on how many subclusters are pres- 
ent in the large, independent cluster. The systematics to do this are elegant and 
straightforward, and are the subject of Chapter 10 of this book (see also (Kikuchi 
1951; Sanchez and de Fontaine 1980)). One measure of the success of these 
higher-order approximations is their convergence on the actual critical tempera- 
ture for ordering, as determined, for example, by Monte Carlo methods. In the 
pair approximation the exchange potential, V, sets the critical temperature at 
T_c = 0.869 zV/k_B, which is lower than the critical temperature of the point
approximation of Eq. (29), and is more accurate. Discrepancies of 1% or so are
possible with modest cluster sizes, but the analysis of accuracy is itself a specialty.
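
As a rough numerical check of the factor 0.869, the following sketch assumes that the pair approximation quoted here coincides with the standard Bethe (quasi-chemical) result, k_B T_c = 2V / ln[z/(z-2)], and that the point approximation gives k_B T_c = zV. Both the identification and the function name are assumptions made for illustration; they are not taken from the text.

```python
import math

def pair_to_point_tc_ratio(z: int) -> float:
    """Ratio T_c(pair) / T_c(point) for coordination number z.

    Assumes the Bethe result k_B*T_c = 2V / ln(z / (z - 2)) for the pair
    approximation and k_B*T_c = z*V for the point approximation.
    """
    if z <= 2:
        raise ValueError("the Bethe expression requires z > 2")
    return 2.0 / (z * math.log(z / (z - 2)))

# Compare several coordination numbers; z = 8 corresponds to the bcc lattice.
for z in (4, 6, 8, 12):
    print(z, round(pair_to_point_tc_ratio(z), 3))
```

Under these assumptions the ratio for z = 8 comes out near 0.869, and the correction to the point approximation shrinks as the coordination number grows.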

An important conceptual point is addressed by cluster approximations for the 
problem of chemical unmixing, or spinodal decomposition. The phase diagram 
for unmixing, Fig. 7.6, is based on the free energy versus composition curves of 
Fig. 7.2. At intermediate compositions, these curves obtained in the point approx- 
imation show a characteristic “hump” at temperatures below the critical temper- 
ature for unmixing. In the pair approximation, the magnitude of this hump is 
decreased considerably, and it continues to decrease as the cluster approximation 
extends to higher order (Cenedese and Kikuchi 1994). In the limit of very large 
clusters, the hump in the free energy versus composition curve vanishes, and a 
straight line connects two solid solutions of different compositions. What hap- 
pens is that the populations of higher-order clusters are themselves biased toward 
unmixed structures, where all the A-atoms are on one side of the cluster and the 
B-atoms on the other side, for example. With these internal degrees of freedom 
accounting for much of the unmixing enthalpy, the composition variations have 
less effect on the free energy. Composition is therefore insufficient for character- 
izing the state of the alloy, and the free energy versus composition curve in the 
point approximation is naive. This viewpoint helps to overcome a conceptual 
problem with curves such as in Fig. 7.2: “How reliable is an unstable free en- 
ergy function for calculating equilibrium phase diagrams?” Such difficult ques- 
tions are avoided in part by treating spinodal decomposition as a kinetic process 
with a modified diffusion equation, as in Section 7.4. 


7.1.10 
Principles of Phonon Entropy 


The partition function for a single harmonic oscillator of frequency ω_i = ε_i/ħ is
given by Eq. (32).
Equation (33) was obtained by identifying Eq. (32) as a geometric series times the
constant factor exp(−βε_i/2). The partition function for a harmonic solid with N
atoms and 3N independent oscillators is the product of these individual oscillator
partition functions, Eq. (34), from which we can calculate the phonon free energy
as F = −k_B T ln Z [Eq. (35)], and the phonon entropy by differentiating with
respect to T [Eq. (36)].
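
The displayed forms of Eqs. (32) to (36) do not appear in this copy of the text. For orientation, the standard textbook expressions that match the description above are sketched here, with β = 1/(k_B T) in this subsection; the assignment of equation numbers is an assumption based on the surrounding references.

```latex
\begin{align}
Z_i &= \sum_{n=0}^{\infty} e^{-\beta (n + 1/2)\,\varepsilon_i} \tag{32}\\
Z_i &= \frac{e^{-\beta\varepsilon_i/2}}{1 - e^{-\beta\varepsilon_i}} \tag{33}\\
Z_N &= \prod_{i=1}^{3N} \frac{e^{-\beta\varepsilon_i/2}}{1 - e^{-\beta\varepsilon_i}} \tag{34}\\
F_{\mathrm{ph}} &= \frac{1}{2}\sum_{i=1}^{3N} \varepsilon_i
   + k_B T \sum_{i=1}^{3N} \ln\!\left(1 - e^{-\beta\varepsilon_i}\right) \tag{35}\\
S_{\mathrm{ph}} &= k_B \sum_{i=1}^{3N} \left[ -\ln\!\left(1 - e^{-\beta\varepsilon_i}\right)
   + \frac{\beta\varepsilon_i}{e^{\beta\varepsilon_i} - 1} \right] \tag{36}
\end{align}
```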
It is often useful to work with a phonon density of states (DOS), g(ε), where
3N g(ε) dε phonon modes are in an energy interval dε. For a DOS acquired as
digital data in m intervals of width Δε (so ε_j = jΔε), the partition function can be
computed numerically [Eq. (37)].


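$$Z_N = \prod_{j=1}^{m} \left( \frac{e^{-\beta\varepsilon_j/2}}{1 - e^{-\beta\varepsilon_j}} \right)^{3N g(\varepsilon_j)\,\Delta\varepsilon} \tag{37}$$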


With no derivation we present Eq. (38), a useful expression for the harmonic
phonon entropy of a material at any temperature, where g(ε) is normalized to 1 and
n(ε) is the Planck distribution for phonon occupancy at the temperature of interest.


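$$S_{\mathrm{ph}} = 3 k_B \int_0^{\infty} g(\varepsilon) \left[ (n(\varepsilon) + 1) \ln(n(\varepsilon) + 1) - n(\varepsilon) \ln(n(\varepsilon)) \right] \mathrm{d}\varepsilon \tag{38}$$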
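
Equation (38) is straightforward to evaluate for a tabulated DOS. The sketch below is one possible numerical version, assuming an equally spaced energy grid in meV and a simple rectangle-rule integration; the function name, the Debye-like test DOS, and the 40 meV cutoff are all hypothetical choices for illustration.

```python
import numpy as np

K_B_MEV = 8.617333e-2  # Boltzmann constant in meV/K

def phonon_entropy(eps_mev, dos, temperature_k):
    """Harmonic phonon entropy per atom, in units of k_B, following Eq. (38).

    eps_mev: equally spaced phonon energies in meV, all > 0
    dos:     g(eps) on the same grid; it is renormalized to unit area here
    Not intended for very low temperatures, where n(eps) underflows to zero.
    """
    eps = np.asarray(eps_mev, dtype=float)
    g = np.asarray(dos, dtype=float)
    d_eps = eps[1] - eps[0]
    g = g / (g.sum() * d_eps)                             # normalize g to 1
    n = 1.0 / np.expm1(eps / (K_B_MEV * temperature_k))   # Planck occupancy
    integrand = g * ((n + 1.0) * np.log1p(n) - n * np.log(n))
    return 3.0 * integrand.sum() * d_eps

# Hypothetical Debye-like DOS, g ~ eps^2, up to an assumed 40 meV cutoff
eps = np.linspace(0.05, 40.0, 800)
print(phonon_entropy(eps, eps**2, 300.0))
```

Because the DOS is renormalized inside the function, raw measured intensities can be passed in directly; only the shape of g(ε) matters.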


A handy expression for the high-temperature limit of the difference in phonon
entropy between two harmonic phases, α and β, Eq. (39), can be obtained readily
from Eq. (36).
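
The display for Eq. (39) is missing from this copy. A form consistent with the numerical example given in the text for this equation, offered here as an assumption rather than as the original typesetting, follows from the high-temperature limit in which each mode contributes k_B[1 + ln(k_B T/ε_i)] to the entropy:

```latex
\Delta S_{\mathrm{ph}}^{\beta-\alpha}
  = 3 k_B \int_0^{\infty} \left[ g^{\alpha}(\varepsilon) - g^{\beta}(\varepsilon) \right]
    \ln\varepsilon \,\mathrm{d}\varepsilon \tag{39}
```

Since each DOS is normalized to 1, the temperature-dependent terms cancel in the difference, leaving only the logarithmic average of the phonon energies.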


The important point about Eq. (37) is that the only material parameter relevant
to the thermodynamic partition function is the phonon DOS, g(ε). Measuring or
calculating the phonon DOS is the key to understanding the phonon contributions
to alloy thermodynamics. This must be done with high accuracy. For example,
if we apply Eq. (39) to a case where the phonon DOS curves of the α and β
phases have the same shape, but differ in energy scaling by 10%, we obtain a
change in phonon entropy of ΔS_ph = 3k_B ln(1.1) = 0.3 k_B/atom. This change in
phonon entropy is almost half of the maximum possible change in configurational
entropy of the order-disorder transformation of a binary alloy, which is
k_B ln 2/atom. When comparing the phonon entropies of different alloy phases, a
1% accuracy in the difference in logarithmic-averaged phonon energy is often
required. Although the role of phonon entropy in phase transformations has been
discussed for many years (Slater 1939; Booth and Rowlinson 1955), only in recent 
times have measurements and calculations become adequate for assessing trends 
(Anthony et al. 1993; Wallace 2003). 
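
The arithmetic behind the 10% example can be checked in a few lines. This is only a sketch of the high-temperature estimate quoted above; the function name is hypothetical.

```python
import math

def entropy_change_from_scaling(scale: float) -> float:
    """High-temperature phonon entropy change per atom, in units of k_B,
    when every phonon energy is divided by `scale` (same DOS shape)."""
    return 3.0 * math.log(scale)

delta_s = entropy_change_from_scaling(1.10)   # phonon energies differ by 10%
max_config = math.log(2.0)                    # k_B ln 2 per atom, binary alloy
print(f"{delta_s:.3f} k_B/atom, ratio to k_B ln 2: {delta_s / max_config:.2f}")
```

The same function shows why a 1% accuracy matters: a 1% error in the energy scale already shifts the entropy by about 0.03 k_B/atom.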

At low temperatures it is possible to use electronic structure calculations with 
the local density approximation to calculate phonon frequencies with reasonable 
accuracy, but a 1% accuracy remains a challenge. Inelastic neutron scattering (see 
Chapter 4) faces its challenges too (Squires 1978; Lovesey 1984). A triple axis 
spectrometer at a reactor neutron source is highly accurate in its energy and
momentum measurements, and when a full set of phonon dispersions is measured
on single crystals of ordered compounds, the phonon DOS derived from a lattice 
dynamics model is highly reliable. This technique is not so reliable for disordered 
solid solutions, unfortunately, because the dispersions are inherently broadened, 
and this broadening need not be symmetrical in energy. Direct measurements 
of the full phonon DOS of disordered alloys are possible with chopper spectrom- 
eters at pulsed neutron sources, but other difficulties arise. In general, the neu- 
tron scattering from phonon excitations does not receive an equal contribution 
from the different elements in an alloy, and the displacement amplitudes of the 
different elements may be different in different phonons. This causes a “neutron 
weight” that is difficult to correct, although differential measurements are often 
possible. 


7.1.11 
Trends of Phonon Entropy 


Our knowledge about the phonon entropy of alloy phases is still being organized, 
but some trends are clear, and some correlations between phonon entropy and al- 
loy properties are known. The Hume-Rothery rules of alloy thermodynamics are 
based on the atomic properties of: (a) electronegativity, (b) metallic radius, and (c) 
electron-to-atom ratio, with the first factors being the most important (see
Chapter 2). For phonon entropy, we should perhaps add (d) atomic mass to this
list. It is found that across the periodic table, where atomic mass varies from
1 to 238 and beyond, for matrices having atoms of mass M_0, and solutes having
mass M_s, the phonon entropy upon alloying tends to scale with ln(M_0/M_s)
(Bogdanoff and Fultz 1999). This is a general consequence of the relationship
for an oscillator of ω = √(k/M), where the frequency, ω, scales as the inverse
root of the mass, M, assuming that the spring constant, k, is in fact a
constant. There is a large scatter in this correlation, however, and it is not
a reliable one for most alloys. The spring constants, or more specifically the
interatomic force constants Φ_ij (see Chapter 4),
are in fact not constant in alloys. In particular, a larger solute atom will cause 
a local compression. Since interatomic forces generally become stiffer under com- 
pression, alloying with an atom having both a large size and a large mass will 
produce an uncertain result for the vibrational entropy. 

Progress has been made, however, by van de Walle and Ceder (2002), who pro- 
posed a model based upon a bond stiffness versus bond length argument. In 
their model, an atom pair in different local atomic configurations will have differ- 
ent bond stiffnesses, with greater stiffness for shorter interatomic distances. 
These characteristics seem to be transferable when atom pairs are in different 
crystal structures. The model is useful for semiquantitative arguments, and 
when calibrated for specific elements by ab-initio calculations, for example, this 
model can be used for comparing phonon entropies of different alloy phases. 

In alloying there are correlations between phonon entropy and electronegativ- 
ity. In a systematic study of transition metal solutes in vanadium, it was found 
that the phonon entropy had a robust correlation with the difference in electro- 
negativity between the solute atom and the vanadium atom. Results shown in 
Fig. 7.12 indicate that this correlation works well for solutes from the 3d, 4d, 
and 5d rows of the periodic table, in spite of their large differences in mass 
and atomic size. Electronic structure calculations have shown that the large differ- 
ences in electronegativity cause large charge transfers between the solute and its 
first-nearest-neighbor atoms that alter the charge screening sufficiently to vary 
the interatomic forces (Delaire and Fultz 2006). 
Fig. 7.12 Vibrational entropy of alloying of 6 at% solutes in bcc
vanadium, plotted versus Pauling’s electronegativity of the solute 
element. 




7.1.12 
Phonon Entropy at Elevated Temperatures 


A harmonic oscillator has generalized forces that are linear with the displacement 
of a generalized coordinate. This is consistent with Hooke’s law. The word
“anharmonic” describes any oscillator with forces that deviate from this linearity. In
thermodynamics, the word “anharmonic” is used more restrictively, and the word 
“quasiharmonic” describes some effects from classical thermodynamics that are 
not strictly harmonic, but can be interpreted as altered harmonic behavior. Here 
we use the following words to describe how phonon frequencies change with the 
intensive variables of temperature and pressure. 
• Harmonic phonons undergo no change in frequency with T or P.
• Quasiharmonic phonons have frequencies that depend on volume only. At a
  fixed volume, however, they behave as harmonic oscillators. Their
  frequencies can change with temperature, but only because thermal expansion
  alters the volume of the solid.
• Anharmonic phonons change their frequencies for reasons other than volume.
  For example, the interatomic potential may change if temperature drives
  electronic excitations across the Fermi surface. Alternatively, quartic
  terms in the interatomic potential change the phonon frequencies with
  phonon occupancy, but need not cause thermal expansion. A pure temperature
  dependence of phonon frequencies, independent of volume effects, is typical
  of anharmonic effects.


With increasing temperature, energy goes into phonon creation, but energy 
also goes into the expansion of a crystal against its bulk modulus. It is necessary 
to minimize a free energy function that includes phonon energy, phonon entropy, 
and the elastic energy of thermal expansion. 

We can test for anharmonicity by the following method, which compares the
thermal shifts of phonons to predictions from classical thermodynamics. It is
possible to calculate a free energy F(V,T), with an elastic energy that depends
on thermal expansion (a quadratic function of βΔT), and a quasiharmonic phonon
entropy that increases in proportion to γβΔT (where γ is the Grüneisen
parameter and β is the coefficient of linear thermal expansion) as the phonon
frequencies soften.⁹⁾ With the elastic energy and phonon entropy, optimizing the
amount of volume expansion at a fixed temperature gives an equality between the
energy term, proportional to Bβv, and the phonon entropy term, proportional to
γC_V, where v is the specific volume, B is the bulk modulus, and C_V is the
specific heat at constant volume. The thermal expansion is therefore given by
Eq. (40).

$$\beta = \frac{\gamma C_V}{3 B v} \tag{40}$$

9) It is also possible to add a contribution from the phonon energy that
softens with volume, but it is found that this effect is small compared to the
change in elastic energy.

Fig. 7.13 Heat capacity versus temperature for two assumed physical
models. Inset is the phonon DOS at 0 K; all modes were assumed to
soften by 6.5% at 300 K. A simple harmonic heat capacity with a
phonon DOS unchanged with temperature is also shown. 3R is the
classical limit. The full heat capacity also includes an electronic
contribution that may be comparable to the difference curve.


(With electronic entropy, an additional term, γ_el C_elV/(3Bv), would be added to
the thermal expansion coefficient.) Equation (40) is useful for identifying
quasiharmonic behavior. Suppose we know the phonon softening from the parameter γ,
or better yet we know the softening for all phonon modes in the DOS. With the
phonon DOS we can calculate C_V, and conventional measurements can provide
B, v, and β, accounting for all unknowns in Eq. (40). If Eq. (40) proves to be
true, we say that the solid is “quasiharmonic.” Typical quasiharmonic effects on
the heat capacity are shown in Fig. 7.13.
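
The quasiharmonic test built on Eq. (40) can be sketched numerically. The snippet assumes the standard Grüneisen relation with β as the linear expansion coefficient, so that the volume coefficient is 3β; the function names, the 10% tolerance, and the aluminum-like input values are hypothetical and serve only as an illustration.

```python
def linear_expansion_coefficient(gamma, c_v, bulk_modulus, volume):
    """Quasiharmonic estimate beta = gamma * C_V / (3 * B * v), per kelvin.

    c_v and volume must refer to the same amount of material
    (here J/(mol K) and m^3/mol); bulk_modulus is in Pa.
    """
    return gamma * c_v / (3.0 * bulk_modulus * volume)

def is_quasiharmonic(beta_measured, beta_predicted, rel_tol=0.1):
    """True when measured and predicted expansion agree within rel_tol."""
    return abs(beta_measured - beta_predicted) <= rel_tol * abs(beta_measured)

# Illustrative inputs, roughly aluminum-like near room temperature (assumed)
beta_qh = linear_expansion_coefficient(gamma=2.2, c_v=24.9,
                                       bulk_modulus=76e9, volume=1.0e-5)
print(beta_qh, is_quasiharmonic(2.3e-5, beta_qh))
```

A measured expansion coefficient well outside the chosen tolerance would point to anharmonic or electronic contributions rather than purely volume-driven softening.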

Anharmonic behavior is phonon softening or stiffening inconsistent with Eq. 
(40). Anharmonicity involves phenomena beyond those of independent phonons. 
Electron-phonon interactions, changes in electronic structure with temperature,
or higher-order phonon-phonon interactions can cause anharmonic behavior.
Today, unfortunately, it is difficult to differentiate between these anharmonic effects
(Wallace 1998). Nevertheless, substantial anharmonic effects are known for Cr
(Trampenau et al. 1993), V (Bogdanoff et al. 2002), Ce (Manley et al. 2002), Mo,
W, Si, and Ge (Wallace 2003), and anharmonic effects likely occur in many other 
systems. 


7.2 
Kinetics and the Approach to Equilibrium 


A phase diagram is a construction for thermodynamic equilibrium, and therefore 
contains no information about how much time is needed before the phases ap- 
pear with their equilibrium fractions and compositions. It is usually true that 
the phases found after practical times of minutes or hours will be consistent 
with the phase diagram, since most phase diagrams were deduced from experi- 
mental measurements on such time scales. For rapid heating or cooling, how- 
ever, the kinetic processes of atom rearrangements often cause deviations from 
equilibrium, and these are described below. The last part of this section addresses 
atomistic theories of kinetics by kinetic extension of the statistical mechanics of 
Sections 7.1.7 and 7.1.9. 


7.2.1 
Suppressed Diffusion in the Solid (Nonequilibrium Compositions) 


We start with equilibrium predictions for phase fractions and compositions for an
alloy of composition c_0, cooled from the liquid and allowed to freeze (Fig. 7.14).
The relevant graphs to examine are on the right of Fig. 7.14, specifically the thick
solid curves (the thin dotted lines indicate the correspondence between the
cooling curves and the eutectic phase diagram). The six graphs are the fractions
and compositions for each phase, L, α, and β, over a range of temperatures. At
high temperatures the alloy is entirely liquid, with composition c_0, but below the
liquidus temperature the fraction of liquid decreases as crystals of the β-phase
grow in the liquid. Using the lever rule [Eq. (13)], the fractions of β-phase and
liquid phase are approximately equal at the eutectic temperature of Fig. 7.14.
Their chemical compositions are simply read from lines drawn in the phase diagram.
Note that the α-phase does not exist above the eutectic temperature for this alloy
of composition c_0, but the lever rule shows that immediately below the eutectic
temperature the α-phase should exist with a phase fraction, f_α, of approximately
0.3.

Fig. 7.14 Analysis of eutectic phase diagram for phase fractions, f,
and compositions, c, upon cooling under equilibrium conditions
(thick curves on the right), suppressed diffusion in the solid phases
(thick broken curves), and very rapid cooling with suppressed diffusion
in the liquid (thin curves). See text for discussion.

10) On the other hand, this does not necessarily mean that all phases on phase
diagrams are in fact equilibrium phases. Exceptions are found, especially at
temperatures below about half the liquidus temperature.
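
The lever-rule readings quoted for the alloy of composition c_0 can be reproduced with a short calculation. The boundary compositions used below (α at 0.20, eutectic liquid at 0.48, β at 0.90, and c_0 = 0.69) are hypothetical values chosen to give fractions similar to those in the text; they are not read from Fig. 7.14.

```python
def lever_rule(c0, c_phase1, c_phase2):
    """Phase fractions (f1, f2) for overall composition c0 in a two-phase
    field whose boundary compositions are c_phase1 and c_phase2."""
    lo, hi = sorted((c_phase1, c_phase2))
    if not lo <= c0 <= hi:
        raise ValueError("c0 lies outside the two-phase field")
    f1 = (c_phase2 - c0) / (c_phase2 - c_phase1)
    return f1, 1.0 - f1

# Just above the eutectic temperature: liquid (0.48) + beta (0.90), assumed
f_liquid, f_beta_above = lever_rule(0.69, 0.48, 0.90)
# Just below the eutectic temperature: alpha (0.20) + beta (0.90), assumed
f_alpha, f_beta_below = lever_rule(0.69, 0.20, 0.90)
print(f_liquid, f_beta_above, f_alpha, f_beta_below)
```

With these assumed compositions the liquid and β fractions are equal just above the eutectic temperature, and the α fraction is about 0.3 just below it.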

Kinetic deviations from this equilibrium analysis of Fig. 7.14 can be predicted
approximately. First consider the effects of kinetic processes with the longest time
scales, since these will be the first to alter phase formation as cooling or heating
becomes more rapid. Atom diffusion in the solid phases is a kinetic process that
is typically slow. Solid-state diffusion will modify the equilibrium cooling curves
of Fig. 7.14 as follows. At the highest temperature where the β-phase first forms
during cooling (the liquidus temperature for the composition c_0 in Fig. 7.14), the
β-phase is especially rich in B atoms. At lower temperatures the phase diagram
shows that this initial β-phase is too rich in B atoms, and some B atoms in the
solid should exchange with A atoms in the liquid. This often does not occur
because it requires atoms in the interior of the crystals of the β-phase to migrate
to the outside of the crystal to reach the liquid. Diffusion in the solid is often too
slow to ensure equilibrium during cooling. In essence, the extra B atoms in the
β-phase are unable to participate in the equilibrium process, and the effective
alloy composition can be considered to decrease below c_0 (see Chapter 3). The
effects are shown in the thick broken curves of Fig. 7.14. Although the average
composition of the β-phase is too rich in B atoms, at the surface of β-phase
crystals the new layers of solid are forming with their equilibrium composition, and
the adjacent liquid composition is given by the equilibrium liquidus line.
Conservation of solute requires that the total amount of β-phase be somewhat
suppressed in comparison to the amount of liquid (consistent with the effective
overall composition decreasing below c_0). The same argument holds for temperatures
below the eutectic temperature: the B-rich cores of the β-phase cause less
β-phase to be present with the α-phase at low temperatures.

A common phenomenon that occurs during freezing of eutectic systems is the
formation of “dendritic” or tree-like microstructures of intertwined α- and
β-phases. There are two reasons why the interface between the liquid and solid
may not be stable to asperities in the solid, which grow forward rapidly into the
liquid (Chalmers 1959; Mullins and Sekerka 1964). First, there is latent heat
release during freezing, and this makes the interface warmer than the solid and
liquid immediately around the interface. The interface, however, is at the melting
temperature, and therefore the nearby liquid must be below the melting
temperature and especially unstable to solid formation. A protuberance that juts into
this undercooled liquid can grow quickly. Second, the new solid phase at the
interface rejects A atoms into the liquid, making a liquid that freezes at a lower
temperature. This chemical contribution to the liquid instability to
protuberances is called “constitutional supercooling.” Solid dendritic structures grow
rapidly into the liquid, often with side branches. Between these dendrites is a
liquid of a composition which solidifies at lower temperatures. Dendritic freezing
produces coarse compositional inhomogeneities in the solidified material (see
Chapter 3), and processing such as forging may be required to eliminate them.


7.2.2 
Nucleation Kinetics 


Nucleation of a solid phase frequently slows the kinetics of formation of a new 
phase during cooling. Nucleation is a broad topic, which is discussed in Section 
7.3, but suffice it to say that some undercooling is necessary before the new solid 
phase can form. Undercooling is especially important for solid → solid phase
transformations, although some undercooling is also typical of liquid → solid
phase transformations.¹¹⁾ There is an energy penalty associated with the
solid-liquid interface, and undercooling is needed to boost the thermodynamic driving
force for nucleating a new volume of solid. The undercooling depends on many 
factors, including contact between the liquid and the walls of its container, so it is 
not easy to predict. Nevertheless, at exactly the critical temperature of a phase 
transformation, the free energies of the two phases are equal, so it takes an infi- 
nite amount of time to nucleate the new phase at the critical temperature. 

The number of nuclei of the new phase that are able to grow increases rapidly 
with the undercooling below the critical temperature of the phase transformation 
(Section 7.3). On the other hand, the diffusional processes needed for growth are 
thermally activated, and are slower at lower temperatures. The general kinetics of 
a nucleation-and-growth phase transformation is indicated on the left in Fig. 7.15. 
The rate of the phase transformation is zero at the critical temperature and above 
(owing to nucleation), and sluggish at low temperatures (owing to diffusion). On 
the right-hand side of Fig. 7.15 is a kinetics map called a “time-temperature- 
transformation,” or TTT, diagram. It pertains to a particular phase transforma- 
tion at a particular chemical composition. The assumption in using a TTT dia- 
gram is that the high-temperature phase is cooled instantaneously to the temper- 
ature on the ordinate of the plot, and held at this temperature for various lengths 
of time. The degree of completion of the phase transformation (often given as 
10% and 90%) varies with temperature and time, as shown in the TTT diagram. 
A typical nucleation-and-growth transformation has a “nose” at a temperature 
where the transformation occurs in the shortest time. 
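
The competition between nucleation and diffusion that produces the “nose” can be illustrated with a toy model. The sketch below assumes a nucleation factor with a barrier that diverges as the undercooling vanishes and an Arrhenius diffusivity; the functional forms and every parameter value (T_c, A, Q_over_k) are hypothetical, chosen only to show the shape of the curve.

```python
import numpy as np

def transformation_rate(T, T_c=1000.0, A=2.0e7, Q_over_k=1.5e4):
    """Relative rate ~ N(T) * D(T) below T_c, in arbitrary units.

    N ~ exp(-A / (T * (T_c - T)**2)) mimics a nucleation barrier that
    diverges as the undercooling vanishes; D ~ exp(-Q_over_k / T) is a
    thermally activated diffusivity. All parameters are hypothetical.
    """
    T = np.atleast_1d(np.asarray(T, dtype=float))
    undercooling = T_c - T
    rate = np.zeros_like(T)
    ok = undercooling > 0          # no transformation at or above T_c
    rate[ok] = np.exp(-A / (T[ok] * undercooling[ok] ** 2) - Q_over_k / T[ok])
    return rate

temps = np.linspace(300.0, 1000.0, 701)
rates = transformation_rate(temps)
nose = temps[np.argmax(rates)]     # temperature of the shortest transformation time
print(f"nose of the TTT curve near {nose:.0f} K")
```

Since the time to reach a given degree of completion scales roughly as the inverse of this rate, plotting 1/rate against temperature gives the C-shaped curve of a TTT diagram.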

Finally, at the top of Fig. 7.15 are drawn microstructures for two
nucleation-and-growth transformations that came to the same degree of completion in the
same time, consistent with the TTT diagram. For the microstructure on the
left, which formed at the lower temperature, the number of nuclei was high, but
their growth rates were low. The opposite case is true for the microstructure on
the right. In general, a phase transformation that occurs at a lower temperature
produces a finer-grained microstructure than one occurring at a higher temperature.

Fig. 7.15 Left: the number of nuclei, N, and diffusivity, D, at
temperatures near the critical temperature, T_c, of a nucleation-and-growth
type of phase transformation. Right: the TTT diagram, with
typical microstructures above.

11) On the other hand, superheating tends to be much less of an effect: an alloy
will usually begin to melt close to the temperatures predicted from the phase
diagram, even for moderately fast heating rates.


7.2.3 
Suppressed Diffusion in the Liquid (Glasses) 


Liquid diffusion is much faster than diffusion in a solid, but liquid-phase
diffusion can become an issue at much higher cooling rates. The partitioning of B
atoms between the solid and liquid can be suppressed at high cooling rates, and
in this case the β-phase forms with a lower concentration of B atoms than
predicted with the phase diagram of Fig. 7.14. An effective extension of A-atom
solubility in the β-phase is expected owing to this kinetic constraint. An extreme
case, amenable to convenient analysis, can occur for cooling rates in excess of
order 10^6 K s^-1. Here we assume that solute partitioning between the solid and
liquid is impossible, and all regions of the alloy maintain a composition of c_0 at
all temperatures. A set of free energy versus composition curves is shown in
Fig. 7.16 (consistent with Fig. 7.7). The common tangent construction for
thermodynamic equilibrium predicts the coexistence of two solid phases (α and β) at
this temperature, but without solute partitioning the liquid phase is preferable
over a wide range of compositions. Without solute partitioning between the α- and
β-phases, the formation of a crystal of β-phase of composition c_0 would cause an
increase in free energy, as indicated by the line with the arrowheads. Such
crystallization does not occur. The liquid phase is especially stable for
compositions in the middle of Fig. 7.16. Near the eutectic composition, deep
undercoolings of the liquid are possible at high quench rates. At low temperatures,
however, the liquid becomes increasingly viscous, and as the viscosity increases
past 10^12 Pa s, the alloy will not crystallize in measurable times. This frozen
liquid is a “metallic glass” (Klement et al. 1960). Many alloys having phase
diagrams with low eutectic temperatures can be quenched into a metallic glass by
rapid cooling.

Fig. 7.16 Free energy versus composition curves, similar to those for
the phase diagram of Fig. 7.7 at a temperature below the eutectic
temperature. If solute partitioning between phases is suppressed by
kinetics, at this temperature we may expect liquid phase to exist for
compositions 0.28 < c < 0.72.

Figure 7.14 shows a broken line labeled “T_0” in the two-phase region of liquid
and β-phase coexistence. This line denotes the locus of intersections of the solid
and liquid free energy curves. One such point for one temperature is indicated
in Fig. 7.16 as the composition labeled c’. To the right of the composition c’ in
Fig. 7.16 the free energy curve of the solid β-phase is lower than the curve for
the liquid at the same composition, but the situation reverses to the left of c’.
The composition of the crossing of the two curves is roughly halfway between
the liquidus and β solidus lines on the phase diagram. The T_0 line denotes the
limit of solubility of A atoms in the β-phase, or equivalently the limiting
composition where metallic glass could form upon rapid cooling (see Chapter 3).


7.2.4 
Suppressed Diffusion in a Solid Phase (Solid-State Amorphization) 


Another example of a kinetic constraint is demonstrated with Fig. 7.17. Free
energy versus composition curves are shown for three solid phases. The new solid
phase, the γ-phase, is typically an ordered compound of A and B atoms. Many
such compounds have large unit cells, requiring substantial diffusion for their
nucleation. Their formation can be suppressed by rapid cooling, and if γ-phase
formation is ignored, the analysis of glass formation upon cooling is the same
as that used above for Fig. 7.16. More interesting, however, is the possibility
that intermetallic phases can be suppressed upon heating, because this can cause
crystalline phases to become amorphous. Suppose that solid phases of A-rich
α-phase and B-rich β-phase are brought into contact at low temperatures, and atoms
of A and B are allowed to interdiffuse across the interface. An important
consideration is that there are often large differences in the mobilities of A and
B atoms. As an extreme case, suppose that only the A atoms are diffusively mobile
in the solid phases. The formation of the γ-phase, with its large unit cell,
usually requires the coordinated movements of B atoms. If these movements are
suppressed kinetically, the γ-phase formation is suppressed. Without the
possibility of nucleating the γ-phase, the free energy may reduce to the next best
phase, the liquid phase, as indicated by the first arrow in Fig. 7.17. This process
of “solid-state amorphization” produces an amorphous phase at the interface
between the α- and β-phases which is stable at low temperatures. Solid-state
amorphization was first reported in 1983 (Schwarz and Johnson 1983), and many
examples have since been found (Johnson 1986). In the time required for the layer
of metallic glass to grow to macroscopic dimensions, however, the γ-phase usually
nucleates and then grows quickly, consuming the amorphous layer.

Fig. 7.17 Free energy versus composition curves, similar to those of
Fig. 7.16, but with a curve for an additional γ-phase. Arrows indicate
reductions in free energy that are possible from a starting mixture of
α- and β-phases for overall composition c = 0.33.


7.2.5 
Combined Reactions 


Microstructures of engineering materials are often complex, having composi- 
tional inhomogeneities, different morphologies for different phases, and different 
defect structures within the different phases. Many such microstructural features
can be altered independently. For example, a phase with fast solid-state diffusion 
kinetics may undergo recovery processes at a lower temperature than a phase 
with slower atom mobilities. The consequent difference in defect concentrations 
can lead to differences in free energies of the two phases, and this can affect the 
relative stabilities of the two phases. 

When multiple microstructural features relax toward equilibrium, the kinetics 
are usually altered in comparison to independent relaxation processes. In the 
simplest case, two transformations such as defect recovery and chemical unmix- 
ing may occur sequentially. For sequential reactions it may be possible to con- 
sider the steps independently, but the earlier transformations are altered because 
they occur in material that is not relaxed in its other microstructural parameters. 

More complex “combined reactions” may have simultaneous relaxations of
microstructural parameters. Usually such transformations are first-order or dis- 
continuous, because second-order transformations must satisfy symmetry rela- 
tionships between the new phase and the parent phase (Landau and Lifshitz 
1969). These symmetry constraints are difficult to satisfy when multiple parame- 
ters are free to change. A common combined reaction occurs at a reaction front 
where the parent microstructure is consumed, and a new phase (or phases) grows 
into it. Behind the moving interface is a microstructure that is closer to equilib- 
rium in its chemical composition and defect concentrations owing to atom move- 
ments that bring equilbrium to the interface. For such simultaneous combined 
reactions, the decrease in free energy is larger as the interface moves further. 
This reduction in free energy has contributions from both chemical rearrange- 
ment and defect annihilation. The driving force for interface motion has similar 
factors, and this kinetic process involving combined, simultaneous reactions can 
be quite complicated when the relaxation of one microstructural feature, for 
example vacancy annihilation, alters the kinetics of another relaxation process, for 
example diffusional unmixing. Reactions in steels are often characterized by such 
complexity, as described in a review article (Hornbogen 1979). A phase field 
approach to coupled evolution of composition and dislocation fields was reported 
recently (Haataja and Leonard 2004). 


7.2.6 
Statistical Kinetics of Phase Transformations 


Section 7.1.9 explained in general how pair variables account for the atomic struc- 
ture and the thermodynamics of alloy phase transformations in a more detailed 
way, and hinted at how higher-order cluster variables could be defined and used 
for thermodynamics. This section describes an atomistic kinetic theory that uses 
such a level of structural detail. Any full kinetics treatment requires a mecha- 
nism for atom movement. Here the kinetic mechanism is the exchange of an 
atom with a neighboring vacancy. The moving atom breaks its initial pairs and 
forms new ones after it moves to its new site. Rate equations based on activated- 
state rate theory account for how the chemical neighborhood of a moving atom 
affects the rate of its diffusive jump. Specific developments of rate equations and 
their solution for chemical ordering on different lattices are given by Kikuchi and 
Sato (1969, 1972), Sato and Kikuchi (1976), Gschwend et al. (1978), and Fultz 
(1990). With all these theories, a steady-state solution is found by setting the rates 
of change equal to zero, and the state variables become identical to those of 
thermodynamic equilibrium for the same level of approximation. The path to 
equilibrium, however, depends on the individual atom mobilities and chemical 
preferences, and proves rich in its own phenomena. 


7.2.7 
Kinetic Pair Approximation 


We define a pair variable as $p_{ij}$, where, for example, the two sites on an arbitrary 
nearest-neighbor pair have the probability $p_{AB}$ of being occupied by an A atom 
and a B atom. With long-range order (LRO) and sublattice formation as in 
Fig. 7.10, it is necessary to define sublattice preferences for A atoms on the 
α-sublattice, for example, so $p_A^\alpha \neq p_A^\beta$ when LRO exists. Conditional pair 
probabilities, such as the probability of having an A on α, given a B on β, 
$p(A\alpha|B\beta)$, are also useful for describing short-range order (SRO). For 
conciseness, in this section the exponents of Boltzmann factors will be written 
without the required factors of $(k_B T)^{-1}$. 

Figure 7.18 depicts the pairs undergoing change as an A atom changes sites 
with a vacancy. This example pertains to the square lattice of Fig. 7.10, or to the 
B1 or B2 ordered structures, for example. We make two assumptions. 

• There are no vacancy-atom interactions. These can be 
handled easily (Fultz 1992), but we neglect them here for 
clarity. 

• In this activated-state rate theory, the moving A atom 
surmounts an activation barrier. If it surmounts the barrier, 
it always moves to the new configuration on the right of Fig. 
7.18, irrespective of the types of neighbors in the new 
environment. The jump process is not influenced by the new 
pairs formed by the jump, but the initial pairs do influence 
the jump. For example, if the A atom initially has an 
energetically unfavorable chemical neighborhood, it is more 
likely to jump. 

Fig. 7.18 Left: an A-V pair before the jump, showing the initial bonds 
to neighboring pairs on the sublattices of a square lattice. Right: a V-A 
pair after the jump, showing the pairs that were formed by the jump. 







The kinetic master equation developed in Chapter 5 is applied here to show 
how pair variables evolve with time. Rate equations for the different pair variables 
must be developed independently, and a set of coupled ordinary differential 
equations must be solved.¹²⁾ For example, the exchange of a B atom with a vacancy, V, 
has no effect on the number of A-A pairs. The kinetic master equation allows us 
to write an expression [Eq. (41)] for how the jump process of Fig. 7.18 alters the 
number of A-A pairs. 


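$$
\left.\frac{\mathrm{d}p_{AA}^{\alpha\beta}}{\mathrm{d}t}\right|_{\alpha \to A \to \beta}
= p_{AV}^{\alpha\beta}\,\Gamma(\{p(i\beta|A\alpha)\}, T)\,(z-1)
\left( p(A\alpha|V\beta)
- \frac{p(A\beta|A\alpha)\,e^{e_{AA}}}
       {p(V\beta|A\alpha) + p(A\beta|A\alpha)\,e^{e_{AA}} + p(B\beta|A\alpha)\,e^{e_{AB}}} \right)
\tag{41}
$$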


The second subscript on the left-hand side of Eq. (41) identifies the kinetic step of 
Fig. 7.18, which is α → A → β (a reverse motion of a vacancy is implied). The 
right-hand side has two main factors. The first is a frequency factor, which 
depends on the number of A-V pairs that are able to exchange and on a jump rate, 
Γ, that is set by the initial configuration of atoms around the A atom (described 
by $\{p(i\beta|A\alpha)\}$, where i may be A or B). 

The second main factor in the large parentheses ( ) is the statement of the ki- 
netic master equation, that the change in the number of A-A pairs has a positive 
contribution from the new A-A pairs that are formed by the jump, and a negative 
contribution from the broken A-A pairs that existed before the jump. Since the 
neighborhood of the A atom after the jump is assumed to have no influence on 
the jump rate, the positive term is a simple probability that does not depend on 
the temperature or energetics of the final configuration, i.e., $(z-1)\,p(A\alpha|V\beta)$.¹³⁾ 
The jump frequency does depend, however, on the initial chemical environment 
around the A atom on the α-sublattice. The more complicated fraction in Eq. (41) 
is the rate at which the moving A atom leaves its surrounding neighborhood ow- 
ing to the influence of its A neighbors. This contains a probability of the different 
numbers of A neighbors, and a Boltzmann factor for how these A neighbors alter 
the initial energy of the moving A atom. (The denominator is a normalization for 
the rate from the full environment of the A atom.) This factor shows the level of 
detail provided by the pair approximation. It is obtained from a weighted average 
of all types of A-A pairs. With a binomial distribution of probabilities for 
independent pairs, the weight is simply $p(A\beta|A\alpha)$. The presence of one A-A pair 
does not correlate with the presence of other pairs. A similar type of averaging is 
performed in Eq. (42) to obtain the rate $\Gamma(\{p(i\beta|A\alpha)\}, T)$ of Eq. (41). 


$$
\Gamma(\{p(i\beta|A\alpha)\}, T) = e^{-e_A^*}
\left[ p(V\beta|A\alpha) + p(A\beta|A\alpha)\,e^{e_{AA}} + p(B\beta|A\alpha)\,e^{e_{AB}} \right]^{z-1}
\tag{42}
$$


12) For low vacancy concentrations the set is “stiff,” and can be numerically 
challenging to solve efficiently. 

13) Note that the final neighbors of the A atom are the initial neighbors of the 
vacancy on the β-sublattice. The coordination number is reduced by 1 because one 
of the pairs is the moving A atom. 

One way to understand Eq. (42) is by expanding the trinomial. Each term in this 
expansion corresponds to a possible configuration of A, B, and V in the 
neighborhood about the A atom, with its average Boltzmann factor for the exchange. 
The trinomial expansion is consistent with the assumption of randomness in the 
placements of pairs with respect to other pairs. The factor $e^{-e_A^*}$ in Eq. (42) 
accounts for the activation barrier height without chemical interactions. 
Substituting Eq. (42) into (41) gives a rate equation for the kinetic process 
α → A → β. This is but one of 32 such rate equations (Fultz 1990) for the change 
in the nine pair variables $\{p_{ij}^{\alpha\beta}\}$ through changes of the 18 
conditional pair probabilities $\{p(i\beta|j\alpha),\ p(i\alpha|j\beta)\}$, 
where i and j denote {A, B, V}. 
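
As a minimal sketch of Eqs. (41) and (42), the Python functions below evaluate the jump rate Γ both in closed form and by summing explicitly over the neighbor configurations of the trinomial expansion, and then evaluate the rate of change of A-A pairs for the step α → A → β. The function and argument names are illustrative assumptions, not taken from the cited papers. Energies are in units of k_BT, as in the text, and the barrier enters as exp(−e*).

```python
import math


def gamma_closed(p_v, p_a, p_b, e_aa, e_ab, e_star, z):
    """Eq. (42): jump rate of an A atom with z - 1 independent neighbors.

    p_v, p_a, p_b stand for p(V beta|A alpha), p(A beta|A alpha) and
    p(B beta|A alpha); all energies are in units of k_B T.
    """
    site_average = p_v + p_a * math.exp(e_aa) + p_b * math.exp(e_ab)
    return math.exp(-e_star) * site_average ** (z - 1)


def gamma_expanded(p_v, p_a, p_b, e_aa, e_ab, e_star, z):
    """Same rate, summed term by term over neighbor configurations."""
    total = 0.0
    for n_a in range(z):                 # number of A neighbors, 0..z-1
        for n_b in range(z - n_a):       # number of B neighbors
            n_v = z - 1 - n_a - n_b      # remaining neighbors are vacancies
            ways = math.comb(z - 1, n_a) * math.comb(z - 1 - n_a, n_b)
            probability = ways * p_a**n_a * p_b**n_b * p_v**n_v
            boltzmann = math.exp(n_a * e_aa + n_b * e_ab)
            total += probability * boltzmann
    return math.exp(-e_star) * total


def daa_dt(p_av, p_a_alpha_given_v, p_v, p_a, p_b, e_aa, e_ab, e_star, z):
    """Eq. (41): rate of change of A-A pairs from the step alpha -> A -> beta."""
    rate = gamma_closed(p_v, p_a, p_b, e_aa, e_ab, e_star, z)
    site_average = p_v + p_a * math.exp(e_aa) + p_b * math.exp(e_ab)
    broken = p_a * math.exp(e_aa) / site_average   # A-A pairs lost per neighbor
    formed = p_a_alpha_given_v                     # A-A pairs gained per neighbor
    return p_av * rate * (z - 1) * (formed - broken)
```

Calling `gamma_closed` and `gamma_expanded` with the same arguments, for example `(0.01, 0.39, 0.60, 0.3, -0.2, 5.0, 4)` with arbitrary values chosen only for illustration, should give results that agree to rounding error. That agreement is the content of the trinomial expansion.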


7.2.8 
Equilibrium State of Order 


Equilibrium at a given temperature is obtained by setting the rates of change 
equal to zero. For example, from Eq. (41) we obtain Eq. (43). 





$$
0 = p(A\alpha|V\beta)
- \frac{p(A\beta|A\alpha)\,e^{e_{AA}}}
       {p(V\beta|A\alpha) + p(A\beta|A\alpha)\,e^{e_{AA}} + p(B\beta|A\alpha)\,e^{e_{AB}}}
\tag{43}
$$


To recover the thermodynamics of a binary A-B alloy, we neglect terms like 
$p(V\beta|A\alpha)$ because the vacancy concentration is small. The result from Eq. (43) is 
Eq. (44), and likewise for expressions involving B-atom motions it is Eq. (45), 
where the conditional probability p(i|j) is the probability of finding an i atom at 
a specific 1NN site near a known j atom. 

As mentioned in Section 7.1.9, the pair approximation accounts for the chemical 
SRO that exists even above the critical temperature for LRO. Equations (44) 
and (45) for the SRO are monotonic through the critical temperature of the phase 
transition, but they will be altered when sublattices are formed. 


7.2.9 
Kinetic Paths 


The formation of a sublattice, which involves long-range atom coordinations, is a 
relatively slow process. The statistical kinetics approach as developed above pre- 
dicts three regimes of time scale (Gschwend et al. 1978). The first is the equilibra- 
tion of vacancies, which respond quickly to any change in atom configurations. 
The second, longer time scale is for SRO evolution, which is faster in proportion 
to the number of vacancies in the alloy. The slowest time scale is that for LRO, 
but its rate also scales with the number of vacancies. 

Fig. 7.19 (a) Kinetic paths for $V = 0.3$, $U = 1$, $e_{AA} = 0.3$ for the two 
cases $e_A^* = e_B^* + 5$ and $e_A^* = e_B^*$. (b) Free energy surface showing 
initial and final states of panel (a). Here LRO = $(p_A^\alpha - p_A^\beta)/4$. 

With multiple state variables, it is often useful to make parametric plots of 
one variable versus another, termed “kinetic paths” (Fultz 1989). Plots of kinetic 
paths will be independent of vacancy concentration, so long as the vacancy con- 
centration is low. Examples are presented in Fig. 7.19(a). The solid curves were 
calculated for identical chemical interactions, but with different activation barrier 
energies. The end states of the kinetic paths are the same, having the SRO 
and LRO parameters of thermodynamic equilibrium for the selected value of 
$V = (e_{AA} + e_{BB} - 2e_{AB})/4$. For the case with $e_A^* \neq e_B^*$, the A and B atoms 
have different mobilities, and this imbalance helps drive the sublattice imbalance 
and the more rapid growth of LRO for the same initial conditions. Similarly, the 
individual exchanges of A and B atoms with vacancies require more information than 
the thermodynamic parameter V. Kikuchi and Sato (Kikuchi and Sato 1969, 1972; 
Sato and Kikuchi 1976) define the new parameter for chemical preferences of 
individual atom species, $U = (e_{BB} - e_{AB})/(e_{AA} - e_{AB})$. This U alters the kinetic 
paths, but does not alter the final equilibrium state of the alloy. 
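
To make the distinction between V and U concrete, the sketch below evaluates both parameters from a set of pair energies and inverts them. The function names are illustrative assumptions, and the reference energy e_AB = 0 is an arbitrary choice. Two alloys with the same V but different U share the same equilibrium state in this theory, yet their individual pair energies differ, and so do their kinetic paths.

```python
def ordering_parameter_v(e_aa, e_bb, e_ab):
    """Thermodynamic parameter V = (e_AA + e_BB - 2 e_AB) / 4."""
    return (e_aa + e_bb - 2.0 * e_ab) / 4.0


def kinetic_parameter_u(e_aa, e_bb, e_ab):
    """Kikuchi-Sato parameter U = (e_BB - e_AB) / (e_AA - e_AB)."""
    return (e_bb - e_ab) / (e_aa - e_ab)


def pair_energies(v, u, e_ab=0.0):
    """Invert V and U for (e_AA, e_BB) at a chosen e_AB (requires U != -1)."""
    d_aa = 4.0 * v / (1.0 + u)   # e_AA - e_AB
    d_bb = u * d_aa              # e_BB - e_AB
    return e_ab + d_aa, e_ab + d_bb


# Same V = 0.3, two different values of U (illustrative numbers only).
symmetric = pair_energies(0.3, 1.0)
asymmetric = pair_energies(0.3, 2.0)
```

Here `symmetric` and `asymmetric` both reproduce V = 0.3 when passed back through `ordering_parameter_v`, while `kinetic_parameter_u` returns 1 and 2 respectively.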

If the initial conditions for LRO have no bias of A atom occupancy for one of 
the sublattices, the kinetic master equation will have a detailed balance of rates of 
A-atom flux on and off both the α- and β-sublattices, and LRO cannot evolve. With 
some initial bias, an incubation time for the evolution of LRO is observed. To 
some extent this is a characteristic of a theory that has only pair variables and 
LRO parameters, and no state variables of intermediate length scales. Larger clus- 
ter variables do not solve this problem, however, because the formation of LRO 
still requires a breaking of symmetry as sublattice formation occurs, and such 
incubations are also found in kinetic Monte Carlo simulations. 

An advantage of statistical kinetic theories is that they allow mapping of the ki- 
netic path onto a corresponding free energy surface. This makes it possible to see, 
for example, that the change in activation energy can readily alter the time scale 
and kinetic path when LRO is evolving because this part of the kinetic path lies in 
a broad minimum of the free energy surface of Fig. 7.19(b). It is also clear that 
there is rapid variation of the free energy with SRO, whereas large changes in 
LRO have weaker effects on reducing the free energy. 

In theories of kinetic evolution containing multiple state variables, a free en- 
ergy surface may show considerable structure beyond a simple thermodynamic 
minimum. The existence of local minima is widely known; they are termed 
“metastable states.” Saddle points become increasingly common in functions 
with more independent variables. A saddle point in a free energy surface is 
marked “SP” in both Figs. 7.19(a) and (b). As mentioned previously, the initial 
conditions of the calculations need to be biased so that $p_A^\alpha \neq p_A^\beta$. Even with 
this bias, the breaking of symmetry caused by atom preferences for α- or 
β-sublattices is a slow process because it occurs near a saddle point of the free 
energy surface, where the gradients vanish. A kinetic arrest often occurs at such 
points, and this is confirmed with Monte Carlo simulations (Anthony and Fultz 
1994). Such states, which have been termed “pseudostable” (Fultz 1993), are an 
example of many kinetic processes where the initial breaking of symmetry is 
driven only weakly by the free energy. 


7.3 
Nucleation and Growth Transformations 


Section 7.1 described states of thermodynamic equilibrium and how they are 
summarized in phase diagrams, and Section 7.2 explained how kinetic processes 
alter the equilibrium phase fractions and compositions of alloys. The arguments 
of these previous sections tended to use generic interatomic interactions, since 
these are the basis for alloy thermodynamics. Ordering transformations, which 
occur at the spatial scale of atomic separations, are treated well with the argu- 
ments of Sections 7.1 and 7.2. Nevertheless, the formation of ordered compounds 
often occurs by nucleation and growth, which requires longer-range atom redis- 
tributions that can be treated classically. The present section 7.3 tends toward 
classical continuum theories for understanding microstructural evolution as a 
nucleus of a new phase grows within a parent phase. 


7.3.1 
Definitions 


Consider a binary A-B alloy with the temperature-composition equilibrium dia- 
gram of Fig. 7.20. Shown schematically on the diagram are two quenching experi- 
ments denoted by the downward arrows labeled “I” and “II.” In both cases a 
single phase is brought quickly to a final temperature, the tips of the arrows, 
where nucleation of another phase is imminent. In quench I, pure A is cooled 
from the liquid state to a temperature below its melting point where the resulting 
nuclei will be crystals of pure A. In quench II, the single phase α is brought into 
the two-phase “α + β region,” where now the β nuclei will, in general, possess a 
different crystal structure from the parent α. Moreover, the emerging β-phase will 
be of a composition much richer in component B than that of the α-phase. 

Fig. 7.20 Binary phase diagram depicting two quench paths, labeled I 
and II, that lead to nucleation of a new phase. L, liquid phase; α, β, solid 
phases; α + β, miscibility gap. Shading denotes regions of a single 
stable phase. 

Although cases I and II can be treated within the same general theoretical 
framework, it will sometimes be necessary to distinguish between them. The 
driving force for nucleation in case I depends on the free energy difference be- 
tween the pure solid and liquid at the final temperature, whereas the nucleation 
driving force for β particles involves free energy changes that also depend on 
concentration. Path II better illustrates a “supersaturated phase.” Immediately after 
this quench, the single phase α finds itself possessing a concentration of B greater 
than the solubility limit for the final temperature. The α-phase will relieve its 
supersaturation by precipitating B-rich particles. 

“Heterogeneous” nucleation refers to the formation of nuclei at specific sites in 
a material. In solid → solid transformations, heterogeneous nucleation occurs on 
grain boundaries, dislocation lines, stacking faults or any other crystalline defect. 
For liquids, the wall of the container is a common heterogeneous site. “Homoge- 
neous” nucleation refers to nuclei that form randomly throughout the bulk mate- 
rial, i.e., without preference for location. In solids, heterogeneous nucleation is 
usually the favored mechanism. 

Nucleation of solid phases in solids is classified as “coherent” or “incoherent.” 
Figure 7.21(a) illustrates, in two dimensions, an incoherent nucleus. The 
precipitating β-phase has a crystal structure different from the parent 
α-phase. In contrast, the coherent nucleus in Fig. 7.21(b) exhibits lattice planes 
that are continuous with the α matrix. The β particle in this case has a different 
composition and different lattice parameter from the α, but there is no structural 
discontinuity between the two phases. Heterogeneous, incoherent nucleation is 
the most common form of precipitation reaction in solids. Homogeneous, coherent 
nucleation is observed in some systems, however, including well-documented 
cases of Co-rich precipitates in the Cu-Co system, metastable δ′ (Al₃Li) nuclei 
in Al-Li alloys, and the γ′ phase in Ni-based superalloys. To date, there have 
been no reported instances of homogeneous, incoherent nucleation in any 
crystalline solid. 

Fig. 7.21 Schematic representation of (a) an incoherent precipitate, 
(b) a coherent precipitate, and (c) a semicoherent interface. For the 
coherent case crystal lattice planes are continuous throughout the 
matrix and particle, whereas for a semicoherent boundary a periodic 
array of dislocations relieves some of the elastic strain. 

The key difference between coherent and incoherent nucleation is that, in the 
incoherent case, a definite surface of discontinuity between the nucleating and 
parent phases can be identified and a surface energy can be assigned to it. In 
the coherent case, a concentration gradient and elastic distortion exist between 
the two phases. The elastic field usually has a significant effect on the thermody- 
namics of nucleation. 

There is a type of interface intermediate between coherent and incoherent. To 
relieve the elastic strain of a lattice mismatched interface, an array of edge dislo- 
cations may form on the boundary as shown in Fig. 7.21(c). An interface that is 
coherent except for a periodic sequence of dislocations is termed “semicoherent.” 
With increasing mismatch of lattice parameters, the spacing of dislocations de- 
creases and the interfacial energy increases. The same can be said of a low-angle 
grain boundary, which is in fact a special case of a semicoherent boundary. 


7.3.2 
Fluctuations and the Critical Nucleus 


The probability of occurrence of a nucleus, $\mathcal{P}$, is given by Eq. (46). 


$$
\mathcal{P}(T) \propto \exp\left(-\frac{W}{k_B T}\right) \tag{46}
$$




Since the volume of the nucleus is very small, the work of formation W must 
include an extra term Aσ, the area of interface between the two phases times the 
surface energy. The introduction of surface costs energy, and σ, the reversible 
work required to form a unit area of surface, is a positive quantity. From the 
change in Gibbs free energy, the work of formation is calculated via Eq. (47), 
where $\Delta G_V$ is the Gibbs free energy per unit volume of forming the new phase 
from the parent phase, and V is the volume of the new phase. 


$$
W = V\,\Delta G_V + A\sigma \tag{47}
$$


Assume that the surface energy of the second phase is isotropic, i.e., σ is 
independent of crystallographic directions and planes.¹⁴⁾ For all β particles of a 
specified volume, the preferred shape will be a sphere of radius R, which minimizes 
the surface area-to-volume ratio, and Eq. (48) is obtained. 


$$
W = \frac{4}{3}\pi R^3\,\Delta G_V + 4\pi R^2 \sigma \tag{48}
$$


In most cases low-index planes such as {100}, {111}, etc., will possess a lower 
surface energy than higher-index planes, e.g. {221}. In these instances the shape 
that minimizes the total energy will have a larger surface area than a sphere, but 
will expose a greater area of low-index planes. 

When $\Delta G_V > 0$, the work of formation rises very quickly with crystal size, and 
nucleation does not occur. When $\Delta G_V < 0$, nucleation of the β-phase is possible 
for large R where the contribution from the surface energy is relatively small. As 
shown in Fig. 7.22 for $\Delta G_V < 0$, the negative term varying as $R^3$ is added to a 
positive surface contribution varying as $R^2$. The result of this competition is a 
maximum in the work of formation given by W*, located at the size R*. For small 
R < R*, W increases with R. Because Boltzmann statistics govern the fluctuation 
process, it is more probable that a nucleus smaller than R* will dissociate, rather 
than grow by atom attachment. For sizes greater than R*, growth is more likely. 
The quantity R* locates the maximum in the W(R) function, and R* is called the 
“critical radius.” Similarly, W* is the critical work of formation, and represents 
an energy or activation barrier that must be overcome for the stable phase to nu- 
cleate and grow. The values of the critical quantities can be determined by a 
straightforward differentiation [Eqs. (49) and (50)]. 











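$$
\left.\frac{\mathrm{d}W}{\mathrm{d}R}\right|_{R=R^*} = 0
\;\Rightarrow\; R^* = -\frac{2\sigma}{\Delta G_V} \tag{49}
$$

$$
W^* = W(R^*) = \frac{16\pi}{3}\,\frac{\sigma^3}{(\Delta G_V)^2} \tag{50}
$$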
14) This is not expected to be true in crystalline solids, especially when 
low-index crystallographic interfaces favor lower surface energies. Determination 
of the energy-minimizing shape for anisotropic cases requires the “Wulff 
construction,” which can be found in advanced texts on the thermodynamics of solids. 

Fig. 7.22 Work of formation vs. nucleus size for $T < T_m$. The 
positive surface energy contribution and the negative volume energy 
term lead to a critical nucleus radius, R*, and critical work of formation, 
W*. 


The critical radius and the critical work of formation are key quantities of 
nucleation theory. Using more precise definitions, a fluctuation whose size is 
less than the critical radius is called a “cluster” (some texts prefer the word “em- 
bryo”), and a fluctuation greater than critical size is called a “nucleus.” 
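
The critical quantities of Eqs. (49) and (50) are easy to check numerically. The sketch below uses illustrative values, ΔG_V = −1.0 × 10⁸ J m⁻³ and σ = 0.1 J m⁻², which are assumptions chosen only to give a nanometer-scale nucleus. The function names are likewise hypothetical.

```python
import math


def work_of_formation(radius, dg_v, sigma):
    """Eq. (48): W(R) for a spherical nucleus with isotropic surface energy."""
    return 4.0 / 3.0 * math.pi * radius**3 * dg_v + 4.0 * math.pi * radius**2 * sigma


def critical_radius(dg_v, sigma):
    """Eq. (49): R* = -2 sigma / dG_V, positive when dG_V < 0."""
    return -2.0 * sigma / dg_v


def critical_work(dg_v, sigma):
    """Eq. (50): W* = (16 pi / 3) sigma^3 / dG_V^2."""
    return 16.0 * math.pi * sigma**3 / (3.0 * dg_v**2)


DG_V = -1.0e8   # J/m^3, assumed driving force per unit volume
SIGMA = 0.1     # J/m^2, assumed surface energy

r_star = critical_radius(DG_V, SIGMA)
w_star = critical_work(DG_V, SIGMA)
```

Evaluating `work_of_formation` slightly below and slightly above `r_star` gives values smaller than `w_star`, which confirms that R* locates a maximum of W(R) rather than a minimum.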

For a system in equilibrium, the variation of its energy with respect to any ther- 
modynamic variable must be zero. This leads to an alternative and often very use- 
ful definition of the critical nucleus; it is that nucleus which is in thermodynamic 
equilibrium with the parent phase. This equilibrium at R* is unstable, of course. 

For the case of a binary system quenched from a single phase to a two-phase 
region of the equilibrium diagram, quench II, the expression for the work of for- 
mation is exactly the same as Eq. (47), but we must reconsider the meaning of 
$\Delta G_V$. The average concentration of the system is denoted by $c_0$ and, for 
nucleation to occur, $c_0$ must lie between the values $c_\alpha$ and $c_\beta$. Consider the 
effect of the internal pressure in the nucleus of the new phase, which originates 
from the positive surface tension. If we know that the pressure inside a critical 
nucleus is P, larger than the ambient $P_0$, we can draw the broken curve of Fig. 7.23 
for $G^\beta(P)$ above the $G^\beta(P_0)$ curve. This $G^\beta(P)$ is the free energy of the 
β-phase at the pressure of the critical nucleus, P. 

Let us assume that the volume of the critical nuclei of β-phase is so small that 
the loss of B atoms from the α-phase causes a negligible change in the 
composition of the parent α-phase. Then, since the critical nucleus is in 
thermodynamic equilibrium with the parent phase, the concentration of the nucleus 
can be found immediately by a second common tangent construction (Section 7.1.3) 
between the $G^\alpha$ and the broken $G^\beta(P)$ curves of Fig. 7.23. The quantity 
$\Delta G_V$ is found by an additional equilibrium condition, Eq. (51). 


$$
\frac{\partial W}{\partial c} = 0 \tag{51}
$$

Fig. 7.23 Graphical illustration of the quantity $\Delta G_V$ for a binary A-B 
solution. The solid arrow is the distance between a common tangent 
line constructed from the average composition $c_0$ and a parallel line 
tangential to the $G^\beta(P_0)$ curve. The approximation given by the broken 
arrow is found by the distance between the $G^\beta(P)$ and the $G^\beta(P_0)$ curves 
evaluated at the equilibrium concentration $c_\beta$. 


Invoking this constraint leads to a graphical determination of $\Delta G_V$ as shown in 
Fig. 7.23. It is the vertical distance between the common tangent line described 
above and a second line parallel to the first, but tangential to the $G^\beta(P_0)$ curve. 
The vertical distance is depicted in Fig. 7.23 as the downward pointing, solid 
arrow. Notice that the concentration of the critical nucleus is very close to that of 
the β-phase at equilibrium pressure. Furthermore, the value of $\Delta G_V$ is very nearly 
equal to the vertical distance between the solid and broken common tangent 
curves evaluated at the concentration $c_\beta$; i.e., the broken and solid arrows in Fig. 
7.23 are practically the same length. 
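
The parallel tangent construction can be sketched numerically. The example below is only an illustration under stated assumptions: the two free energy curves are hypothetical parabolas in arbitrary energy units, the pressure shift of the β curve is ignored (so the result corresponds to the solid arrow), and the function names are invented for this sketch. If the curves are molar free energies, dividing the result by the molar volume of the nucleus would give a driving force per unit volume.

```python
def parallel_tangent_driving_force(g_alpha, dg_alpha, g_beta, dg_beta, c0,
                                   lo=0.0, hi=1.0, iterations=200):
    """Return (c_nucleus, delta_g) from the parallel tangent construction.

    The tangent to the parent curve at c0 has slope dg_alpha(c0). The nucleus
    composition is where the beta curve has the same slope, found by bisection
    on [lo, hi], assuming dg_beta increases monotonically on that interval.
    """
    slope = dg_alpha(c0)
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if dg_beta(mid) < slope:
            lo = mid
        else:
            hi = mid
    c_n = 0.5 * (lo + hi)
    tangent_at_c_n = g_alpha(c0) + slope * (c_n - c0)
    return c_n, g_beta(c_n) - tangent_at_c_n


# Hypothetical parabolic free energy curves (arbitrary energy units).
def g_alpha(c):
    return 1.0 * (c - 0.1) ** 2


def dg_alpha(c):
    return 2.0 * (c - 0.1)


def g_beta(c):
    return 2.0 * (c - 0.9) ** 2


def dg_beta(c):
    return 4.0 * (c - 0.9)
```

With these curves the equilibrium solubility limit of α is c = 0.1. A call with `c0 = 0.2` (supersaturated) returns a negative driving force, while `c0 = 0.1` returns zero to rounding error and `c0 = 0.05` returns a positive value, for which nucleation of β is not favored.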


7.3.3 
The Nucleation Rate 


The thermodynamic concept of the critical nucleus of the preceding section, due 
to Gibbs (Gibbs 1948), provides key insights into the nucleation process, but it 
does not address a quantity of primary importance: the nucleation rate. Through- 
out the 1920s to 1940s, the Gibbs droplet picture of nucleation was extended to 
include kinetic considerations. Of primary importance in the subsequent discus- 
sion is the work of Volmer and Weber (1926), Farkas (1927), Becker and Döring 
(1935), and Zeldovich (1943). 

In what follows, clusters and nuclei are not described by their radii, but instead 
by the number of molecules (or atoms) in them, termed the “number of 
monomers,” g, in the cluster. Equation (48) then gives rise to Eq. (52). 


$$
W(g) = \Delta G_V\,\Omega\,g + \sigma s_1 g^{2/3} \tag{52}
$$


Here Ω is the volume and $s_1$ is the surface area of the monomer. 

To establish a nucleation rate it will be important to define two cluster size 
distributions. The cluster size distribution given by n(g) refers to the number of 
clusters of size g that exist at equilibrium. From the preceding discussion of 
Boltzmann statistics n(g) is obtained from Eq. (53) where $n_1$ is the total number 
of monomers in the system. 


$$
n(g) = n_1 \exp\left(-\frac{W(g)}{k_B T}\right) \tag{53}
$$
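
As a short illustration of Eqs. (52) and (53), the sketch below evaluates W(g), the critical monomer number g* obtained from dW/dg = 0, and the equilibrium cluster distribution n(g). It assumes a spherical monomer of radius r, so that Ω = 4πr³/3 and s₁ = 4πr². That geometry, the numerical values, and all function names are assumptions made for the example only.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K


def work_of_formation(g, dg_v, omega, sigma, s1):
    """Eq. (52): W(g) = dG_V * Omega * g + sigma * s1 * g^(2/3)."""
    return dg_v * omega * g + sigma * s1 * g ** (2.0 / 3.0)


def critical_size(dg_v, omega, sigma, s1):
    """Monomer count g* at which dW/dg = 0 (requires dg_v < 0)."""
    return (-2.0 * sigma * s1 / (3.0 * dg_v * omega)) ** 3


def equilibrium_clusters(g, n1, temperature, dg_v, omega, sigma, s1):
    """Eq. (53): n(g) = n1 * exp(-W(g) / k_B T)."""
    w = work_of_formation(g, dg_v, omega, sigma, s1)
    return n1 * math.exp(-w / (K_B * temperature))


# Assumed spherical monomer; all numbers are illustrative only.
r = 1.5e-10                          # monomer radius, m
omega = 4.0 / 3.0 * math.pi * r**3   # monomer volume, m^3
s1 = 4.0 * math.pi * r**2            # monomer surface area, m^2
DG_V = -1.0e8                        # J/m^3
SIGMA = 0.1                          # J/m^2

g_star = critical_size(DG_V, omega, SIGMA, s1)
w_star = work_of_formation(g_star, DG_V, omega, SIGMA, s1)
```

For this geometry g* equals (R*/r)³ with R* from Eq. (49), and `w_star` reproduces W* of Eq. (50), so the change of variable from R to g leaves the critical work unchanged.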


The number of clusters of critical size can be found from Eq. (53) by replacing 
W with W*, where the critical work of formation in terms of the new spatial vari- 
able g is found in a procedure analogous to that indicated in Eqs. (49) and (50). 
Consider the different ways in which a cluster can change its size. A cluster con- 
taining five monomers can, by fluctuation processes, combine with a monomer to 
produce a cluster of size six, a g = 2 cluster can attach to a g = 7 cluster creating 
a size g = 9, etc. Here we consider only the most probable reactions, those 
occurring by single monomer attachment. Define β(g) as the rate of the reaction 
g → g + 1. Noting that the nucleation rate is simply the rate at which critical 
clusters are promoted to a larger size class, it seems reasonable that the 
nucleation rate is the number of critical clusters times β(g*) [Eq. (54)]. 


$$
J_{ss} = \beta(g^*)\,n_1 \exp\left(-\frac{W^*}{k_B T}\right) \tag{54}
$$


Equation (54) is the nucleation rate first proposed by Volmer and Weber. The sub- 
script “ss” refers to steady state. 

There is one serious problem with the Volmer and Weber prediction of the 
nucleation rate. The formation of clusters is through fluctuations. Therefore, it 
is possible for a cluster of critical size to lose monomers and thus be demoted to 
a lower size class, i.e., g* → g* − 1. The arguments leading to Eq. (54) do not 
include such reverse reactions. 

The cluster size distribution evolves in time, and we let f(g,t) represent the 
actual cluster size distribution at a given time t. An equation for the evolution of 
f can be established by first examining the flux of clusters from a given size g to 
size g + 1. Using β for the rate of monomer attachment and γ for the rate of 
monomer detachment, the net flux of clusters from size g to size g + 1 is given by 
Eq. (55). 


$$
J(g,t) = \beta(g)\,f(g,t) - \gamma(g+1)\,f(g+1,t) \tag{55}
$$




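Equation (55) can be transformed to a partial differential equation of the 
Fokker–Planck form, Eq. (56). 


$$
\frac{\partial f}{\partial t} = \frac{\partial}{\partial g}
\left[ \beta\,n\,\frac{\partial}{\partial g}\left(\frac{f}{n}\right) \right] \tag{56}
$$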


It is the flux evaluated at the critical size, g*, that sets the nucleation rate. The 
solution of this Fokker—Planck equation with the time derivative set equal to zero 
will yield the steady-state nucleation rate, but first boundary conditions must be 
specified. Let a factor Z be given by Eq. (57). 


$$
Z = \left[ -\frac{1}{2\pi k_B T}
\left.\frac{\partial^2 W}{\partial g^2}\right|_{g=g^*} \right]^{1/2} \tag{57}
$$


With a Taylor expansion of W(g) about the critical size and the definition of the 
complementary error function, the approximate result of Eq. (58) for the 
steady-state form of the cluster size distribution is obtained, a result found by 
Zeldovich. 


$$
f(g) = \frac{1}{2}\,n(g)\,\mathrm{erfc}\left[\sqrt{\pi}\,Z\,(g - g^*)\right] \tag{58}
$$


The quantity Z of Eq. (57) is known as the “Zeldovich factor.” It accounts for the 
possibility that a critical nucleus can either grow or shrink with equal probability, 
and larger nuclei have some probability of shrinking. The behavior of the func- 
tion f is shown by the curve labeled “Z” in Fig. 7.24. The actual distribution of 
clusters lies below the equilibrium value, but approaches n at small g. At g*, 
f(g) = n(g)/2. The Zeldovich curve in Fig. 7.24 is nonzero past the critical size, 
a result consistent with the fact that the reverse reactions of the type g → g − 1 
have been included in the formalism. 

Fig. 7.24 Steady-state cluster size distributions from Volmer–Weber 
(VW) and Zeldovich (Z) theories. Equilibrium distribution is the broken 
line, and g* is the critical cluster size. 

The steady-state nucleation rate of Eq. (59) can now be found by combining Eq. 
(58) with flux expressions at g = g*. 


$$
J_{ss} = Z\,\beta(g^*)\,n_1 \exp\left(-\frac{W^*}{k_B T}\right) \tag{59}
$$


This differs from the nucleation rate of Volmer and Weber by the factor Z. 
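
The role of the Zeldovich factor can be checked against a direct solution of the discrete cluster chain. The sketch below works in reduced units, W(g)/k_BT = −a g + b g^{2/3}, and assumes detailed balance between attachment and detachment, γ(g+1) n(g+1) = β(g) n(g), a standard closure that is not spelled out in the text. With f(1) = n(1) and f = 0 at a large cutoff size, the steady-state flux of Eq. (55) is then the reciprocal of a sum of 1/[β(g) n(g)]. The values a = 0.5 and b = 5, the attachment law β ∝ g^{2/3}, and all function names are illustrative assumptions.

```python
import math


def w_reduced(g, a, b):
    """W(g)/k_B T = -a*g + b*g^(2/3), the reduced form of Eq. (52)."""
    return -a * g + b * g ** (2.0 / 3.0)


def critical_size(a, b):
    """g* from d(W/k_B T)/dg = 0."""
    return (2.0 * b / (3.0 * a)) ** 3


def zeldovich_factor(a, b):
    """Eq. (57), using the second derivative of W/k_B T at g*."""
    g_star = critical_size(a, b)
    curvature = -(2.0 / 9.0) * b * g_star ** (-4.0 / 3.0)
    return math.sqrt(-curvature / (2.0 * math.pi))


def n_equilibrium(g, a, b, n1=1.0):
    """Eq. (53) in reduced units."""
    return n1 * math.exp(-w_reduced(g, a, b))


def f_steady(g, a, b, n1=1.0):
    """Eq. (58): Zeldovich steady-state cluster size distribution."""
    z = zeldovich_factor(a, b)
    shift = g - critical_size(a, b)
    return 0.5 * n_equilibrium(g, a, b, n1) * math.erfc(math.sqrt(math.pi) * z * shift)


def attachment_rate(g, beta0=1.0):
    """Assumed attachment rate, proportional to the cluster surface area."""
    return beta0 * g ** (2.0 / 3.0)


def j_zeldovich(a, b, n1=1.0):
    """Eq. (59): J_ss = Z * beta(g*) * n(g*)."""
    g_star = critical_size(a, b)
    return zeldovich_factor(a, b) * attachment_rate(g_star) * n_equilibrium(g_star, a, b, n1)


def j_discrete(a, b, g_max, n1=1.0):
    """Steady-state flux of the discrete chain with f(1) = n(1), f(g_max) = 0."""
    resistance = sum(math.exp(w_reduced(g, a, b)) / (n1 * attachment_rate(g))
                     for g in range(1, g_max))
    return 1.0 / resistance
```

For a = 0.5 and b = 5 the critical size is close to 300 monomers, and `j_discrete(0.5, 5.0, 2000)` should agree closely with `j_zeldovich(0.5, 5.0)`, whereas the Volmer–Weber estimate β(g*) n(g*) is larger by the factor 1/Z.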

To complete the picture of the nucleation rate we need to evaluate the only 
remaining unknown, β(g*). Here the functional form of the rate constant will 
depend on the type of system, liquid from vapor, solid-solid, etc. In the case of a 
liquid droplet nucleating from a supersaturated vapor it has been shown, using 
the kinetic theory of gases (Frenkel 1955), that β has the form of Eq. (60), where 
m is the mass of monomer. Note that the rate of attachment of monomer to the 
critical nucleus is proportional to the total surface area through the 
$s_1 (g^*)^{2/3}$ term. 


$$
\beta(g^*) = n_1 s_1 (g^*)^{2/3} \sqrt{\frac{k_B T}{2\pi m}} \tag{60}
$$


For nucleation of a crystalline solid from an amorphous or liquid phase, Turnbull 
and Fisher (1949) obtained the attachment rate from Eq. (61), where h is Planck’s 
constant, $\Delta G_a$ represents the activation energy required for an atom to jump 
across the interface between the parent phase and the nucleus, and λ is the 
jump distance. 


$$
\beta(g^*) = n_1 s_1 (g^*)^{2/3}\,\lambda\,\frac{k_B T}{h}
\exp\left(-\frac{\Delta G_a}{k_B T}\right) \tag{61}
$$


The above two expressions for the β parameter are actually of the same general 
form. In both cases the rate at which atoms are hopping across the interface of 
the nucleus is multiplied by the number of atoms that are poised to make the 
jump. For example, in Eq. (61), $k_B T/h$ represents a characteristic attempt 
frequency for the monomer and the exponential term represents the fraction of 
attempts that are successful. The term $s_1 (g^*)^{2/3} \lambda$ is the volume of a thin 
spherical shell immediately adjacent to the nucleus. Within this shell there are 
$n_1$ monomers per unit volume poised to make the jump onto the nucleus. 
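
The two attachment rates can be written side by side to show their common structure: a number of monomers poised at the surface, multiplied by a rate of crossing. The sketch below is illustrative only. The function and argument names are assumptions, and `dg_a` stands for the activation energy for a jump across the interface, in joules per atom.

```python
import math

K_B = 1.380649e-23    # Boltzmann constant, J/K
H = 6.62607015e-34    # Planck constant, J s


def beta_vapor(g_star, n1, s1, mass, temperature):
    """Eq. (60): attachment rate for a droplet nucleating from a vapor."""
    area = s1 * g_star ** (2.0 / 3.0)
    return n1 * area * math.sqrt(K_B * temperature / (2.0 * math.pi * mass))


def beta_turnbull_fisher(g_star, n1, s1, jump_distance, dg_a, temperature):
    """Eq. (61): attachment rate for a crystal nucleating in a condensed phase."""
    shell_volume = s1 * g_star ** (2.0 / 3.0) * jump_distance
    attempt_frequency = K_B * temperature / H
    success_fraction = math.exp(-dg_a / (K_B * temperature))
    return n1 * shell_volume * attempt_frequency * success_fraction
```

At 1000 K the attempt frequency k_BT/h is about 2 × 10¹³ s⁻¹, and in both expressions an eightfold increase in g* raises the attachment rate fourfold, as expected for a rate proportional to surface area.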

The nucleation rate of Eq. (59) is a very strong function of temperature; to 
illustrate the sensitivity, consider again quench I of Fig. 7.20. The quantity 
$\Delta G_V$ is approximately linear with temperature in the vicinity of the transition 
temperature. If the undercooling is defined as $\Delta T = T_m - T$, where $T_m$ is the 
melting temperature, Eq. (62) can be shown to hold, where L is the latent heat of 
fusion per unit volume and the approximation is valid for small $\Delta T$. 


$$
\Delta G_V \approx -\frac{L\,\Delta T}{T_m} \tag{62}
$$


The critical work of formation can now be found with Eq. (50) and the result,
when substituted into Eq. (59), yields the nucleation rate [Eq. (63)].

$$J_{ss} = J_0 \exp\left[-\frac{16\pi\sigma^3 T_m^2}{3 L^2 k_B T} \, \frac{1}{(\Delta T)^2}\right] \tag{63}$$


Here all the pre-exponential factors have been lumped into the quantity $J_0$.

Consider a typical system in which $T_m = 1000$ K, $L = 1.0 \times 10^{9}$ J m$^{-3}$, and
$\sigma = 0.1$ J m$^{-2}$. Furthermore, an order of magnitude estimate for $J_0$ is $10^{40}$
m$^{-3}$ s$^{-1}$, and for the purposes of illustration we will neglect the temperature
dependence of $J_0$. At an undercooling of 126 K below the melting point the
system will, after 1 s, contain about 100 nuclei per cubic meter, a number so low
that the β-phase is essentially undetectable. Just four degrees lower in temperature,
$\Delta T = 130$ K, the nucleation rate jumps by roughly a factor of 100. The extreme
sensitivity to temperature suggests that for practical purposes a specific
temperature can be identified as the onset of nucleation. The onset is referred to
as the critical undercooling.
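
The sensitivity described above can be checked with a short calculation. The following is a minimal sketch of Eq. (63), assuming the parameter values are read as $L = 10^{9}$ J m$^{-3}$, $\sigma = 0.1$ J m$^{-2}$, and $J_0 = 10^{40}$ m$^{-3}$ s$^{-1}$; the function name and defaults are illustrative choices, not part of the original treatment.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K


def nucleation_rate(delta_t, t_m=1000.0, latent_heat=1.0e9, sigma=0.1, j0=1.0e40):
    """Steady-state nucleation rate of Eq. (63), in m^-3 s^-1.

    delta_t      undercooling T_m - T, in K
    latent_heat  L in J m^-3 (assumed reading of the source value)
    sigma        interfacial energy in J m^-2 (assumed reading)
    j0           pre-exponential factor, treated as temperature independent
    """
    temperature = t_m - delta_t
    barrier = 16.0 * math.pi * sigma**3 * t_m**2 / (
        3.0 * latent_heat**2 * K_B * temperature * delta_t**2
    )
    return j0 * math.exp(-barrier)


# Scan a few undercoolings around the onset of nucleation.
for dt in (120.0, 126.0, 130.0, 140.0):
    print(f"dT = {dt:5.1f} K   J_ss = {nucleation_rate(dt):.3e} m^-3 s^-1")
```

With these assumed values the rate at 126 K is of order $10^2$ m$^{-3}$ s$^{-1}$ and rises by about two orders of magnitude at 130 K, which is the behavior that motivates the notion of a critical undercooling.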


7.3.4 
Time-Dependent Nucleation 


Consider again path II of Fig. 7.20. Immediately after the quench a metastable
α-phase matrix exists with B atoms arranged more or less randomly on the lattice;
there are few B-rich clusters of sizes greater than g = 1. For a nucleation event to
occur at some time after t = 0, a critical nucleus must form by atom transport
through the α crystal. So before the steady-state nucleation rate can be observed,
one has to wait for a time before a critical-size cluster can form by fluctuations.
The “incubation time” or “nucleation time lag” is the characteristic time before
which the nucleation rate is very low, and after which the nucleation rate is
approximately the steady-state value.

To calculate the incubation time, and to map out the full time-dependent
behavior of J, requires the solution to the Fokker-Planck equation [Eq. (56)].
Although the partial differential equation is linear, obtaining its analytic solution
is a formidable task. Two approximate solutions exist, but the mathematical
techniques employed are beyond the scope of this chapter and only the final results
will be presented. For the initial condition, assume only clusters of size g = 1
exist at t = 0. Trinkaus and Yoo (1987) used a Green’s function approach to arrive
at the time-dependent nucleation rate according to Eq. (64), where E is defined in
Eq. (65), and $\tau$ represents a natural time scale for the kinetic problem [Eq. (66)].

$$J(g^*, t) = J_{ss} \left(1 - E^2\right)^{-1/2} \exp\left[-\frac{\pi Z^2 (g^* - 1)^2 E^2}{1 - E^2}\right] \tag{64}$$

$$E = \exp\left(-\frac{t}{\tau}\right) \tag{65}$$

$$\tau = \frac{1}{2\pi Z^2 \beta(g^*)} \tag{66}$$


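Shi et al. (1990) obtained an analytic solution to Eq. (56) by employing the
technique of singular perturbation. The solution is Eq. (67), where the
dimensionless parameter $\lambda$ is given by Eq. (68).

$$J(t) = J_{ss} \exp\left\{-\exp\left[-2\left(\frac{t}{\tau} - \lambda\right)\right]\right\} \tag{67}$$

$$\lambda = (g^*)^{-1/3} - 1 + \ln\left\{3\sqrt{\pi}\, Z g^* \left[1 - (g^*)^{-1/3}\right]\right\} \tag{68}$$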





Although Eqs. (64) and (67) have different mathematical forms, the two
approximate solutions are in fact quite close (see Fig. 7.25). The two predictions of
the time-dependent nucleation rate are also consistent with the qualitative picture
described above: the rate is quite low for a time period on the order of $\tau$, and rises
rapidly to the steady-state value at later times. From the definition of $\tau$ it is clear
that the incubation time is a function of both the critical size and the attachment
rate term $\beta(g^*)$.
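
The closeness of the two transient solutions can be illustrated numerically. The sketch below evaluates $J/J_{ss}$ as a function of $t/\tau$ from the double-exponential form of Shi et al. and from the Green's function form of Trinkaus and Yoo, written here as reconstructed from the definitions of E and $\tau$ and the initial cluster size g = 1. The critical size and Zeldovich factor are hypothetical values chosen only for illustration.

```python
import math


def shi_lambda(g_star, z):
    """Dimensionless delay lambda of Eq. (68)."""
    x = g_star ** (-1.0 / 3.0)
    return x - 1.0 + math.log(3.0 * math.sqrt(math.pi) * z * g_star * (1.0 - x))


def rate_shi(t_over_tau, g_star, z):
    """J(t)/J_ss from Eq. (67): a double exponential in t/tau."""
    return math.exp(-math.exp(-2.0 * (t_over_tau - shi_lambda(g_star, z))))


def rate_trinkaus_yoo(t_over_tau, g_star, z):
    """J(t)/J_ss from Eq. (64), assuming only monomers exist at t = 0."""
    e = math.exp(-t_over_tau)          # Eq. (65)
    one_minus = 1.0 - e * e
    if one_minus <= 0.0:               # t = 0: no flux through g* yet
        return 0.0
    arg = math.pi * z**2 * (g_star - 1.0) ** 2 * e * e / one_minus
    return math.exp(-arg) / math.sqrt(one_minus)


G_STAR, Z = 100.0, 0.05  # hypothetical critical size and Zeldovich factor

for x in (0.5, 1.0, 2.0, 3.0, 5.0, 10.0):
    print(f"t/tau = {x:4.1f}   Shi: {rate_shi(x, G_STAR, Z):.4f}"
          f"   Trinkaus-Yoo: {rate_trinkaus_yoo(x, G_STAR, Z):.4f}")
```

Both expressions stay near zero for times up to a few $\tau$ and then approach unity, so the incubation time scales with $\tau$ and therefore with $1/[Z^2 \beta(g^*)]$.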


Fig. 7.25 Time-dependent nucleation rates $J(g, t)/J_{ss}$, normalized by the
steady-state value and plotted against $t/\tau$, labeled by the cluster size g. The
Trinkaus and Yoo (1987) solutions are the broken lines, and the Shi et al. (1990)
solutions are solid curves, with labels for three critical sizes.

7.3.5
Effect of Elastic Strain 


When a droplet of liquid condenses from gaseous vapor, a considerable volume 
change takes place, but this is easily accommodated by the flow of vapor. When 
a solid nucleates inside another solid, as illustrated in Fig. 7.21(b), the volume 
change may be smaller, but the nucleus is constrained by a stiff matrix. Accompanying
the formation of a β nucleus will be a buildup of stresses and strains in
both the matrix and precipitate. The elastic strain energy of the system must be
added to the work of formation of the nucleus, altering the kinetic picture pre- 
sented in the previous section. 
The nucleation of a cubic, elastically isotropic β-phase from an infinite cubic
isotropic α matrix can be treated as a sequence of four steps:
1. A sphere of α is cut out of the matrix material and allowed to
transform unconstrained to β. The free energy change in this
first step of the transformation is again $\Delta G_V$.
2. Surface forces (tractions) are applied to the sphere to bring it
back to the volume of α originally removed. The change in
volume required of the sphere may be either positive or
negative.
3. The β sphere is inserted into the α hole and the two phases
are “welded” together.
4. Finally, the system is allowed to relax to mechanical
equilibrium. In addition to the volume free energy decrease
and the surface area increase, the total elastic strain energy of
the nucleus-matrix system represents an extra term that
must be included in the work of formation [Eq. (48)].


The displacements, stresses, strains, and energy of the misfitting sphere for the 
elastic continuum problem described above were first solved in a classic paper 
by Eshelby (1954) (see also Eshelby 1956). 

We now modify the work of formation by adding to it the total strain energy
associated with the nucleus-matrix system. The strains and hydrostatic pressure
of the β sphere generate a strain energy density of the form of Eq. (69), where
$K_\beta$ is the bulk modulus of the β-phase, $\delta$ is a misfit strain arising from the
difference in lattice parameters between the α and β phases,
$A = 3K_\beta/(3K_\beta + 4\mu_\alpha)$, and $\mu_\alpha$ is the shear modulus of the α matrix. The total
strain energy for the nucleus, $E_\beta$, is given by Eq. (70):

$$e_\beta = \frac{1}{2} P \frac{\Delta V}{V} = \frac{9}{2} K_\beta (A - 1)^2 \delta^2 \tag{69}$$

$$E_\beta = \int_0^R 4\pi r^2 e_\beta \, dr = \frac{4\pi}{3} R^3 \, \frac{9}{2} K_\beta (A - 1)^2 \delta^2 \tag{70}$$


A similar procedure applied to the α-phase results in Eq. (71).

$$E_\alpha = \frac{4\pi}{3} R^3 \, 6\mu_\alpha A^2 \delta^2 \tag{71}$$


The total strain energy for the α matrix containing a spherical β nucleus is the
sum of Eqs. (70) and (71).

$$E = E_\alpha + E_\beta = \frac{4\pi}{3} R^3 \, 6\mu_\alpha A \delta^2 \tag{72}$$


Since the total strain energy is proportional to the volume of the nucleus, Eq.
(48) can be modified in the following way to Eq. (73).

$$W = \frac{4\pi}{3} R^3 \left(\Delta G_V + 6\mu_\alpha A \delta^2\right) + 4\pi R^2 \sigma \tag{73}$$


All of the parameters in the strain energy term of Eq. (73) are positive
quantities, whereas $\Delta G_V$ is negative. If the cost of elastic energy overrides the
decrease due to the $\Delta G_V$ term, i.e., $\Delta G_V + e > 0$ with $e = 6\mu_\alpha A \delta^2$ the strain
energy per unit volume of nucleus, homogeneous nucleation of the β-phase cannot
occur. Because the strain energy contribution is positive, the nucleation rate will
be much slower than predicted by Eq. (59) without strain. The bottom two curves
of Fig. 7.26 show the effect on the volume contribution to W(R) when strain energy
is included. An increase in the coefficient of the $R^3$ term leads to an increase in
both the critical radius and the critical work of formation. The shift to larger $W^*$
and $R^*$ is shown by the middle two functions in Fig. 7.26. By reference to Eq. (59)
it can be seen that even small increases in the work of formation cause drastic
reductions in the steady-state nucleation rate.
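
The shift of the critical quantities can be made concrete by setting dW/dR = 0 in Eq. (73), which gives $R^* = -2\sigma/(\Delta G_V + e)$ and $W^* = 16\pi\sigma^3/[3(\Delta G_V + e)^2]$. The sketch below evaluates these expressions; the moduli, misfit, driving force, and temperature are hypothetical values chosen for illustration only.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K


def strain_energy_density(k_beta, mu_alpha, delta):
    """Elastic energy per unit nucleus volume, 6 mu_alpha A delta^2 [Eq. (72)]."""
    a = 3.0 * k_beta / (3.0 * k_beta + 4.0 * mu_alpha)
    return 6.0 * mu_alpha * a * delta**2


def critical_nucleus(dg_v, sigma, e_strain=0.0):
    """Critical radius and work of formation from Eq. (73).

    Returns None when dg_v + e_strain >= 0, i.e. when the strain energy
    cancels the chemical driving force and nucleation cannot occur.
    """
    dg_eff = dg_v + e_strain
    if dg_eff >= 0.0:
        return None
    r_star = -2.0 * sigma / dg_eff
    w_star = 16.0 * math.pi * sigma**3 / (3.0 * dg_eff**2)
    return r_star, w_star


# Hypothetical inputs: moduli in Pa, misfit strain, driving force in J m^-3.
e = strain_energy_density(k_beta=1.5e11, mu_alpha=5.0e10, delta=0.005)
r0, w0 = critical_nucleus(dg_v=-1.3e8, sigma=0.1)
r1, w1 = critical_nucleus(dg_v=-1.3e8, sigma=0.1, e_strain=e)

temperature = 870.0  # K, hypothetical
print(f"strain energy density: {e:.3e} J m^-3")
print(f"R*: {r0:.3e} m -> {r1:.3e} m")
print(f"W*/kT: {w0 / (K_B * temperature):.1f} -> {w1 / (K_B * temperature):.1f}")
print(f"rate reduction factor: {math.exp(-(w1 - w0) / (K_B * temperature)):.2e}")
```

Because $W^*$ enters the rate through an exponential, a change of a few percent in the effective driving force is enough to change the nucleation rate by orders of magnitude.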


Fig. 7.26 Work of formation W(R) vs. cluster radius R for the case where elastic
strain energy is included. An increase in the volume energy term (lower
two curves), due to positive elastic energy, results in an increase in $W^*$
and $R^*$.

7.3.6
Heterogeneous Nucleation 


Homogeneous nucleation in solid systems is rarely observed. The usual type of 
nucleation is heterogeneous, where the second phase nucleates at dislocations or 
at grain boundaries. To extend classical nucleation theory to heterogeneous nucle- 
ation, we first discuss how the size, shape, and work of formation of the critical 
nucleus change in the presence of a defect. 

In homogeneous nucleation, a spherical nucleus was considered because it is
the shape that minimizes the surface area for a given volume. In the case of
heterogeneous nucleation on a grain boundary, the elimination of grain boundary
area causes the shape of minimum surface area to become a double spherical
cap. The surface area of a double spherical cap having a contact angle $\theta$ (see Fig.
7.27) is $4\pi(1 - \cos\theta)R^2$, and its volume is $2\pi(2 - 3\cos\theta + \cos^3\theta)R^3/3$, where R
is the (constant) radius of curvature. With these expressions, the work of
formation as a function of R can be formulated in Eq. (74) along the same lines as in
the derivation of Eq. (48).

$$W = \frac{2\pi}{3}\left(2 - 3\cos\theta + \cos^3\theta\right) R^3 \Delta G_V + 4\pi(1 - \cos\theta) R^2 \sigma - \pi R^2 \sin^2\theta \, \sigma_{gb} \tag{74}$$


The new last term on the right-hand side contains the grain boundary surface
energy, $\sigma_{gb}$, and has a negative sign. When a grain boundary nucleus is formed, an
area of pre-existing α-α interface is eliminated, as shown by the broken line in
Fig. 7.27. The energy saving of $\pi R^2 \sin^2\theta \, \sigma_{gb}$ decreases the work of formation.
Since this decrease is not realized for a nucleus forming in the bulk,
heterogeneous nucleation is preferred energetically. Critical quantities can again be
found by taking the derivative of W with respect to the radius of curvature and
setting the result equal to zero. Interestingly the value of $R^*$ is identical to that
derived in the homogeneous case [Eq. (75)], whereas the critical work of formation
$W^*$ is now given by Eq. (76).


Fig. 7.27 Schematic representation of a grain boundary β-phase
nucleus as a double spherical cap. The contact angle $\theta$ must satisfy a
balance of surface energy forces depicted by the three vectors of
lengths $\sigma$ and $\sigma_{gb}$. The broken line shows the region of pre-existing
grain boundary that is eliminated by the nucleus.

Fig. 7.28 Critical work of formation for grain boundary nucleation
(top curve), grain edge nucleation (middle curve) and grain corner
nucleation (bottom curve) vs. the quantity $\sigma_{gb}/\sigma$. Work of formation is
normalized by the value obtained for homogeneous nucleation. From
Cahn (1956).








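$$R^* = -\frac{2\sigma}{\Delta G_V} \tag{75}$$

$$W^* = \frac{16\pi\sigma^3}{3(\Delta G_V)^2} \cdot \frac{2 - 3\cos\theta + \cos^3\theta}{2} = W^*_{hom}\, \frac{2 - 3\cos\theta + \cos^3\theta}{2} \tag{76}$$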


Here $W^*_{hom}$ is the critical work of formation for homogeneous nucleation. The
term $(2 - 3\cos\theta + \cos^3\theta)/2$ can range from zero to one, so the work of
formation of a heterogeneous grain boundary nucleus is less than that of a
homogeneous nucleus. The behavior of $W^*$ normalized by the homogeneous value is
shown as the top (solid) curve in Fig. 7.28.
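
The dependence of the barrier reduction on $\sigma_{gb}/\sigma$ can be sketched numerically. The code below assumes the force balance of Fig. 7.27 takes the usual form $\sigma_{gb} = 2\sigma\cos\theta$ for a symmetric double cap; this relation is an assumption read from the figure caption, and the function names are illustrative.

```python
import math


def shape_factor(theta):
    """Ratio W*/W*_hom = (2 - 3 cos(theta) + cos^3(theta)) / 2 from Eq. (76)."""
    c = math.cos(theta)
    return (2.0 - 3.0 * c + c**3) / 2.0


def contact_angle(sigma_gb, sigma):
    """Contact angle from the assumed balance sigma_gb = 2 sigma cos(theta)."""
    ratio = sigma_gb / (2.0 * sigma)
    if ratio >= 1.0:
        return 0.0  # boundary is fully wetted; the barrier vanishes
    return math.acos(ratio)


# Normalized grain boundary barrier for a few values of sigma_gb / sigma.
for ratio in (0.0, 0.5, 1.0, 1.5, 2.0):
    theta = contact_angle(ratio, 1.0)
    print(f"sigma_gb/sigma = {ratio:3.1f}   theta = {math.degrees(theta):5.1f} deg"
          f"   W*/W*_hom = {shape_factor(theta):.4f}")
```

The ratio falls monotonically from one at $\sigma_{gb} = 0$ to zero at $\sigma_{gb} = 2\sigma$, which is the trend of the top curve in Fig. 7.28.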

Figure 7.27 was drawn assuming an isotropic surface energy for the α-β
interface. Lee and Aaronson (1975) have investigated the grain boundary nucleation
problem and concluded that the double spherical cap morphology is very unlikely.
A faceted nucleus is generally expected. Figure 7.29 shows a faceted β-phase
nucleus that is expected when the crystallographic planes of the β- and α-phases
have low-energy orientation relationships. The nucleus in Fig. 7.29 is called a
“grain boundary allotriomorph.”

Fig. 7.29 A faceted grain boundary nucleus can lower its work of
formation by exposing a large area of a crystallographic plane with low
surface energy. The shape is referred to as an allotriomorph.


7.3.7 
The Kolmogorov-Johnson-Mehl-Avrami Growth Equation 


For long aging times the steady-state nucleation rate of Eq. (59) ceases to be valid. 
For quench II of Fig. 7.20, as more and more β nuclei form and grow, the matrix
phase α becomes depleted in solute. Since the work of formation of a critical
nucleus, and hence the nucleation rate, depend strongly on supersaturation, J 
should decrease dramatically with time in the later stages of the phase transfor- 
mation. A description of the competition between solute depletion, nucleation, 
and particle growth is a difficult mathematical undertaking, but significant prog- 
ress was made by Langer and Schwartz (1980) (see also Binder and Stauffer 
1976). 

The full time dependence of the transformation after quench I of Fig. 7.20 is
also a challenging problem. Fortunately the time dependence of a quantity of
central interest, the volume fraction of β-phase, can be found readily by an ingenious
construction developed by Kolmogorov (1937), Johnson and Mehl (1939), and
Avrami (1939, 1940, 1941). Figure 7.30 depicts the microstructure of the α + β
mixture at three representative times, where $t_1 < t_2 < t_3$. At $t_1$ a number of nuclei
appear as governed by the nucleation rate J, and each β particle is approximately
the critical size at this early time. At $t_2$ nucleation leads to an increased number
of β-phase particles, and all the nuclei that had formed at earlier times have
grown in size. If v denotes the rate of change of the particle radius, then the total
volume of β-phase at early times can be written as Eq. (77), where V is the total
volume of the system. The dummy variable $t'$ accounts for the growth of all
particles that nucleated at earlier times.

$$V^\beta(t) = \frac{4\pi}{3} V \int_0^t J v^3 (t - t')^3 \, dt' \tag{77}$$


Unfortunately, the kinetic equation (77) cannot possibly be valid for all times. As
shown by the microstructure at $t_3$, eventually the β particles will impinge on one
another, halting the transformation. A single growth rate v is not possible after
impingement, so the trick now is to account for the impingement effects.
Imagine that the β-phase can grow unimpeded by the presence of other
particles; in other words the particles are free to overlap as shown in the bottom
microstructure of Fig. 7.30. In addition let us allow nucleation of β to take place in
regions that have already transformed! The volume of β under this clearly
artificial construction is called the extended volume and is denoted by $V_e^\beta$. The actual
volume of β must approach the total volume as $t \to \infty$, but the extended volume
increases without bound as time progresses. The quantity $V_e^\beta$ is given precisely by
the right-hand side of Eq. (77) for all time t.

Fig. 7.30 Evolution of the microstructure of a single-phase α
transforming to the single-phase β at three times, where $t_1 < t_2 < t_3$.
At time $t_3$, β-particles begin to impinge on one another; the extended
volume concept allows the β-phase to grow and nucleate in previously
transformed material as shown in the bottom diagram.

After a volume $V^\beta$ has transformed, the volume fraction that remains
untransformed is $(1 - V^\beta/V)$. In the next time interval dt a fraction of the increase in
extended volume $dV_e^\beta$ will occur in regions of untransformed material and will
thus contribute to the increase of the volume $dV^\beta$. The remaining fraction of
$dV_e^\beta$ will necessarily lie in already transformed regions and thus will add nothing
to the volume of the β-phase. Now we make the key assumption that the β-phase
nucleates randomly in the parent α matrix. On average, the fraction of increase in
$V_e^\beta$ that lies in untransformed material is simply that volume fraction that has not
yet been transformed, i.e., $dV^\beta$ is given by Eq. (78) or, by a straightforward
integration, $V_e^\beta$ by Eq. (79).

$$dV^\beta = \left(1 - \frac{V^\beta}{V}\right) dV_e^\beta \tag{78}$$

$$V_e^\beta = -V \ln\left(1 - \frac{V^\beta}{V}\right) \tag{79}$$


Since the extended volume is given by Eq. (77), the time dependence of the true
volume fraction of β-phase is obtained in Eq. (80), the Kolmogorov-Johnson-
Mehl-Avrami (KJMA) equation, which predicts that the volume fraction at t = 0
is equal to zero, but increases in a sigmoidal shape, and asymptotically
approaches 1 as $t \to \infty$.

$$\frac{V^\beta}{V} = 1 - \exp\left[-\frac{4\pi}{3} \int_0^t J v^3 (t - t')^3 \, dt'\right] \tag{80}$$








The exact form of $V^\beta/V$ depends, of course, on the precise description of the
time-dependent nucleation rate term and the growth rate. Two limiting cases are
typically analyzed, and both assume a constant v. First, consider a system in
which all nucleation takes place instantaneously at heterogeneous sites
distributed randomly throughout the material. If all nucleation sites are assumed
saturated at t = 0, Eq. (80) can be integrated to give Eq. (81), where N is the number
of sites per unit volume which become instantaneously filled with nuclei.

$$\frac{V^\beta}{V} = 1 - \exp\left[-\frac{4\pi}{3} N v^3 t^3\right] \tag{81}$$


The second limiting case is one in which the nucleation rate, here $J_{ss}$, is
assumed to be a constant with time. Integration of the KJMA equation (80) yields
Eq. (82).

$$\frac{V^\beta}{V} = 1 - \exp\left[-\frac{\pi}{3} J_{ss} v^3 t^4\right] \tag{82}$$


Notice the change in exponent from $t^3$ to $t^4$ between Eqs. (81) and (82). Since the
above two examples are limiting cases of nucleation behavior, Avrami suggested
that a plot of $\ln[-\ln(1 - V^\beta/V)]$ versus $\ln t$ will result in a straight line with a
slope between 3 and 4, and the exponent of t will indicate the dominance of
homogeneous or heterogeneous nucleation. In deriving Eq. (81) Avrami assumed that
heterogeneous sites were distributed randomly throughout the bulk material.
This is a dubious assumption for polycrystalline solids, however. Second-phase
nuclei tend to cluster on specific grain boundaries or along dislocation lines.
Equation (81) can be modified to accommodate these processes (Cahn 1956), but
this is beyond the scope of the present treatment.
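
The two limiting cases and the Avrami plot can be illustrated with a short calculation. The sketch below evaluates Eqs. (81) and (82) and extracts the exponent as the least-squares slope of $\ln[-\ln(1 - V^\beta/V)]$ against $\ln t$; the site density, nucleation rate, and growth velocity are hypothetical values used only for illustration.

```python
import math


def fraction_site_saturated(t, n_sites, v):
    """Eq. (81): all N nuclei per unit volume are present at t = 0."""
    return 1.0 - math.exp(-(4.0 * math.pi / 3.0) * n_sites * v**3 * t**3)


def fraction_constant_rate(t, j_ss, v):
    """Eq. (82): nucleation proceeds at the constant rate J_ss."""
    return 1.0 - math.exp(-(math.pi / 3.0) * j_ss * v**3 * t**4)


def avrami_slope(times, fractions):
    """Least-squares slope of ln(-ln(1 - X)) plotted against ln(t)."""
    xs = [math.log(t) for t in times]
    ys = [math.log(-math.log(1.0 - x)) for x in fractions]
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = sum((x - mean_x) ** 2 for x in xs)
    return num / den


# Hypothetical values: N in m^-3, J_ss in m^-3 s^-1, v in m s^-1, t in s.
times = [20.0, 40.0, 60.0, 80.0, 100.0]
sat = [fraction_site_saturated(t, 1.0e18, 1.0e-8) for t in times]
con = [fraction_constant_rate(t, 1.0e15, 1.0e-8) for t in times]

print("site-saturated exponent:", round(avrami_slope(times, sat), 6))
print("constant-rate exponent: ", round(avrami_slope(times, con), 6))
```

In practice measured exponents rarely fall exactly on 3 or 4, so the slope is best read as an indication of which nucleation mode dominates rather than as a precise identification.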


7.4 
Spinodal Decomposition 


In his classic work on equilibrium in heterogeneous substances, Gibbs (1948)
distinguished between two types of concentration fluctuations that give rise
to the formation of a new stable phase: (a) those that are small in spatial size
but large in concentration change, and (b) those that are large in spatial size
but small in concentration change. The first pertains to nucleation (the subject of
Section 7.3), the second to “spinodal decomposition” (the subject of this section).
The process of spinodal decomposition by the growth of infinitesimally
small-amplitude concentration fluctuations was elucidated by Cahn (1961).


7.4.1 
Concentration Fluctuations and the Free Energy of Solution 


Consider the idealized, one-dimensional concentration fluctuation shown in Fig.
7.31 for a sample of average concentration $c_0$. Over the spatial range, x, from 0
to $+\ell$, the composition of the sample is slightly greater than the average, i.e.,
$c(x) = c_0 + \delta c$ for $0 < x < +\ell$, where $\delta c$ is small compared to $c_0$. Similarly, in the
range from $-\ell$ to 0 the concentration is less than average. By conservation of
solute, we must have $c(x) = c_0 - \delta c$ for $-\ell < x < 0$. Take the cross-sectional area
of the sample to be unity, and the total length to be L. Then the total free energy
of homogeneous material before introducing the composition fluctuation is
simply $L f(c_0)$, where f is the Helmholtz free energy per unit volume. Of interest
here is the total free energy change, $\Delta F$, from an initial state of uniform
composition to the state depicted in Fig. 7.31. Since the amplitude of fluctuation is
assumed small we can find the free energy of the material by Taylor-expanding
about $c_0$. Therefore, $\Delta F$ for the region 0 to $+\ell$ becomes $\Delta F^+$ [Eq. (83)], where,
for the time being, the gradient energy contribution has been neglected.


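$$\Delta F^+ = \ell \left[ f(c_0) + \left.\frac{\partial f}{\partial c}\right|_{c=c_0} \delta c + \frac{1}{2} \left.\frac{\partial^2 f}{\partial c^2}\right|_{c=c_0} (\delta c)^2 + \cdots \right] - \ell f(c_0) \tag{83}$$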


Similarly, the free energy change over the region where x < 0 is given by Eq. (84). 





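$$\Delta F^- = \ell \left[ f(c_0) - \left.\frac{\partial f}{\partial c}\right|_{c=c_0} \delta c + \frac{1}{2} \left.\frac{\partial^2 f}{\partial c^2}\right|_{c=c_0} (\delta c)^2 + \cdots \right] - \ell f(c_0) \tag{84}$$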





The total free energy change upon introduction of the fluctuation is, of course,
the sum of Eqs. (83) and (84), in which the free energy of the homogeneous alloy
has already been subtracted. The terms linear in $\delta c$ cancel, leaving Eq. (85), where
the notation $f''$ denotes the second derivative of the free energy with respect to
composition, evaluated at $c_0$.

$$\Delta F = \ell f'' (\delta c)^2 \tag{85}$$

Fig. 7.31 Idealized, one-dimensional concentration fluctuation in a
material of average composition $c_0$. The positive and negative parts of
the fluctuation account for the same amount of solute.

Fig. 7.32 Free energy per unit volume as a function of composition for
a system exhibiting a miscibility gap. Also shown are the ranges of
concentration for which the solution is stable, metastable, and
unstable.


In Eq. (85) both the terms $\ell$ and $(\delta c)^2$ are positive, and therefore the sign of the
total free energy change is governed by f”. Figure 7.32 shows the behavior of f 
versus c for a binary system exhibiting a miscibility gap. For the ranges of average 
composition labeled “stable” and “metastable,” the second derivative of f is pos- 
itive. In these ranges, small fluctuations in concentration will increase the free 
energy of solution, so the unfavorable fluctuation will decay, and the system will 
return to a homogeneous state. On the other hand, for a range of composition in 
the unstable region, f” is in fact negative. In this unstable range, concentration 
disturbances, no matter how small in amplitude, cause the free energy to de- 
crease. Furthermore, Eq. (85) suggests that if the fluctuation continues to grow, 
the free energy can decrease more. The formation of a stable two-phase mixture 
from a solution for which f” < 0 is known as spinodal decomposition. 
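
The sign argument based on Eq. (85) can be checked numerically. The sketch below assumes a hypothetical double-well free energy density $f(c) = w\,c^2(1-c)^2$, which is not taken from the text and serves only to supply a curve with a region of negative curvature. It compares the directly computed free energy change of the square-wave fluctuation with the second-order estimate $\ell f''(\delta c)^2$.

```python
def f(c, w=1.0):
    """Hypothetical double-well free energy density (illustration only)."""
    return w * c**2 * (1.0 - c) ** 2


def f2(c, w=1.0):
    """Second derivative of the hypothetical f with respect to c."""
    return w * (2.0 - 12.0 * c + 12.0 * c**2)


def delta_f_direct(c0, dc, ell=1.0):
    """Free energy change of the +dc / -dc fluctuation, evaluated exactly."""
    return ell * (f(c0 + dc) + f(c0 - dc) - 2.0 * f(c0))


def delta_f_taylor(c0, dc, ell=1.0):
    """Second-order estimate of Eq. (85): ell * f'' * dc^2."""
    return ell * f2(c0) * dc**2


for c0 in (0.10, 0.30, 0.50):
    exact = delta_f_direct(c0, 1.0e-3)
    approx = delta_f_taylor(c0, 1.0e-3)
    label = "grows" if approx < 0.0 else "decays"
    print(f"c0 = {c0:.2f}   direct = {exact:+.4e}   Eq. (85) = {approx:+.4e}   {label}")
```

For this assumed curve the curvature changes sign at $c = (1 \pm 1/\sqrt{3})/2$, so compositions between those values lower their free energy under an infinitesimal fluctuation while those outside do not.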

The meaning of the labels of Fig. 7.32 is now clear. For the metastable region,
small concentration fluctuations (for which the Taylor expansion of Eq. (85)
remains valid) will decay, but large composition deviations (i.e., those that create a
critical nucleus resembling the stable α-phase) will grow. In the unstable range
where $f'' < 0$, even infinitesimally small fluctuations will increase spontaneously
in amplitude. Finally, in the concentration regions labeled “stable,” the
equilibrium state is a single-phase solid solution. Here any departure from a
homogeneous solution of concentration $c_0$, no matter how large, will eventually return to
the single-phase state. The concentration marked $c_s$ in Fig. 7.32, corresponding to
the point at which $f'' = 0$, represents the limit of stability of the solution and is
called the “spinodal composition.” In the continuum theory of nucleation, the
quantity $c_s$ coincides with the vanishing of the critical work of formation. In light
of Eq. (85) and the curve shown in Fig. 7.32, the fact that $W^* = 0$ at $c_s$ is no
surprise. At this composition there is no longer an initial increase in energy of the
system, or activation barrier on forming a “nucleus,” but rather a decrease in
total energy.


7.4.2 
The Diffusion Equation 


The gradient in chemical potential drives diffusion, not the gradient in
concentration per se. Fick’s law is valid for ideal solutions, but the solution in the
unstable region of Fig. 7.32 is certainly not ideal. We seek a diffusion equation
appropriate for the case of spinodal decomposition. When lattice sites are
conserved, the gradients of chemical potential can be replaced by a single gradient
in the difference of chemical potentials between the two species A and B in
solution. One can then show that the flux of solute becomes $\mathbf{J}$ [Eq. (86)], where M is
the mobility of solute atoms, a positive quantity, and $N_v$ is the number of atoms
per unit volume. Here F is a functional of c. The term in parentheses in Eq. (86)
is the variation of F with respect to the function $c(\mathbf{r})$.

$$\mathbf{J} = -\frac{M}{N_v} \nabla \left( \frac{\delta F}{\delta c} \right) \tag{86}$$


With an approach used by Cahn and Hilliard (1958, 1959), the variation is then
given by Eq. (87), where $\kappa$ is the gradient energy coefficient, here assumed
independent of concentration.

$$\delta F = \int \left( \frac{\partial f}{\partial c} - 2\kappa \nabla^2 c \right) \delta c \, d^3r \tag{87}$$

For a small element of volume, the variation of F with respect to c is given by an
Euler-Lagrange equation. Using Eq. (87), Eq. (88) is obtained for the flux.

$$\mathbf{J} = -\frac{M}{N_v} \nabla \left( \frac{\partial f}{\partial c} - 2\kappa \nabla^2 c \right) \tag{88}$$


When combined with the continuity equation, Eq. (88) yields the general
diffusion equation describing any diffusional transformation in which the gradient
energy contribution is important. The result is Eq. (89), where in anticipation of a
stability analysis, we used the function $u(\mathbf{r}, t) = c(\mathbf{r}, t) - c_0$, describing the
deviation of the concentration from its average value.

$$\frac{\partial u}{\partial t} = \nabla \cdot \left[ \frac{M}{N_v} \nabla \left( \frac{\partial f}{\partial u} - 2\kappa \nabla^2 u \right) \right] \tag{89}$$

To perform a stability analysis of the concentration, we must specify the spatial
dependence of the small perturbation, and examine how this specific form of
the fluctuation evolves with time. Consider a one-dimensional concentration
disturbance of the form in Eq. (90), where a(t) is the amplitude of the concentration
wave.


$$u(\mathbf{r}, t) = a(t) \cos(kx) \tag{90}$$


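The time dependence of a determines the stability. Here k is the wavenumber,
which can also be written as $2\pi/\lambda$, $\lambda$ being the wavelength of the concentration
profile. Substitution of Eq. (90) into the diffusion equation (89) leads to Eq. (91).

$$\frac{\partial}{\partial t}\left[a(t)\cos(kx)\right] = \nabla \cdot \left\{ \frac{1}{N_v}\left[M_0 + M' a(t)\cos(kx) + \tfrac{1}{2} M'' a^2(t)\cos^2(kx) + \cdots\right] \nabla \left[ f' + f'' a(t)\cos(kx) + \tfrac{1}{2} f''' a^2(t)\cos^2(kx) + \cdots - 2\kappa \nabla^2 \left( a(t)\cos(kx) \right) \right] \right\} \tag{91}$$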


Two functions, M and $\partial f/\partial u$, which are generally nonlinear functions of u, have
been replaced by their Taylor expansions, and primes indicate differentiation
evaluated at u = 0. In keeping with the linear stability analysis, only small-amplitude
perturbations are considered. Therefore, any term in Eq. (91) that is multiplied by
a factor $a^2(t)$, $a^3(t)$, etc., is neglected. The small-amplitude assumption results in
the simplification to Eq. (92), whose solution is Eq. (93).

$$\frac{da}{dt} = -\frac{M_0}{N_v}\left[f'' k^2 + 2\kappa k^4\right] a(t) \tag{92}$$

$$a(k, t) = a(k, t = 0) \exp\left[-\frac{M_0}{N_v} k^2 \left(f'' + 2\kappa k^2\right) t\right] \tag{93}$$


The result, Eq. (93), shows that if $f'' > 0$, all small-amplitude concentration
fluctuations will decay exponentially with time, and the system is stable. On the other
hand, if $f'' < 0$, then some perturbations are unstable, i.e., those wavenumbers for
which $|f''| > 2\kappa k^2$, but all others are stable.

Figure 7.33 shows the behavior of the amplification factor, $\alpha(k)$, versus k where
$\alpha$ is the term multiplying the time in the exponential function of Eq. (93), i.e.,
$\alpha(k) = -(M_0/N_v) k^2 (f'' + 2\kappa k^2)$. For wavenumbers less than $k_c$, $\alpha(k)$ is positive
and the amplitude a(t) grows exponentially with time, whereas for $k > k_c$ the
amplitude diminishes. The value of $k_c$ [Eq. (94)] is known as the “critical
wavenumber.”

$$k_c = \sqrt{-\frac{f''}{2\kappa}} \tag{94}$$

Fig. 7.33 Amplification factor $\alpha(k)$ vs. k. The value $k_c$ corresponds to
the point $\alpha = 0$ and $k_m$ refers to the maximum in $\alpha$.


In addition, let the point $k_m$ correspond to the maximum in the curve of Fig. 7.33.
The exponential dependence of the solution, Eq. (93), and the peak in $\alpha(k)$ at
$k = k_m$, suggest that for any initial concentration fluctuation, those Fourier
components in the vicinity of $k_m$ will be amplified most rapidly. This selective
amplification implies that, shortly after a quench to the unstable region of the phase
diagram, the Fourier spectrum will be dominated by a single component. It
suggests a real-space concentration profile characterized by a nearly periodic,
three-dimensional interconnected network of regions with high and low solute
concentrations.
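
The shape of the amplification factor can be explored with a few lines of code. The sketch below evaluates $\alpha(k)$ from Eq. (93) and the critical wavenumber of Eq. (94); the expression for $k_m$ follows from setting $d\alpha/dk = 0$, which gives $k_m = k_c/\sqrt{2}$. The values of $f''$ and $\kappa$ are hypothetical, and the prefactor $M_0/N_v$ is set to one because it only scales the curve.

```python
import math


def amplification(k, f2, kappa, mobility_over_nv=1.0):
    """Amplification factor alpha(k) = -(M0/Nv) k^2 (f'' + 2 kappa k^2)."""
    return -mobility_over_nv * k**2 * (f2 + 2.0 * kappa * k**2)


def critical_wavenumber(f2, kappa):
    """Eq. (94): k_c = sqrt(-f'' / (2 kappa)); requires f'' < 0."""
    return math.sqrt(-f2 / (2.0 * kappa))


def fastest_wavenumber(f2, kappa):
    """Maximum of alpha(k), from d(alpha)/dk = 0: k_m = sqrt(-f'' / (4 kappa))."""
    return math.sqrt(-f2 / (4.0 * kappa))


F2 = -1.0e9      # hypothetical f'' inside the spinodal, J m^-3
KAPPA = 1.0e-10  # hypothetical gradient energy coefficient, J m^-1

k_c = critical_wavenumber(F2, KAPPA)
k_m = fastest_wavenumber(F2, KAPPA)
print(f"k_c = {k_c:.3e} m^-1   wavelength = {2.0 * math.pi / k_c * 1e9:.2f} nm")
print(f"k_m = {k_m:.3e} m^-1   wavelength = {2.0 * math.pi / k_m * 1e9:.2f} nm")
for frac in (0.25, 0.5, 1.0 / math.sqrt(2.0), 0.9, 1.0, 1.2):
    print(f"k/k_c = {frac:.3f}   alpha = {amplification(frac * k_c, F2, KAPPA):+.3e}")
```

With these assumed inputs the fastest-growing wavelength is a few nanometers, which is the length scale on which the interconnected two-phase pattern would first appear.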


7.4.3 
Effects of Elastic Strain Energy 


In general, the molar volumes of the two species in a binary solution are differ- 
ent. As a result, a concentration fluctuation creates an elastic strain field in the 
material. We have seen in the case of nucleation that the positive elastic strain en- 
ergy from a misfitting spherical nucleus must be added to the work of formation, 
thereby dramatically changing the predicted kinetics. The strain energy generated 
by growing concentration fluctuations also plays an important role in the spino- 
dal decomposition of crystalline solids. 

The elastic strain energy for an arbitrary composition fluctuation can be found
in much the same way as was done in solving the diffusion equation by Fourier
transform methods. By Fourier-transforming the composition, computing the
elastic energy of each Fourier component, and utilizing the fact that the
components do not interact, one obtains a general elastic energy contribution
[Eq. (95)], where E is Young’s modulus, $\nu$ is Poisson’s ratio and $\eta$ is the fractional
change of lattice parameter with composition.

$$E_{el} = \frac{\eta^2 E}{1 - \nu} \int u^2 \, d^3r \tag{95}$$


To include the elastic strain energy in either the continuum theory of nucleation
or that of spinodal decomposition, the Cahn-Hilliard free energy expression
must be modified as Eq. (96).

$$F = \int \left[ f(c) + \kappa (\nabla c)^2 + \frac{\eta^2 E}{1 - \nu} (c - c_0)^2 \right] d^3r \tag{96}$$

It is a straightforward exercise to show that the limit of stability of a solid solution
is no longer $f'' = 0$ but is given by Eq. (97).

$$f'' + \frac{2\eta^2 E}{1 - \nu} = 0 \tag{97}$$

The concept of a spinodal can now be illustrated with Fig. 7.34. The heavy solid
line shows the equilibrium miscibility gap (cf. Fig. 7.6). Below the equilibrium
phase boundary is a broken curve corresponding to the locus of points such that
$f'' = 0$. This curve is called the “chemical spinodal.” The dash-dot curve, the
“coherent spinodal,” is the limit of stability with elastic contributions included. It is
found by displacing the chemical spinodal downward by an amount equal to the
additional term in Eq. (97). Cahn showed that the $2\eta^2 E/(1 - \nu)$ term is large in
many solid solutions.

Fig. 7.34 Phase diagram of a binary system with a miscibility gap
(solid curve). Also shown are the chemical spinodal where $f'' = 0$ and
the coherent spinodal for which $f'' + 2E\eta^2/(1 - \nu) = 0$.

Not only does the additional elastic term change the criterion for the limit of
stability, it alters the kinetics of decomposition too. With Eq. (96), the linearized
form of the diffusion equation (89) has a solution in Fourier space given by Eq.
(98).

$$a(k, t) = a(k, t = 0) \exp\left[-\frac{M_0}{N_v} k^2 \left(f'' + \frac{2\eta^2 E}{1 - \nu} + 2\kappa k^2\right) t\right] \tag{98}$$

The presence of the term involving elastic constants implies a decrease in the rate 
at which concentration waves are amplified. As for the nucleation of a misfitting 
sphere, the positive elastic energy contribution slows the reaction. The assump- 
tion of an elastically isotropic solid is often inaccurate. Spinodal decomposition 
in anisotropic systems is characterized by preferential rapid growth of concentra- 
tion waves along elastically soft directions (Cahn 1962). 
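
The effect of the elastic term on both the stability limit and the kinetics can be sketched for the isotropic case. The code below adds $2\eta^2 E/(1-\nu)$ to $f''$ as in Eqs. (97) and (98); the material constants are hypothetical values for illustration, and the helper names are not from the text.

```python
import math


def elastic_term(eta, youngs_modulus, poisson):
    """Coherency strain contribution 2 eta^2 E / (1 - nu) from Eq. (97)."""
    return 2.0 * eta**2 * youngs_modulus / (1.0 - poisson)


def inside_coherent_spinodal(f2, eta, youngs_modulus, poisson):
    """True when f'' + 2 eta^2 E / (1 - nu) < 0, i.e. the solution is unstable."""
    return f2 + elastic_term(eta, youngs_modulus, poisson) < 0.0


def coherent_amplification(k, f2, kappa, eta, youngs_modulus, poisson,
                           mobility_over_nv=1.0):
    """Exponent of Eq. (98) per unit time, with M0/Nv defaulting to one."""
    f2_eff = f2 + elastic_term(eta, youngs_modulus, poisson)
    return -mobility_over_nv * k**2 * (f2_eff + 2.0 * kappa * k**2)


def coherent_critical_wavenumber(f2, kappa, eta, youngs_modulus, poisson):
    """Largest unstable wavenumber once the elastic term is included."""
    f2_eff = f2 + elastic_term(eta, youngs_modulus, poisson)
    return math.sqrt(-f2_eff / (2.0 * kappa))  # requires f2_eff < 0


# Hypothetical constants: eta dimensionless, E in Pa, kappa in J m^-1.
ETA, E_MOD, NU, KAPPA = 0.02, 1.0e11, 0.3, 1.0e-10

print(f"elastic term: {elastic_term(ETA, E_MOD, NU):.3e} J m^-3")
for f2 in (-1.0e9, -1.0e8):
    unstable = inside_coherent_spinodal(f2, ETA, E_MOD, NU)
    print(f"f'' = {f2:+.1e}   inside coherent spinodal: {unstable}")
```

A composition can therefore lie inside the chemical spinodal and still be stable against coherent fluctuations, and even where decomposition proceeds the range of unstable wavenumbers is narrower than in the strain-free case.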


7.5 
Martensitic Transformations 


The previous two sections described phase transformations involving the diffu- 
sion of atoms over moderate distances, for which continuum diffusion equations 
give key insights. This section describes martensitic transformations, in which all 
atoms in a crystal distort cooperatively into a new shape. Martensitic transforma- 
tions are often called “diffusionless,” and depend on features of crystal geometry. 

A plain carbon steel at a temperature of 950 °C exists in a face-centered cubic 
(fcc) phase known as “austenite.” In austenite the Fe atoms occupy the sites of an 
fcc lattice, and the C atoms are squeezed into octahedral interstices. After the 
steel has been quenched rapidly to room temperature, the material transforms 
into a body-centered tetragonal (bct) phase known as “martensite.” Because of 
the technological importance of steel, the martensite transformation has been 
well studied, but that is not to say it is completely understood. The martensite 
transformation also occurs in Fe-Ni, Au-Cd, Ti-Nb, In-Tl, Cu-Zn, Cu-Al, and 
many other alloys; in superconductors, e.g., V₃Si and Nb₃Sn, and in ceramic systems
such as ZrO₂ and BaTiO₃. The formation of martensite is different from the
types of phase transformation discussed previously: eight characteristic features 
are enumerated in Section 7.5.1. 





7.5.1 
Characteristics of Martensite 


1. Martensitic transformations are “diffusionless.” Atoms move
by less than one interatomic distance and the product phase
is formed by a cooperative motion of many atoms. Since no
long-range, solid-state diffusion occurs, the product 
martensite phase must have the same composition as the 
parent (austenite) phase. The diffusionless nature of the 
transition was proved by Kurdjumov and coworkers 
(Kaminsky and Kurdjumov 1936; Kurdjumov et al. 1939), 
who demonstrated that the ordered alloys Cu-Zn and Cu-Al 
remain ordered after the martensitic transformation. The 
transformation therefore causes a negligible change in the 
configurational entropy of the alloy. The entropy of the 
transformation originates from changes in vibrational modes, 
or in some Fe alloys, from changes in magnetic order. 

2. Martensite usually appears in the form of thin plates in the 
austenite phase, but other morphologies such as needles and 
laths are also observed. For a given composition, the plates 
lie on distinct crystallographic planes in the parent phase. 
The preferred plane is known as the “habit plane.” 
Sometimes the habit plane is a low-index or close-packed 
plane, but sometimes the Miller indices of the habit plane 
are irrational numbers. Martensite plates grow at velocities 
approaching the speed of sound, and the motion is 
terminated when the plate encounters a grain boundary, the 
specimen surface, or another martensite plate. The formation 
of a martensite platelet can trigger the formation of other 
plates. This autocatalytic process is often accompanied by an 
audible “click.” 

Figure 7.35 is a schematic diagram of the plate formation 
process; notice how successively smaller and smaller plates 
appear as more of the specimen is transformed to 
martensite. Figure 7.36 shows an almost fully transformed 
Fe-Ni alloy. Martensite initially forms as very thin plates that 
subsequently thicken during the rapid growth stage. The 
thickening process is inferred from optical and electron 
microscopy, which reveal the presence of midribs running 





Fig. 7.35 Sequence of martensite plate formation. Growth of plates is 
terminated at grain boundaries and at other martensite plates. 







Fig. 7.36 Martensite plates in an almost completely transformed Fe-Ni 
specimen (magnification ×500). From Shewmon (1969).


along the center line of the plate. Midribs, as seen in Fig. 
7.37, are thought to be the original nuclei of the martensite 
phase. 

3. The volume fraction of martensite is a function of 
temperature. The first martensite plates form at the 
“martensite start” temperature denoted by Ms, whereas the
specimen is completely transformed for temperatures below
Mf, the “martensite finish” temperature. The volume fraction
may or may not change with time. In the “isothermal” case





Fig. 7.37 Intersecting martensite plates in an Fe-32%Ni specimen. The 
straight line running through the center of the plates is known as a 
midrib. The fine lines running across the midrib are twins. From 
Shewmon (1969). 
the volume fraction of martensite increases with time,
whereas for “athermal” martensites the volume fraction of
the product phase changes almost instantaneously at any
temperature in the range Ms > T > Mf, and holding at a
temperature for any length of time does not cause the 
formation of more martensite. 

4. There exists a definite crystallographic relationship between 
the martensite and austenite phases. In plain carbon steels, 
Kurdjumov and Sachs (1930) determined that the (111) plane 
in the austenite phase lies parallel to the (011) plane in the 
martensite.¹⁵⁾ In addition, the [1̄01] direction in austenite,
which lies in the (111) plane, is oriented parallel to the [1̄1̄1]
direction in the martensite plates. If we employ the standard
notation of “γ” representing the fcc austenite and “α”
denoting the bct product phase, the Kurdjumov-Sachs
relationships are expressed by Eqs. (99).


\[ (111)_{\gamma} \parallel (011)_{\alpha}, \qquad [\bar{1}01]_{\gamma} \parallel [\bar{1}\bar{1}1]_{\alpha} \tag{99} \]


The two planes that lie parallel to one another are the close- 
packed planes in each structure. In Fe-Ni alloys, with the 
Ni content greater than 28 wt.% Ni, the orientation 
relationships, first measured by Nishiyama (1934), are as in 
Eqs. (100). 


\[ (111)_{\gamma} \parallel (011)_{\alpha}, \qquad [\bar{1}\bar{1}2]_{\gamma} \parallel [0\bar{1}1]_{\alpha} \tag{100} \]

Often the relative orientations of parent and martensite 
phases cannot be expressed in simple forms such as those 
of Eqs. (99) and (100). In the Au-47.5 at.%Cd alloy, the high- 
temperature phase is the ordered B2 (CsCl) structure, 
denoted β, and the martensite phase is an ordered
orthorhombic crystal structure (β′) (Lieberman et al. 1955).
The low-index planes are misoriented by a couple of degrees. 

5. The macroscopic distortion caused by the formation of 
martensite is a homogeneous shear. Figure 7.38(a) shows a 
series of straight parallel lines that were inscribed on the 
surface of the parent phase. An illustration of the surface 
relief phenomenon is depicted in Fig. 7.38(b), which 
represents a cross-section of the specimen shown in Fig. 
7.38(a). After a martensite plate forms and intersects the 


15) Orientation relationships are measured by taking two diffraction patterns, X-ray or
electron: one from a single martensite plate and another from an adjacent austenite
grain. By indexing both patterns one can formulate the relative spatial orientations of
the two crystals.

Fig. 7.38 (a) View normal to the surface of a specimen exhibiting a
martensite plate. Parallel lines inscribed on the surface remain straight, 
but a macroscopic shear due to the transformation is observed. 

(b) Surface upheaval due to the plate intersecting the surface of the 
sample. 


sample surface, the lines remain straight in the martensite 
(labeled M), but make a distinct angle with respect to the 
lines of the austenite. The habit plane, i.e., the plane 
separating the parent and martensite phases, is one of zero 
macroscopic distortion. The emphasis on the word 
“macroscopic” is important. As will be seen in Section 7.5.4, 
the presence of a plane of zero distortion in the martensite 
phase necessarily implies that the plate is microscopically 
sheared or twinned. 

6. Martensite transformations are often reversible in the sense 
that the product phase will transform back to the austenite 
phase upon heating. The temperature at which the reverse 
reaction first begins is denoted by As, and the last remnant
of martensite disappears at the temperature Af. In general,
however, there is thermal hysteresis such that As > Mf and
Af > Ms. The magnitude of the hysteresis can be quite large,
hundreds of degrees Celsius in Fe-based alloys, but it can be 
small, as for Cu-Zn-Al. In a study of the Cu-Al-Ni system, 
Kurdjumov and Khandros (1949; see also Tong and Wayman 
1974) demonstrated the microscopic reversibility of
martensite reactions. The first plate to form at Ms during the
cooling cycle is the last one remaining upon heating. Also,
the last plate to appear, at Mf, is the first plate to retransform
at the temperature As.

7. Temperature is not the only external field that can trigger the 
martensite reaction. An applied stress or high magnetic field 
can also initiate the transformation. For any temperature in 
the range Ms > T > Mf, the amount of martensite will
usually be increased by plastic strain. A plastic strain can also
cause the formation of martensite plates even at
temperatures above Ms. The highest temperature at which
the product phase can form under stress is denoted Md. In
contrast, cold working of the parent phase at temperatures
above Md often inhibits the martensite transformation. The
Ms temperature is depressed and the amount of martensite is
reduced at any temperature within the transformation range. 

8. Consider the following cooling sequence. A material is 
rapidly quenched to a temperature below Ms but above Mf
and then held at this intermediate temperature for some
period of time. After the hold time the temperature of the
specimen is then lowered again to some final temperature,
say Tf. It is found that the martensite does not form
immediately during the second cooling step; instead, the
partially transformed sample must be cooled to a new start
temperature Ms′ before any new martensite plate formation is
observed. Furthermore, the amount transformed at Tf is less
than that obtained if the material had been directly quenched 
to this final temperature. This process is called “stabilization” 
(Nishiyama 1978). 
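
The orientation relationships of item 4 can be checked with a few dot products: a direction [uvw] lies in a plane (hkl) when hu + kv + lw = 0. The sketch below is only an illustration under stated assumptions: the signs of the indices (the overbars) are taken as one of several equivalent variant choices, the martensite is treated as cubic (c/a = 1) when angles are computed, and all names are hypothetical.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def angle_deg(u, v):
    """Angle between two vectors expressed in a cubic (orthonormal) basis."""
    c = dot(u, v) / math.sqrt(dot(u, u) * dot(v, v))
    return math.degrees(math.acos(max(-1.0, min(1.0, c))))

# Assumed index signs: one variant among the crystallographically equivalent ones.
kurdjumov_sachs = {"plane_g": (1, 1, 1), "dir_g": (-1, 0, 1),
                   "plane_a": (0, 1, 1), "dir_a": (-1, -1, 1)}
nishiyama = {"plane_g": (1, 1, 1), "dir_g": (-1, -1, 2),
             "plane_a": (0, 1, 1), "dir_a": (0, -1, 1)}

def directions_lie_in_planes(rel):
    """Zone law check for the austenite (g) and martensite (a) pairs."""
    return (dot(rel["plane_g"], rel["dir_g"]) == 0
            and dot(rel["plane_a"], rel["dir_a"]) == 0)

ks_ok = directions_lie_in_planes(kurdjumov_sachs)
nw_ok = directions_lie_in_planes(nishiyama)
print("directions lie in their planes:", ks_ok, nw_ok)

# In-plane angle between the two quoted directions within each phase.
gamma_angle = angle_deg(kurdjumov_sachs["dir_g"], nishiyama["dir_g"])
alpha_angle = angle_deg(kurdjumov_sachs["dir_a"], nishiyama["dir_a"])
print("angle in austenite, angle in martensite, difference:",
      gamma_angle, alpha_angle, alpha_angle - gamma_angle)
```

The difference of the two in-plane angles, a little over 5°, is the rotation about the common plane normal that separates the two relationships for the closest pair of variants.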


7.5.2 
Massive and Displacive Transformations 


Of the characteristics listed above, the diffusionless nature of the transformation 
(item 1) taken together with the macroscopic shape change (item 5) distinguishes 
martensite from similar transformations. For example, a Cu-Zn alloy with Zn 
concentration in the range 36.8-38.3 at.% exists as a single-phase β (disordered
bcc) at high temperatures. A rapid quench to lower temperatures brings the
material into another single-phase field, disordered fcc α, such that the β → α
transformation involves no change in composition and hence no long-range diffusion.
The morphology of the new α-phase contains blocky, or “massive,” precipitates
that form at grain boundaries. The growth of the α-phase involves the rapid
advance of relatively flat interphase boundaries. Despite its diffusionless nature,
the β → α reaction in Cu-Zn is not martensitic because it does not occur with a
macroscopic shape change based on a homogeneous shear. This type of phase
transformation is called a “massive transformation.” 

Finally, martensite reactions are a subset of a general category known as “dis- 
placive transformations.” The product phase of a displacive transformation is 
formed by the cooperative motion of atoms in the initial phase, where the extent 
of the individual displacements is less than one interatomic distance. In contra- 
distinction to displacive reactions are replacive-type transformations that involve 
the diffusion of atoms. An example of a transformation that is displacive but not 
martensitic is the bcc to “ω-phase” in Ti, Zr, and Hf alloys. The formation of the
hexagonal ω-phase is accomplished, not by a shear mechanism, but by a periodic
collapse of adjacent {111} planes in the parent lattice. 


7.5.3 
Bain Strain and Lattice Invariant Shear


Bain (1924) described the distortions required to form a bct martensite
from an fcc austenite crystal. As demonstrated in Fig. 7.39(a), a bct unit cell can 
be constructed from an fcc lattice by taking as the top and bottom of the cell the 
squares formed by connecting two face and two corner atoms. In this representation
the [110] direction of the austenite becomes the [010] direction in the tetragonal
phase and, if a₀ is the original austenite lattice parameter, the short dimension
of the bct unit cell in Fig. 7.39(a) is a₀/√2. The atom located at a₀[½ 0 ½] in
the fcc phase becomes the body-center atom under the new construction. Carbon
atoms, randomly dispersed on interstitial sites in the austenite phase, occupy the
base center, a[½ ½ 0] positions and the a[0 0 ½] edge positions of the new bct
cell. To obtain the correct cell size, a contraction of about 20% in the vertical
[001] direction must occur, along with an expansion of 12% in both the [100] 





Fig. 7.39 (a) Illustration of how a bct unit cell of martensite can be 
constructed from an fcc austenite cell. (b) Final unit cell of martensite 
obtained after application of the Bain strain. 
and [010] directions.¹⁶⁾ If a and c denote the lattice parameters of the martensite,
then the so-called “Bain strain” or “Bain distortion” is given by Eqs. (101).


\[ \eta_1 = \frac{\sqrt{2}\,a}{a_0}, \qquad \eta_2 = \frac{c}{a_0} \tag{101} \]


The unit cell produced by the Bain distortion is depicted in Fig. 7.39(b). One can 
envision other distortions that will transform the fcc parent phase into the mar- 
tensite structure, but Bain argued that the correspondence shown in Fig. 7.39 
and its strains of Eq. (101) represent the lowest elastic energy. 
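
To give a feeling for the magnitudes involved, the short sketch below evaluates the two principal Bain distortions, taken here as η₁ = √2 a/a₀ in the basal plane and η₂ = c/a₀ along the vertical axis, for the Fe-30.9 wt.%Ni lattice parameters quoted in Section 7.5.4. The function name is hypothetical, and the numbers apply to that alloy only, not to carbon steels.

```python
import math

def bain_strains(a0, a, c):
    """Principal distortions of the Bain correspondence (fcc -> bct)."""
    eta1 = math.sqrt(2.0) * a / a0   # expansion along the two basal axes
    eta2 = c / a0                    # contraction along the vertical axis
    return eta1, eta2

# Lattice parameters in angstroms, as quoted for Fe-30.9 wt.%Ni.
a0, a, c = 3.591, 2.875, 2.875
eta1, eta2 = bain_strains(a0, a, c)
volume_ratio = eta1**2 * eta2        # martensite volume / austenite volume

print(f"eta1 = {eta1:.4f}  (basal expansion {100 * (eta1 - 1):.1f} %)")
print(f"eta2 = {eta2:.4f}  (vertical contraction {100 * (1 - eta2):.1f} %)")
print(f"volume ratio = {volume_ratio:.4f}")
```

For these lattice parameters the basal expansion and vertical contraction come out close to the approximate figures quoted above, and the transformation is accompanied by a small volume increase.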

It is tempting to describe the formation of martensite simply as the Bain distor- 
tion of plates of material. However, such a straightforward description of the pro- 
cess is inconsistent with the experimental facts. A key feature of the martensite 
phase transformation is the “habit plane” (item 2), a plane that remains undis- 
torted and unrotated during the reaction. It turns out that there is no way the 
Bain strain can be applied to a volume of material so that an undistorted plane 
is preserved. 


7.5.4 
Martensite Crystallography 


Wechsler, Lieberman, and Read (Wechsler et al. 1953), and independently Bowles 
and Mackenzie (Bowles and Mackenzie 1954; Mackenzie and Bowles 1954, 1957) 
proved that both the habit plane and the crystallographic relationships can be 
computed from a knowledge of only the lattice parameters and structure of the 
austenite and martensite phases. Although the two approaches differ somewhat, 
Christian (1956) showed that their results are identical. The crystallographic 
theory of martensite can also predict the magnitude of the macroscopic shear 
and can provide some details of the microscopic lattice invariant shear. These 
two theoretical contributions from the mid-1950s tied together many seemingly 
unrelated experimental facts concerning the martensite reaction and are crucial 
to our present day understanding of the transformation. 

Imagine a martensite plate to consist of twins as shown in Fig. 7.40. One 
region, with thickness 1 − x, and a second region, with thickness x, represent
two crystallographic variants of the martensite phase. Each of the two twinned 
volumes has associated with it a pure Bain distortion matrix written as Eqs. (102). 


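\[
T_1 = \begin{pmatrix} \eta_1 & 0 & 0 \\ 0 & \eta_2 & 0 \\ 0 & 0 & \eta_3 \end{pmatrix}, \qquad
T_2 = \begin{pmatrix} \eta_2 & 0 & 0 \\ 0 & \eta_1 & 0 \\ 0 & 0 & \eta_3 \end{pmatrix}. \tag{102}
\]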


16) The necessary strains are only approximate 
because the lattice parameters of both 
austenite and martensite in steels are 
functions of carbon content. 
Fig. 7.40 A microscopically twinned martensite plate gives a
macroscopic shear. 





If the total distortion is described by Eq. (103), then the matrix F is given by Eq. 
(104). 


\[ \mathbf{r}' = F\,\mathbf{r} \tag{103} \]
\[ F = (1 - x)\,\Phi_1 T_1 + x\,\Phi_2 T_2 \tag{104} \]


Here two rotation matrices, Φ₁ and Φ₂, have been introduced. These two matrices
are, at this point, unknowns in the problem. However, the relative rotation between
regions 1 and 2 can be established. A vector r = [110] transforms into the
vector r₁′ = [η₁, η₂, 0] in region 1 by the Bain distortion T₁. The resulting vector is
shown in Fig. 7.41. On the other hand, the same vector in region 2 becomes
r₂′ = [η₂, η₁, 0] and, as demonstrated in Fig. 7.41, application of the Bain distortions
to the two regions creates a gap. To “fill in” the gap, notice that the magnitude
of r₁′ is equal to that of r₂′, meaning that both vectors must lie in the coherent
twin plane. A rigid rotation of region 2 through the angle φ between the two vectors
therefore closes the gap.
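
The relative rotation between the twin variants can be estimated directly from the two transformed vectors. The following sketch assumes the images of [110] are [η₁, η₂, 0] in region 1 and [η₂, η₁, 0] in region 2, and uses the Fe-30.9 wt.%Ni lattice parameters quoted later in this section; the function name is hypothetical.

```python
import math

def twin_rotation_deg(eta1, eta2):
    """Angle between r1' = [eta1, eta2, 0] and r2' = [eta2, eta1, 0]."""
    cos_phi = 2.0 * eta1 * eta2 / (eta1**2 + eta2**2)
    return math.degrees(math.acos(min(1.0, cos_phi)))

# Lattice parameters in angstroms (Fe-30.9 wt.%Ni).
a0, a, c = 3.591, 2.875, 2.875
eta1 = math.sqrt(2.0) * a / a0
eta2 = c / a0

r1 = (eta1, eta2, 0.0)   # image of [110] in region 1
r2 = (eta2, eta1, 0.0)   # image of [110] in region 2
len1 = math.sqrt(sum(x * x for x in r1))
len2 = math.sqrt(sum(x * x for x in r2))
phi = twin_rotation_deg(eta1, eta2)

print("lengths:", len1, len2)
print("relative rotation (degrees):", phi)
```

Because the two vectors have equal length, a pure rotation about the axis normal to both suffices to bring them into coincidence; no additional stretch is needed.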

Fig. 7.41 Two twin variants within a martensite plate. To maintain a
coherent twin plane, region 2 must be rotated by an amount φ relative
to region 1.

With the relative rotations between the twins established, the habit plane normal
can be found from an algebraic analysis of the transformation strain distortions
and the requirement that an invariant plane exists. The derivation of the
habit plane crystallography is quite lengthy and we shall only quote the final result
here. The components h, k, and l of the habit plane normal, n̂, are given by
Eqs. (105), where η₃ = η₁.


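\[
\begin{aligned}
h &= \frac{1}{2\eta_1}\left[\sqrt{\frac{\eta_1^{2}+\eta_2^{2}-2\eta_1^{2}\eta_2^{2}}{1-\eta_2^{2}}} - \sqrt{\frac{2-\eta_1^{2}-\eta_2^{2}}{1-\eta_2^{2}}}\,\right] \\
k &= \frac{1}{\eta_1}\sqrt{\frac{\eta_1^{2}-1}{1-\eta_2^{2}}} \\
l &= \frac{1}{2\eta_1}\left[\sqrt{\frac{\eta_1^{2}+\eta_2^{2}-2\eta_1^{2}\eta_2^{2}}{1-\eta_2^{2}}} + \sqrt{\frac{2-\eta_1^{2}-\eta_2^{2}}{1-\eta_2^{2}}}\,\right]
\end{aligned}
\tag{105}
\]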


Notice that the habit plane will, in general, have irrational indices. The only input 
required in the Wechsler-Lieberman-Read/Bowles-Mackenzie crystallographic
theory is the lattice parameters of the austenite and martensite phases, both of
which can be easily and accurately measured. An fcc to bcc martensite reaction
takes place in Fe-30.9 wt.%Ni such that the lattice parameters are a₀ = 3.591 Å,
and a = c = 2.875 Å. Equation (101), combined with the central result, Eq. (105),
predicts a habit plane of h = 0.1848, k = 0.7823, and l = 0.5948. In an X-ray
diffraction experiment, Breedis and Wayman (1962) found the habit plane to be
close to (3 14 10), a result that is a scant 1.5° different from the theoretical prediction.
Excellent agreement between theory and experiment has also been reported
in the Fe-Pt system (again a fcc → bcc transition) (Nishiyama 1978) and the cubic
to orthorhombic transformation in ordered Au-47.5 at.%Cd (Lieberman et al.
1955). Another success of the theory is for In-20.72 at.%Tl. The habit plane in 
this alloy is the low index (011) whereas the predicted normal is (0.013, 0.993, 
1.00), a discrepancy of only 0.43°. 
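
The Fe-Ni prediction can be reproduced numerically. The sketch below is written under the assumption that the habit plane components take the closed form coded in `habit_plane` (a hypothetical function name), with η₁ = √2 a/a₀ and η₂ = c/a₀ and with the labels h, k, l assigned so as to reproduce the values quoted in the text. It then compares the predicted normal with the measured (3 14 10) plane.

```python
import math

def habit_plane(eta1, eta2):
    """Habit plane normal (h, k, l) from the two principal Bain distortions."""
    root_sum = math.sqrt((eta1**2 + eta2**2 - 2.0 * eta1**2 * eta2**2)
                         / (1.0 - eta2**2))
    root_diff = math.sqrt((2.0 - eta1**2 - eta2**2) / (1.0 - eta2**2))
    h = (root_sum - root_diff) / (2.0 * eta1)
    k = math.sqrt((eta1**2 - 1.0) / (1.0 - eta2**2)) / eta1
    l = (root_sum + root_diff) / (2.0 * eta1)
    return h, k, l

def angle_deg(u, v):
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u) * sum(y * y for y in v))
    return math.degrees(math.acos(max(-1.0, min(1.0, dot / norm))))

# Lattice parameters in angstroms (Fe-30.9 wt.%Ni).
a0, a, c = 3.591, 2.875, 2.875
eta1 = math.sqrt(2.0) * a / a0
eta2 = c / a0

h, k, l = habit_plane(eta1, eta2)
norm_sq = h * h + k * k + l * l
misorientation = angle_deg((h, k, l), (3, 14, 10))

print("predicted normal:", round(h, 4), round(k, 4), round(l, 4))
print("squared length:", norm_sq)
print("angle to (3 14 10) in degrees:", misorientation)
```

The three components form a unit vector by construction, and the predicted normal falls within about two degrees of the measured plane; small differences from the quoted digits reflect rounding of the lattice parameters.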


7.5.5 
Nucleation and Dislocation Models of Martensite 


The Wechsler-Lieberman-Read/Bowles-Mackenzie theory does not address the 
questions of why a habit plane forms in the first place, how an individual marten- 
site plate forms initially, or how the atoms move cooperatively as the plate shoots 
through the material. To extend the discussion beyond the phenomenological 
stage, a mechanism for the transformation is needed, and perhaps the logical 
starting point is homogeneous nucleation theory. Unfortunately, experiments 
have confirmed that martensite reactions are not a result of homogeneous nucle- 
ation. Small-particle experiments (Cech and Turnbull 1956; Huizing and Kloster- 
mann 1966; Easterling and Swann 1971), pioneered by Cech and Turnbull (1956), 
studied the transformation behavior in spheres of Fe-Ni alloys whose diameters
varied from below a micron to a fraction of a millimeter. The experiments dem- 
onstrated that, even after cooling to 4 K, an undercooling of 600-700 °C, not all 
particles transformed to martensite, a finding that rules out both homogeneous 
nucleation and nucleation at the surface. Experiments also showed that the num-
ber of nuclei is independent of grain size, meaning grain boundaries are not a 
preferred site for plate formation. If the martensite phase initiates at some defect 
site in the material that is neither grain boundaries nor surfaces, then the most 
logical heterogeneous site is dislocations. 

Perhaps the easiest example with which to illustrate the role of dislocations is 
the fcc to hexagonal close-packed transformation that occurs in pure cobalt. An 
fcc and an hcp structure both contain close-packed layers of atoms, but the two 
lattices differ in their stacking sequence; fcc has an ABCABC stacking, whereas 
hcp has an ABABAB pattern. A partial dislocation of the type a/6<112> that 
glides along a (111) plane will leave behind a stacking fault, i.e., a disruption in 
the stacking sequence such that an A plane is shifted to the B position (and 
B → C, C → A). Passage of a partial dislocation across every other (111) fcc plane
creates the hcp structure (see Chapter 6). An additional very small dilatation per- 
pendicular to (111) is required to contract the ideal c/a ratio (1.633) produced by 
the dislocation mechanism to the value of 1.623 observed in Co. From this pro- 
cess one can establish the orientation relationships in Co [Eqs. (106)], which are 
known as the Shoji-Nishiyama relations (Nishiyama 1978). 

\[ (111)_{\mathrm{fcc}} \parallel (0001)_{\mathrm{hcp}}, \qquad [11\bar{2}]_{\mathrm{fcc}} \parallel [1\bar{1}00]_{\mathrm{hcp}} \tag{106} \]


Also, a shear angle of tan⁻¹(√2/4) = 19.5° is readily found. Unlike the fcc → hcp
transformation, however, it is impossible to construct a bcc martensite product 
phase using only the dislocations commonly encountered in fcc materials. 
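
The stacking argument for cobalt can be sketched in a few lines. The code below is a schematic illustration only: layers are labeled by their stacking position, each glide of a partial shifts every layer above it by one position (A → B, B → C, C → A), and the shear angle is obtained from the partial displacement a/√6 accumulated over two (111) layer spacings of a/√3 each. The function names are hypothetical.

```python
import math

def fcc_stack(n_layers):
    """Stacking letters of n close-packed layers in fcc order."""
    return "".join("ABC"[i % 3] for i in range(n_layers))

def stack_after_partials(n_layers, every=2):
    """Stacking after a partial has glided on every `every`-th plane.

    Layer i lies above i // every glide planes, so it is shifted
    that many times by one stacking position.
    """
    return "".join("ABC"[(i + i // every) % 3] for i in range(n_layers))

print(fcc_stack(8))
print(stack_after_partials(8))

# Shear angle of the transformation, with the lattice parameter set to 1.
a = 1.0
displacement = a / math.sqrt(6.0)      # length of a/6<112>
height = 2.0 * a / math.sqrt(3.0)      # two (111) interplanar spacings
shear_angle = math.degrees(math.atan(displacement / height))
print(round(shear_angle, 2))
```

The ratio of displacement to height reduces to √2/4, which is the argument of the inverse tangent quoted above.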

In the mid 1970s, Olson and Cohen published a series of papers describing a 
dislocation-assisted theory of martensite plate formation that has gained some 
acceptance in the subsequent years (Olson and Cohen 1976a,b,c). The model extends
the dislocation concepts of shear transformations to obtain a critical nucleation
event. The Olson-Cohen model uses energy concepts of partial dislocations.
Recall the situation in an fcc structure where a perfect dislocation of Burgers vector
b dissociates spontaneously into two partials, for example as in Eq. (107), where
the total strain energy of the dislocation on the left-hand side of the reaction
(proportional to b²) is greater than the sum of the strain energies for the two partials
(a²/2 > a²/3).

\[ \frac{a}{2}[1\bar{1}0] \rightarrow \frac{a}{6}[2\bar{1}\bar{1}] + \frac{a}{6}[1\bar{2}1] \tag{107} \]





The two partials in Eq. (107) are oriented to repel each other, but between them 
is a stacking fault that is unfavorable energetically. An equilibrium spacing is 
achieved when the stacking fault energy compensates the elastic energy effect. 
In the Olson-Cohen model, the stacking fault energy is a function of temperature.
At the temperature where the stacking fault energy goes to zero, the separation
between the partial dislocations can grow without limit. The character of the
partial dislocations in the Olson-Cohen model is unusual, however, with one set
having a very small Burgers vector, giving a local kinked region spread over a
few crystal planes. A three-step process is required to obtain the transformation 
product. 
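
The energy argument behind the dissociation of a perfect dislocation into partials can be verified with exact arithmetic. The sketch assumes the reaction (a/2)[1 −1 0] → (a/6)[2 −1 −1] + (a/6)[1 −2 1], where the signs of the indices are one consistent choice on the (111) plane, and takes the strain energy of each dislocation as proportional to the square of its Burgers vector; the helper name is hypothetical.

```python
from fractions import Fraction

def b_squared(coeff, uvw):
    """|b|^2 in units of a^2 for a Burgers vector b = coeff * a * [u v w]."""
    return coeff**2 * sum(x * x for x in uvw)

# Assumed reaction: a/2[1 -1 0] -> a/6[2 -1 -1] + a/6[1 -2 1]
perfect = (Fraction(1, 2), (1, -1, 0))
partials = [(Fraction(1, 6), (2, -1, -1)),
            (Fraction(1, 6), (1, -2, 1))]

# The Burgers vector must be conserved by the reaction.
perfect_vector = tuple(perfect[0] * x for x in perfect[1])
partial_sum = tuple(sum(c * v[i] for c, v in partials) for i in range(3))

# Energies (proportional to b^2) before and after dissociation.
energy_perfect = b_squared(*perfect)
energy_partials = sum(b_squared(c, v) for c, v in partials)

print("vector conserved:", perfect_vector == partial_sum)
print("b^2 before:", energy_perfect, " b^2 after:", energy_partials)
```

The comparison of a²/2 with a²/3 shows why the dissociation is favorable before the stacking fault energy between the partials is taken into account.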


7.5.6 
Soft Mode Transitions, the Clapp Lattice Instability Model 


In Section 7.4 it was established that some materials can become unstable with 
respect to infinitesimally small fluctuations in concentration. Using a completely 
analogous approach, one can establish conditions for the stability of a crystal with 
respect to a set of strain components. In terms of the elastic constants, the stabil- 
ity criteria for a cubic crystal are stated in Eqs. (108) (compare Chapter 4). 

\[
\begin{aligned}
C_{44} &> 0 \\
C_{11} &> 0 \\
C_{11}^{2} &> C_{12}^{2} \\
\tfrac{1}{2}\,(C_{11} - C_{12}) &> 0
\end{aligned}
\tag{108}
\]


Each of the individual stability criteria listed in Eqs. (108) corresponds to a 
different mode of deformation. For example, the last inequality, which is of pri- 
mary interest in the study of martensite, defines the stability with respect to 
small-amplitude shears on (110)-type planes in <110> directions. If the difference
(C₁₁ − C₁₂) vanishes at the Ms temperature, the material will shear spontaneously.
This mode of deformation is known as a “soft mode,” and the reaction is
termed a “soft mode instability.” The very first explanation of the martensite phe- 
nomenon (Scheil 1932; Zener 1948) was based on a soft mode instability. In 1973 
Clapp extended the soft mode concept one step further. The free energy per unit 
volume of a crystalline solid can be written in the form of Eq. (109) (Clapp 1973), 
where F₀ is the free energy of the strain-free material. Here the convenient Voigt
notation for the strains η has been employed.¹⁷⁾


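\[
F = F_0 + \frac{1}{2}\sum_{i=1}^{6}\sum_{j=1}^{6} C_{ij}\,\eta_i\eta_j
+ \frac{1}{3!}\sum_{i=1}^{6}\sum_{j=1}^{6}\sum_{k=1}^{6} C_{ijk}\,\eta_i\eta_j\eta_k \tag{109}
\]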


17) Here η₁ = ε₁₁, η₂ = ε₂₂, η₃ = ε₃₃, η₄ = ε₁₂, η₅ = ε₁₃, and η₆ = ε₂₃ (compare Chapter 4).

Finally, the last sum in Eq. (109) represents contributions to the free energy due
to higher-order, anharmonic strains, and the Cᵢⱼₖ variables refer to higher-order
elastic stiffness constants. We have seen in the theory of spinodal decomposition
that the stability of a homogeneous solution is determined by the sign of the second
derivative of the free energy with respect to composition. Stability with respect
to strain is governed by the sign of each of the six eigenvalues of the 6 × 6
matrix of the second derivatives [Eq. (110)].


\[ \frac{\partial^{2} F}{\partial \eta_i\,\partial \eta_j} \qquad (i, j = 1, \ldots, 6) \tag{110} \]

If a particular eigenvalue vanishes at a given temperature, the material is unsta- 
ble with respect to the deformation mode, described by the associated eigenvector. 
The stability analysis applied to a free energy written in terms of only the first 
summation in Eq. (109) will reproduce the inequalities presented in Eq. (108). 
However, Clapp argues that the anharmonic contribution may be important in 
certain cases and should not be neglected. 
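
For the harmonic part of the free energy, the stability analysis reduces to the eigenvalues of the 6 × 6 stiffness matrix. The sketch below builds the standard Voigt stiffness matrix of a cubic crystal, checks the three distinct eigenvalues against representative eigenvectors, and mimics a soft mode by letting C₁₂ approach C₁₁. The stiffness values are hypothetical placeholders, not data for any alloy, and the function names are illustrative.

```python
def cubic_stiffness(c11, c12, c44):
    """6x6 Voigt stiffness matrix of a cubic crystal."""
    C = [[0.0] * 6 for _ in range(6)]
    for i in range(3):
        for j in range(3):
            C[i][j] = c11 if i == j else c12
        C[i + 3][i + 3] = c44
    return C

def matvec(C, v):
    return [sum(C[i][j] * v[j] for j in range(6)) for i in range(6)]

def cubic_eigenvalues(c11, c12, c44):
    """Distinct eigenvalues, each with one representative eigenvector."""
    return {
        "dilatation": (c11 + 2.0 * c12, [1, 1, 1, 0, 0, 0]),
        "tetragonal shear": (c11 - c12, [1, -1, 0, 0, 0, 0]),
        "trigonal shear": (c44, [0, 0, 0, 1, 0, 0]),
    }

def is_stable(c11, c12, c44):
    """True when every eigenvalue of the stiffness matrix is positive."""
    return all(val > 0.0 for val, _ in cubic_eigenvalues(c11, c12, c44).values())

# Hypothetical stiffnesses in GPa; c12 is raised toward c11 to mimic softening.
c11, c44 = 200.0, 110.0
for c12 in (130.0, 190.0, 200.0):
    C = cubic_stiffness(c11, c12, c44)
    for name, (val, vec) in cubic_eigenvalues(c11, c12, c44).items():
        Cv = matvec(C, vec)
        # Confirm that C v = lambda v for the listed eigenpair.
        assert all(abs(Cv[i] - val * vec[i]) < 1e-9 for i in range(6))
    print(c12, c11 - c12, is_stable(c11, c12, c44))
```

The eigenvalue C₁₁ − C₁₂ belongs to the shear mode singled out in the text; when it reaches zero the crystal loses its resistance to that deformation while the other two modes remain stiff.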

Dislocation models of the martensite transformation describe the reaction 
in terms of a collective dissociation and splitting of specific arrays of defects, 
whereas the lattice instability model views the transition as the onset of an elastic 
instability in the crystal. Which interpretation is correct? In agreement with the 
aforementioned small-particle experiments, both models identify dislocations as 
the heterogeneous site associated with the initiation of martensite plates. The 
lattice instability approach works well for a number of systems, but fails in the 
important case of steels and other ferrous alloys. The Olson-Cohen description
appears to capture the main features of the fcc to hcp transformation and cor- 
rectly predicts the Kurdjumov-Sachs relationship found in certain steels, but it 
cannot account for the variety of habit planes observed in different Fe alloys. It 
is safe to say that we are many years away from a complete theory of the marten- 
site transformation. 


7.6 
Outlook 


Studies of phase transformations and alloy phase equilibria have considerable 
history and well-established scientific principles behind them. Alloy phase stabil- 
ity and kinetic processes remain central to modern metals physics, however, be- 
cause they are essential to understanding the microstructures that control alloy 
properties. Fortunately, the outlook for studies of alloy phases over the next de- 
cade shows a dynamic field that is poised for progress in several important areas: 
• Thermodynamics. Over the past decades, electronic structure
calculations have become accessible to more scientists,
practical on more systems, and better understood in their
reliability. Improvements in computing hardware and
software will further these trends, but fundamental advances
are also expected. Specific progress is expected in predictive
capabilities for magnetic systems of transition metals and
rare earths. It will be interesting to see the directions taken
to understand the thermodynamics of actinide systems.
At high temperatures anharmonicity becomes increasingly
important, and understanding anharmonicity has remained a 
challenge for many years. Progress on electron-phonon and
phonon-phonon interactions will further our understanding 
of high-temperature thermodynamics. 

The study of alloy thermodynamics under extreme 
conditions is also motivated by research in geophysics and 
high-pressure physics. Today new experimental tools are 
available for pressures beyond a megabar, and these are 
expected to stimulate work on the thermodynamics of alloy 
phases under pressure, and work on materials at high 
pressures and temperatures. 

• Kinetics. A fundamental understanding of kinetics will always
be at least as difficult as that of thermodynamics. A kinetic 
process must accommodate the thermodynamic end state, 
and the mechanism of atom movements adds richness to the 
process. Developments in analytical theory are likely to 
continue, but there is no doubt that the opportunities for 
computational modeling will expand significantly. Monte 
Carlo methods and phase-field modeling will continue to 
address kinetic phenomena of increasing complexity, and will 
allow more rigorous results on the systems studied today. 
The more detailed results from this computational work will 
facilitate experimental tests of key predictions. 

• Nanostructured materials. Much of what is device engineering
at the micrometer scale becomes atom engineering at the 
nanometer spatial scale. The effects of interfaces and 
quantum confinement alter the thermodynamics of alloy 
phases in nanostructures, and these phenomena also offer 
opportunities for new device functionality. The widespread 
interest in applications of thin films, nanowires, and atom 
clusters offers the real possibility of the discovery of new 
phenomena in alloy phase equilibria and phase 
transformations. 


The field of phase equilibria and phase transformations began with Gibbs, and 
classical concepts from his work (Gibbs 1876) remain vital some 130 years later. 
Over time, our increased understanding of electrons, bonding, and atomic struc- 
ture of alloys has enabled a detailed understanding of individual components of 
the enthalpy and entropy that underlie the thermodynamics. Free energy func- 
tions can now be constructed from these fundamental parts. This level of detail 
is migrating toward our understanding of kinetic processes and more complex 
phase transformations. Progress in phase equilibria and phase transformations 
continues to produce practical results, and good rewards for intellectual effort. 
We see innovation and discovery for decades to come. 
8
Kinetics in Nonequilibrium Alloys 


Pascal Bellon and Georges Martin 


The alloys (more broadly, materials as well as devices) which we live with are 
mostly used in nonequilibrium states: they have been processed in such a way 
as to achieve useful properties. The built-in microstructure is retained either be- 
cause of sluggish kinetics back to equilibrium, or because it is clamped into the 
nonequilibrium state by some difficult nucleation process. An example of the for- 
mer is the slow coarsening rate of a finely dispersed precipitate population in 
light alloys for the aircraft industry; examples of the latter are the nucleation of 
graphite in iron-carbon alloys, and of graphite in diamond.

Whatever the case, we know, or at least we know how we could find, what
the alloy configuration should be at equilibrium: the latter minimizes the Gibbs 
free energy (if we deal with isothermal, isobaric, isoconcentration processes). The 
problem we are left with is to find the “kinetic pathway” back to equilibrium: a 
kinetic pathway is a sequence of configurations parameterized with time. 

A quite distinct situation is that of materials subjected to some form of energy 
input, such as an imposed gradient of chemical potential, or a flow of energetic 
particles in an irradiation environment, or dynamical mechanical straining. For 
sure, the alloy configuration adapts to the environment, but we lack a universal 
principle to tell us whether such a driven material will achieve some stationary 
state or not, whether that state is unique or not, and if it is obtained, what functional,
if any, that state optimizes.¹⁾

The above considerations have dramatic consequences on alloy design strategy, 
e.g., for nuclear materials which operate in an irradiation environment. Unlike 
the classical case of relaxation toward equilibrium, we do not know the configuration
to which the driven alloy should evolve. The only way we have to anticipate the 
future evolution is to model it. Imagine the situation if, in order to know whether 
H₂O is a gas or a liquid, we had to model the nucleation of the liquid out of a


1) In the early 1970s, Prigogine and his school Cie, 1971]; soon afterward, they acknowl- 
tried to promote such general principles, edged the limitations of such an approach 
based on the excess of entropy production [G. Nicolis, I. Prigogine: Self Organization in 
rate [P. Glansdorff, I. Prigogine, Structure Non Equilibrium Systems, Wiley, New York, 
Stabilité et Fluctuations, Paris, Masson et 1977]. 
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vapor, in the absence of thermodynamic concepts! One consequence of this is 
that a reliable theory and modeling of driven alloys must rest on a robust descrip- 
tion of the kinetics. The description must be such that, in the absence of external 
forcing, we recover the classical thermodynamic behavior of the material. In 
some particular cases (to be discussed in Section 8.2) one may define a functional 
as a minimum under stationary conditions, and as giving information on the re- 
spective stability of competing stationary states. The counterpart of phase dia- 
grams could therefore be built, which we named “dynamical equilibrium phase 
diagrams” in order to make it clear that we deal with stationary dynamical states, 
rather than equilibrium states in the thermodynamic sense. 

The purpose of this chapter is to summarize the state of affairs in that field. 
Because of the importance of diffusion-controlled kinetics in the evolution of con- 
figurations, we begin with a complementary account of the classical modeling 
techniques for nucleation, growth, and coarsening: the latter have been addressed 
in Chapter 7. The complements we give are useful for the second part, which 
deals with driven alloys.2) 


8.1 
Relaxation of Nonequilibrium Alloys 


The aim here is to emphasize the complementarities of the various descriptions 
of the kinetic pathway for phase separation: one is based on the time evolution of 
concentration fields, the other on the time evolution of the precipitate distribu- 
tion. The latter splits into two subgroups. In the dilute case, one may describe 
the time evolution of atomic clusters, as seen by three-dimensional atom probe, 
or as simulated by kinetic Monte Carlo (KMC) methods: this is the so-called clus- 
ter dynamics method. In the concentrated case, isolated atomic clusters can no 
longer be safely identified: one speaks of precipitates in a solid solution and the 
appropriate modeling tool is the classical nucleation theory. 

Such tools and the relevant concepts have been introduced in Chapter 7, but 
here we discuss several points which are of key importance for understanding 
the modeling of driven alloys: these include the definition of the diffusion 
coefficients which enter the models, and the way fluctuations are built into the models. 
The modeling of driven alloys, which, as already stressed, is based on kinetics, 
must be such that in the absence of forcing, classical thermodynamics is recov- 
ered. In the following, we give a fully kinetic interpretation of the classical nucle- 
ation theory so as to prepare the link with driven alloys. We restrict ourselves to 
diffusion-controlled coherent precipitation with or without concomitant ordering. 


2) Historically, the reverse did occur: some 
difficulties we met when modeling driven 
alloys stimulated us to reconsider the 
classical theories. 




8.1.1 
Coherent Precipitation: Nothing but Solid-State Diffusion 


Coherent precipitation is indeed nothing but a diffusion process: atoms exchange 
lattice sites (e.g., because of vacancy jumps) at a frequency which depends on 
their local environment. In a solid solution with a clustering tendency, bonds be- 
tween like atoms are energetically favored, so that bonds between pairs of like 
atoms undo less easily than they form. Qualitatively, one may say that, in an 
undersaturated solid solution, because solute atoms are rare, it takes a long time 
for one solute to reach a region where solute has clustered; as a consequence, a 
solute pair can dissociate before it is reached by one more solute atom. In a 
supersaturated solid solution, the reverse is true: the time lag for one solute 
atom to reach one cluster is short compared with the lifetime of a given solute 
atom at a cluster. Without going into more detail for the time being, it is seen 
that the very same atomic jumps are at the origin of the fluctuations in the 
undersaturated solid solution and control the kinetic pathway for phase 
separation.3) This is well exemplified by KMC simulations, where the input 
parameters define only the frequency of atomic jumps as a function of the local 
environment: because of microreversibility, the set of jump frequencies defines 
the thermodynamics for the alloy.4) Indeed, if $P_e(i)$ is the equilibrium 
probability of configuration (i), and $W(i \to j)$ is the transition probability 
per unit time from configuration (i) to (j), detailed balance leads to Eq. (1a) 

$$P_e(i)\,W(i \to j) = P_e(j)\,W(j \to i) \tag{1a}$$

From Eq. (1a) we get: 

$$\frac{P_e(i)}{P_e(j)} = \exp\{-\beta[E(i) - E(j)]\} \tag{1b}$$

$E(i)$ is the internal energy of configuration (i) and $\beta = 1/k_B T$. To go 
from Eq. (1a) to (1b), we have used the fact that the atomic jump is a thermally 
activated process, so Eq. (1c) holds 

$$W(i \to j) = \nu \exp\{-\beta[E^{s}_{ij} - E(i)]\}, \tag{1c}$$

where $E^{s}_{ij}$ $(= E^{s}_{ji})$ is the energy of the system at the saddle 
point between configurations (i) and (j) [$\nu$ is an attempt frequency, which 
we take as a constant for the sake of simplicity].5) 
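
The content of Eqs. (1a)-(1c) can be checked numerically on a toy case. The following Python sketch is only an illustration: the three-configuration system, its energies and saddle-point energies, and the names `jump_rate`, `W`, and `net_flow` are assumptions introduced here, not quantities from the text. It builds the jump frequencies from Eq. (1c) and prints both sides of Eq. (1a) for the Boltzmann distribution.

```python
import math

def jump_rate(e_saddle, e_initial, beta, nu=1.0):
    """Eq. (1c): thermally activated jump frequency out of a configuration
    of energy e_initial over a saddle point of energy e_saddle."""
    return nu * math.exp(-beta * (e_saddle - e_initial))

# Hypothetical system: three configurations, energies in units of k_B*T.
BETA = 1.0
ENERGY = [0.0, 0.4, 1.1]                           # E(i)
SADDLE = {(0, 1): 2.0, (0, 2): 2.5, (1, 2): 1.8}   # E^s_ij = E^s_ji

def W(i, j):
    """Transition probability per unit time from configuration i to j."""
    key = (min(i, j), max(i, j))   # the saddle point is shared by i->j and j->i
    return jump_rate(SADDLE[key], ENERGY[i], BETA)

# Boltzmann probabilities, the expected equilibrium distribution P_e(i).
Z = sum(math.exp(-BETA * e) for e in ENERGY)
P_EQ = [math.exp(-BETA * e) / Z for e in ENERGY]

def net_flow(p, i):
    """dP(i)/dt from the master equation built on the rates W."""
    n = len(p)
    return sum(p[j] * W(j, i) - p[i] * W(i, j) for j in range(n) if j != i)

for i in range(3):
    for j in range(i + 1, 3):
        lhs = P_EQ[i] * W(i, j)   # left-hand side of Eq. (1a)
        rhs = P_EQ[j] * W(j, i)   # right-hand side of Eq. (1a)
        print(f"pair ({i},{j}): {lhs:.6e} vs {rhs:.6e}")
```

Only the jump frequencies are specified as input; the equilibrium distribution follows from them, which is the sense in which the kinetics defines the thermodynamics. Changing the saddle-point energies alters the rates but not the stationary distribution.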


3) This is an example of the fluctuation dissipation theorem. 

4) The reverse is not true: for a given thermodynamics, several kinetic pathways 
are possible. 

5) According to the transition state theory (see Chapter 5), 
$\nu_{ij} = \prod_k \omega_k^{i} / \prod_{k'} \omega_{k'}^{s}$, where the 
$\omega$'s are the stable eigenmode vibration frequencies in configuration (i) 
and at the saddle point between (i) and (j). At the saddle point position, one 
mode is unstable, so that the product in the denominator contains one factor 
fewer than in the numerator. The denominator cancels out of the ratio in Eq. 
(1b), and the numerator can be incorporated into the E's in the form of a 
vibrational entropy (cf. C.P. Flynn, Point Defects and Diffusion, Clarendon 
Press, Oxford, 1972, p. 319). 

Once it is realized that the set of all possible jump frequencies (i.e., transition 
probabilities between all the possible configurations) defines both the alloy 
thermodynamics and the kinetic pathway for the relaxation toward equilibrium, we 
are left with the following question: to what extent do the classical theories (the 
Cahn-Hilliard equation for spinodal decomposition, cluster dynamics, nucleation 
theory) relate to the latter set of jump frequencies? This question is a key one for 
modeling driven alloys, since, as we will see in Section 8.2, the forcing imposed 
on the configuration will be modeled by a set of extra transition probabilities 
superimposed on those we have just discussed, which we call “intrinsic” in 
the following. 


8.1.2 
Cluster Dynamics, Nucleation Theory, Diffusion Equations: Three Tools for 
Describing Kinetic Pathways 


Experimentalists have access to a variety of tools for observing and describing a 
kinetic pathway. Scattering techniques (neutron or X-rays; see Chapter 13.1) oper- 
ate in reciprocal space and give the Fourier transform of the composition field; 
they allow for in-situ determination of time evolution. Other techniques operate 
in real space, at coarse spatial resolution (classical metallography), or at a finer 
resolution by transmission electron microscopy (TEM) [1], or at atomic resolution 
by three-dimensional tomographic atom probe (3DTAP) [2] (see Chapter 13.2). 
These latter techniques give a post mortem image at a given time. Depending 
on the spatial resolution, they exhibit either clusters of solute atoms (3DTAP, 
with a yield of about 60%), or precipitates. From such data, various types of statis- 
tical information can be extracted: size distribution, shape distribution, correla- 
tions in position and size, composition, solute distribution in the vicinity and 
within the precipitate, etc. 

From the theoretical and modeling side, three distinct tools are available: 
cluster dynamics, the output of which is a time-dependent size distribution of 
clusters; the classical theory of nucleation, which yields the incubation time and 
the nucleation rate of precipitates out of a supersaturated solution; and diffusion 
equations, to model the time evolution of the concentration (and order) fields of 
the various components of the alloy. 


Cluster dynamics (CD) This relies on a representation of the alloy as a lattice gas 
of solute clusters: a cluster is characterized by the number of solute atoms from 
which it is made. For instance, if the phase to precipitate is a disordered solid 
solution, a cluster with size n (an “n-mer”) is a set of n solute atoms, each of which 
has at least one nearest neighbor belonging to the cluster. An “n-mer” can adopt 
several distinct configurations, ranging from the most diffuse one, n atoms in a 
dense row, to the most compact; each cluster therefore has a free energy of its 
own. Hence, at equilibrium, the gas of clusters exhibits a free energy with two 
distinct contributions: the sum of the free energies of the n-mers and the mixing 
entropy of the n-mers on the lattice. Cluster dynamics therefore implies a 
thermodynamic model for the alloy, viewed as a lattice gas of clusters. For such a 
representation to be meaningful, the alloy must be sufficiently dilute to be far below 
the percolation limit. 


Classical nucleation theory (CNT) Despite its elegance, cluster dynamics fails to 
account for the relaxation of concentrated solid solutions, where clusters cannot 
be defined unambiguously, and for diffusion mechanisms more complex than 
single-atom impurity diffusion. Also, since it relies on a thermodynamics of its own, CD 
cannot be cast into the classical thermodynamic models, e.g., in order to define 
a driving force for nucleation. The classical nucleation theory, on the contrary, 
has none of these drawbacks. However, at some stage it requires a link with CD, 
which is stated in all textbooks, but not established precisely. In the following we 
propose a technique to establish the latter link formally. 


Diffusion equations At some scale (to be discussed), the alloy can be character- 
ized by a set of concentration fields for the solvent and for the various solutes. 
Precipitates appear as a local change in concentration, delineated by a more or 
less diffuse interface. Such a picture can be obtained using low spatial resolution 
characterization techniques (e.g., conventional microscopy), or from a proper 
averaging, over several precipitates, of atomic resolution pictures (see below). 
Modeling a kinetic pathway implies solving the appropriate diffusion equation. 
To what extent the latter equation relies on the set of atomic jump frequencies 
introduced above will be discussed in the corresponding section. 


8.1.3 
Cluster Dynamics 


As stated above, we describe the alloy as a set of clusters dispersed on the lattice. 
This picture, even in its crudest form, gives a very efficient way to describe both 
the fluctuations in the alloy at equilibrium and the relaxation toward equilibrium. 


8.1.3.1 Dilute Alloy at Equilibrium 

Under equilibrium conditions, the number density of n-mers is stationary and 
minimizes the free energy of the gas of clusters. We first notice that an n-mer 
may exhibit several configurations {i,n}, each with an energy $E_n^i$. An n-mer 
therefore exhibits a free energy given by Eq. (2), with $g_i$ the degeneracy of 
configuration i. 

$$F_n = -\beta^{-1} \ln\Big[\sum_{\{i\}} g_i \exp(-\beta E_n^i)\Big] \tag{2}$$

The free energy of the whole gas of clusters is the sum of the free energies of the 
individual clusters, minus the contribution of the configurational entropy of the 
gas of clusters [Eq. (3a), where $C_n$ is the number of n-mers per lattice site and 
$W$ is the number of ways to arrange $N_n$ clusters of size n (n ranging from 1 to 
infinity) on $N_s$ lattice sites (in the limit of infinite $N_s$)]. 

$$F_{\mathrm{gas}} = \sum_n C_n F_n - k_B T \ln(W) \tag{3a}$$

Assuming the gas of clusters to be very dilute, i.e., neglecting the constraint that 
clusters should not overlap, W is given by Eq. (3b). 





(3b) 


Minimizing $F_{\mathrm{gas}}$ [Eq. (3a)] with the constraint that the total number of 
solute atoms is a constant [Eq. (4a)] yields the equilibrium concentration of n-mers, 
Eq. (4b). 

$$\delta\Big(F_{\mathrm{gas}} - \mu \sum_n n C_n\Big) = 0 \tag{4a}$$

$$C_n = \exp\Big(-\frac{F_n - n\mu}{k_B T}\Big) \tag{4b}$$

Here $\mu$ is the free energy per solute atom in the gas at equilibrium. Provided 
$F_n$ is known for each value of n, $\mu$ is obtained by solving Eq. (4c), where 
$c_{\mathrm{tot}}$ is the total solute concentration in the gas. 

$$\sum_n n C_n(\mu, T) = c_{\mathrm{tot}} \tag{4c}$$

Equation (4c) gives $\mu(T, c_{\mathrm{tot}})$, the chemical potential as a function of 
temperature and of the solute content of the alloy. 

In summary, the concept of a gas of clusters, together with the approximations 
used so far, constitute a thermodynamic model of the alloy. 
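
The way Eq. (4c) fixes the chemical potential can be illustrated with a short Python sketch. Everything numerical here is an assumption made for the example: the capillary form chosen for the cluster free energies (bulk and surface terms only, with arbitrary coefficients), the truncation `N_MAX`, the total concentration, and the function names. The sketch evaluates Eq. (4b) and solves Eq. (4c) for mu by bisection.

```python
import math

N_MAX = 200   # truncation of the sum over cluster sizes (an assumption)
KT = 1.0      # energies are expressed in units of k_B*T

def cluster_free_energy(n, a=0.0, b=3.0):
    """Hypothetical capillary form F_n = a*n + b*n**(2/3); a and b are
    illustrative values, not fitted to any alloy."""
    return a * n + b * n ** (2.0 / 3.0)

def concentration(n, mu):
    """Eq. (4b): equilibrium concentration of n-mers."""
    return math.exp(-(cluster_free_energy(n) - n * mu) / KT)

def total_solute(mu):
    """Left-hand side of Eq. (4c), truncated at N_MAX."""
    return sum(n * concentration(n, mu) for n in range(1, N_MAX + 1))

def solve_mu(c_tot, lo=-30.0, hi=0.0, iterations=200):
    """Bisection on mu; total_solute increases monotonically with mu.
    The upper bound hi = a = 0 is the bulk value of the toy model."""
    if not total_solute(lo) < c_tot < total_solute(hi):
        raise ValueError("c_tot is outside the range covered by [lo, hi]")
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if total_solute(mid) < c_tot:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

C_TOT = 0.02            # hypothetical total solute concentration
MU = solve_mu(C_TOT)
print("mu =", MU)
print("C_1, C_2, C_3 =", [concentration(n, MU) for n in (1, 2, 3)])
```

In this toy model the sum in Eq. (4c) stays finite only up to the bulk value of mu, which plays the role of the solubility limit; the guard in `solve_mu` rejects concentrations beyond it.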

As shown by Eq. (3a), the key quantity in the cluster description is the set of $F_n$, 
the free energies of the n-mers. The latter can be evaluated numerically from 
solute-solute and solute-solvent interactions, by thermodynamic integration 
using Monte Carlo techniques to compute $\partial F_n/\partial n$. According to Perini 
et al. [3], in the Ising model, $F_n$ is given by Eq. (4d). 

$$F_n = a n + b n^{2/3} + c n^{1/3} + d + \tau \ln(n). \tag{4d}$$

In Eq. (4d) the parameters $(a, b, c, d, \tau)$ have been computed for the simple cubic 
lattice. For sufficiently large n, we recover the classical model with a bulk and a 
surface contribution. For small clusters, the next two terms in the expansion can 
be understood as an edge and a vertex contribution. The last term accounts for 
the fact that, since the cluster is of finite size, the cluster/solvent interface cannot 
exhibit the full spectrum of undulations, which contribute to the macroscopic 
interfacial free energy embedded in b [4]. The term $c n^{1/3}$ can be viewed as a 
curvature correction to the interfacial free energy; it was found to be negative. 


8.1.3.2 Fluctuations in the Gas of Clusters at Equilibrium 

In the cluster gas at equilibrium, solute atoms continually evaporate from one 
cluster, diffuse through the solvent, and stick to another cluster. But on average, 
the cluster distribution [Eq. (4b)] is stationary. If we assume that only single 
atoms evaporate and migrate, the probability per unit time of an n-mer becoming 
an (n + 1)-mer ({n} + {1} → {n + 1}) is given by Eq. (5a) 

$$\bar\beta_{n,n+1} = g_n D_1 \bar C_1 \tag{5a}$$

Indeed, the probability of a cluster catching a solute atom per unit time is higher, 
the higher the concentration of monomers, $C_1$, and the more frequently the latter 
jump. In this equation, $D_1$ is the diffusion coefficient of single solute atoms in 
the pure solvent (called the impurity diffusion coefficient in the diffusion 
literature), and $g_n$ is a geometrical factor. One usually chooses 
$g_n \approx 4\pi R_n/\Omega$ where $R_n$ is the radius of a sphere with n solute atoms 
and $\Omega$ is the atomic volume. The latter expression for $g_n$ is obtained by 
computing the stationary flux of solute from infinity to a sphere of radius $R_n$, at 
the surface of which the concentration is kept at zero. More advanced formulations 
are under development [5]. In Eq. (5a) and hereafter, an overbar on a quantity 
implies that the latter is given its equilibrium value. 

The probability per unit time of the reverse process, i.e., where an (n + 1)-mer 
becomes an n-mer by the evaporation of a single solute ({n + 1} → {n} + {1}), is 
such that the equilibrium distribution of clusters is stationary [Eq. (5b)]. 

$$\bar\alpha_{n+1,n}\,\bar C_{n+1} = \bar\beta_{n,n+1}\,\bar C_n \;\Rightarrow\; \bar\alpha_{n+1,n} = g_n D_1 \bar C_1 \frac{\bar C_n}{\bar C_{n+1}} \tag{5b}$$

In summary, describing the alloy as a dilute gas of clusters yields a 
thermodynamic model [Eqs. (4)], with fluctuations built into it; indeed, the 
probability of an n-mer changing size per unit time is 
$(\bar\alpha_{n,n-1} + \bar\beta_{n,n+1})$. Also, note that at infinite temperature 
the model correctly predicts a cluster distribution, which does not reduce to 
isolated single solutes [6]. The gas of clusters therefore yields a thermodynamic 
model with some correlations built into it, at variance with, e.g., the 
Bragg-Williams approximation. 


8.1.3.3 Relaxation of a Nonequilibrium Cluster Gas 
If the cluster gas is not at equilibrium, i.e., if the concentration of n-mers in the 
gas is distinct from that given by Eq. (4b), the clustering process will evolve, i.e., 
monomers will evaporate preferentially from those n-mers of which the number 
density is too large, and condense preferentially on those n-mers which are too 
rare for the gas to be at equilibrium. The net balance of such processes is 
described by the master equation, Eq. (6). 

$$\frac{\partial C_n}{\partial t} = J_{n-1,n} - J_{n,n+1}; \quad J_{n,n+1} = \beta_{n,n+1} C_n - \alpha_{n+1,n} C_{n+1} \tag{6}$$

In Eq. (6), $J_{n,n+1}$ is the net number density of n-mers which catch one solute 
atom per unit time. Note that the quantities in Eq. (6) do not have bars, since 
they have nonequilibrium values. The value to be given to the impingement rates 
$\beta$ is given by Eq. (5a), where the concentration of monomers is given its actual 
value at time t, $C_1(t)$. As for the evaporation rates, $\alpha$, one assumes that the 
evaporation rate is a characteristic of the cluster itself, independently of the cluster 
gas which surrounds it. In other words, $\alpha$ is given the value defined in Eq. (5b). 
The physical picture behind this assumption is that, on average, n-mers explore their 
many configurations at a rate much higher than the impingement or evaporation 
rate. This argument is consistent with the expression we used for $g_n$. More 
advanced formulations are under development [5]. 

As can be seen, once $D_1$ and the set of $F_n$ are known, Eq. (6) can be integrated 
for any initial distribution of clusters $C_n(0)$ up to any time t. In the case where 
the initial cluster distribution corresponds to a supersaturated solid solution, the 
cluster size distribution will evolve toward a bimodal distribution: the first part 
peaks at $C_1$ and decreases to some critical size n*; beyond n*, $C_n(t)$ increases, 
goes through a maximum, decreases again, and vanishes for large values of n. 
The cluster size distribution evolves continuously in time, and depicts the 
nucleation stage at the beginning (with an increase in the number of clusters, keeping 
the average size approximately constant), the growth stage (with a constant 
number of clusters with increasing average size), and later the coalescence stage (with 
a decrease in the number of clusters as the average size increases). 
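
The integration of the master equation, Eq. (6), can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not the scheme used by the authors: reduced units (atomic volume, $D_1$, and $k_BT$ set to 1), a hypothetical capillary free energy for the n-mers, a truncation at `N_MAX` with no growth beyond it, an explicit Euler time step, and a monomer balance written so that the total solute content is conserved.

```python
import math

N_MAX = 30      # largest cluster size tracked (truncation, an assumption)
D1 = 1.0        # monomer diffusion coefficient, reduced units
B_SURF = 3.0    # hypothetical surface free energy coefficient, units of k_B*T

def free_energy(n):
    """Hypothetical capillary form F_n = b*n**(2/3), in units of k_B*T."""
    return B_SURF * n ** (2.0 / 3.0)

def geom(n):
    """g_n = 4*pi*R_n/Omega with Omega = 1 and n*Omega = (4/3)*pi*R_n**3."""
    return 4.0 * math.pi * (3.0 * n / (4.0 * math.pi)) ** (1.0 / 3.0)

def alpha(n):
    """Eq. (5b): evaporation rate for (n+1)-mer -> n-mer + monomer.
    With Eq. (4b), the equilibrium ratio C1*Cn/C(n+1) does not depend on mu."""
    return geom(n) * D1 * math.exp(-(free_energy(1) + free_energy(n)
                                     - free_energy(n + 1)))

def fluxes(c):
    """j[n] = J_{n,n+1} of Eq. (6); c[n] is the n-mer concentration (c[0] unused).
    j[N_MAX] = 0 closes the hierarchy: no growth beyond N_MAX."""
    j = [0.0] * (N_MAX + 1)
    for n in range(1, N_MAX):
        j[n] = geom(n) * D1 * c[1] * c[n] - alpha(n) * c[n + 1]
    return j

def step(c, dt):
    """One explicit Euler step of the master equation."""
    j = fluxes(c)
    new = list(c)
    # Monomers: forming a dimer consumes two of them, any other event one.
    new[1] += dt * (-2.0 * j[1] - sum(j[2:N_MAX]))
    for n in range(2, N_MAX + 1):
        new[n] += dt * (j[n - 1] - j[n])
    return new

def solute(c):
    """Total solute content, sum of n*C_n."""
    return sum(n * c[n] for n in range(1, N_MAX + 1))

def equilibrium(mu):
    """Eq. (4b) for the toy free energies."""
    return [0.0] + [math.exp(-(free_energy(n) - n * mu))
                    for n in range(1, N_MAX + 1)]

c_start = [0.0] * (N_MAX + 1)
c_start[1] = 0.02               # all solute initially as monomers
c_end = c_start
for _ in range(2000):
    c_end = step(c_end, 0.005)
print("solute before/after:", solute(c_start), solute(c_end))
```

Two properties of the scheme are worth checking: the total solute content is unchanged by the time stepping, and the distribution of Eq. (4b) produces vanishing fluxes, so that it is a stationary state of the dynamics.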

Note that in the framework of cluster dynamics, no strict distinction can be 
made between the solid solution and the precipitates: whatever its degree of de- 
composition, the alloy is nothing but clusters dispersed on a lattice. 

Provided the above definition of cluster dynamics is strictly followed [including 
the first three terms of the cluster free energy in Eq. (4d)], it is possible to repro- 
duce, by CD, the results of lattice KMC (LKMC) simulations, using the very same 
set of input parameters, at least in alloys with low solubility limits. This latter 
point has been demonstrated carefully by Clouet et al., for Al(Sc) alloys [7]. 
Moreover, without further adjustment, the master equation can be integrated to much 
longer times. As illustrated by Fig. 8.1, it is found that CD results do reproduce 
the experimentally measured mean radii of precipitates, despite the fact that the 
experiments are done at times which are three to four orders of magnitude greater 
than the longest affordable LKMC simulation [7]. 

Fig. 8.1 Mean precipitate radius as a function of the aging time for an Al-Sc solid 
solution of composition $x^{0}_{\mathrm{Sc}}$ = 0.18 at.% at temperatures T = 300, 350, 
and 400 °C; the point symbols are LKMC results [E. Clouet, M. Nastar, C. Sigli, 
Nucleation of Al3Zr and Al3Sc in aluminum alloys: from kinetic Monte Carlo 
simulations to classical theory, Phys. Rev. B 69 064109 (2004)]; the curve is 
obtained from CD based on the same set of parameters as LKMC. Open symbols: 
experimental results obtained at much longer times [G.M. Novotny, A.J. Ardell, 
Precipitation of Al3Sc in binary Al-Sc alloys, Mater. Sci. Eng. A 318 144-154 
(2001); E.A. Marquis, D.N. Seidman, Nanoscale structural evolution of Al3Sc in a 
dilute Al-Sc Alloy, Acta Mater. 49 1909-1919 (2001)]. The cutoff radius used for CD 
and KMC is $r^{*}_{X} \approx 0.75$ nm ($n^{*}_{X} = 27$) (From ref. [7]). 

The CD formalism, as described above, is currently used for studying materials 
under irradiation. It is called the “rate theory” of defect accumulation, and has 
been enriched by many extensions: solute diffusion proceeds both by vacancy 
and interstitialcy mechanisms; vacancies and self-interstitials (Frenkel pairs) are 
produced by irradiation and may be annihilated by recombination or on lattice 
discontinuities (dislocations, grain boundaries, internal surfaces, etc.); they may 
also agglomerate into point defect clusters, etc. [8]. As will be discussed in Sec- 
tion 8.2, cluster dynamics has been used successfully by Cauvin et al. [9] to ac- 
count for the decrease in solubility observed in certain solid solutions under irra- 
diation (so-called “irradiation-induced precipitation”). 


8.1.4 
Classical Nucleation Theory 


As discussed in Section 8.1.3, cluster dynamics is based on an assumption of 
high dilution; it implies that the alloy has a thermodynamic description of its 
own; and it assumes that solute diffusion proceeds by jumps of single atoms. 
Such limitations do not apply to the classical nucleation theory, which we 
summarize now. Full details are to be found in Chapter 7. 


8.1.4.1 Summary of CNT 

The picture behind CNT is that, unlike in CD, we can unambiguously define pre- 
cipitates embedded in a solid solution (the latter being super- or undersaturated). 
It is then claimed that first-order transitions proceed by the formation (nuclea- 
tion) and the growth of well-defined domains of the second phase (precipitates) 
in the bulk of the mother phase (supersaturated solid solution). According to 
classical thermodynamics, heterophase fluctuations (precursors of the precipitates) 
have an equilibrium number density given by Eq. (7), where $\Delta F$ is the reversible 
work to form the fluctuation in the mother phase. 

$$P(n) \propto \exp(-\beta\,\Delta F) \tag{7}$$

The lower $\Delta F$ is, the more probable the fluctuation. As a consequence, the most 
frequent fluctuations exhibit: 
• a concentration close to that of the precipitate in equilibrium 
  with the solid solution 
• the equilibrium shape (which minimizes the total interfacial 
  energy, e.g., a sphere in the case of isotropic interfacial energy) 
• a size dependence of the form (spherical fluctuation) given by Eq. (8). 

$$\Delta F(R) = \frac{4\pi R^{3}}{3\Omega}\,(f_{\alpha'} - f_{\alpha}) + 4\pi R^{2}\sigma \tag{8}$$

In Eq. (8), R is the radius of the spherical fluctuation, $\sigma$ the interfacial 
energy, $\Omega$ the atomic volume (assumed to be the same in both phases), and 
$f_{\alpha'}$ and $f_{\alpha}$ respectively the free energy per atom in the α′ 
precipitate and in the α solid solution. 

In the case where $f_{\alpha'} > f_{\alpha}$, the free energy per atom is greater in 
the second phase than in the mother phase: the latter is undersaturated; as shown by 
Eq. (8), reversibly decreasing the radius R releases work, so that the dissolution of 
the fluctuations into the mother phase is a natural process. In the opposite case 
($f_{\alpha'} < f_{\alpha}$), one finds a critical radius R* below which reversibly 
increasing the radius costs work and above which it releases work: subcritical 
fluctuations should dissolve spontaneously in the mother phase, while supercritical 
ones would grow. The critical fluctuation with size R* is in an unstable equilibrium 
with the solid solution. 
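
As a worked illustration of Eq. (8), the Python sketch below evaluates the reversible work and locates the critical fluctuation. The closed forms used for the critical radius and for the barrier height follow from setting the derivative of Eq. (8) with respect to R to zero; they are standard results but are not written out in the text. The numerical values of the interfacial energy, atomic volume, and free energy difference are hypothetical.

```python
import math

def delta_f(radius, sigma, omega, df):
    """Eq. (8): reversible work to form a spherical fluctuation of given radius.
    df = f_alpha' - f_alpha is the free energy difference per atom."""
    bulk = 4.0 * math.pi * radius ** 3 / (3.0 * omega) * df
    surface = 4.0 * math.pi * radius ** 2 * sigma
    return bulk + surface

def critical_radius(sigma, omega, df):
    """Radius at which d(delta_f)/dR = 0; exists only if df < 0."""
    if df >= 0.0:
        raise ValueError("no critical radius in an undersaturated solution")
    return -2.0 * sigma * omega / df

def barrier(sigma, omega, df):
    """Value of delta_f at the critical radius."""
    return 16.0 * math.pi * sigma ** 3 * omega ** 2 / (3.0 * df ** 2)

# Hypothetical values in SI units (illustrative only).
SIGMA = 0.1        # interfacial energy, J/m^2
OMEGA = 1.6e-29    # atomic volume, m^3
DF = -1.0e-21      # f_alpha' - f_alpha, J per atom (supersaturated case)

R_STAR = critical_radius(SIGMA, OMEGA, DF)
print("R* (m):", R_STAR)
print("barrier (J):", barrier(SIGMA, OMEGA, DF))
print("delta_f at 0.9 R*, R*, 1.1 R*:",
      [delta_f(x * R_STAR, SIGMA, OMEGA, DF) for x in (0.9, 1.0, 1.1)])
```

The reversible work is largest at R*, which is why the critical fluctuation is in unstable equilibrium: a small decrease or increase of its radius lowers the work in either direction.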

Strictly speaking, the above statements imply that the equilibrium distribution, 
as given by Eqs. (7) and (8), is not stationary! The reason for this apparent contra- 
diction is that the above presentation rests on macroscopic concepts and therefore 
fails to describe the source which generates the fluctuations, the equilibrium dis- 
tribution of which is given correctly by Eqs. (7) and (8). In order to circumvent 
this difficulty, classical presentations of CNT state that the fluctuations can be 
modeled by a polymerization chain of the type: {n} + {1} ⇌ {n + 1}, i.e., one 
similar to that used in cluster dynamics; however, unlike in CD, the clusters as 
well as the rate constants in the master equation are, here, defined intuitively. 
Below, we derive an expression for the source of fluctuations which is fully con- 
sistent with CNT and which makes explicit the missing link between CNT and 
CD. 


8.1.4.2 Source of Fluctuations Consistent with CNT 

We give here the principle of the demonstration. We start from the macroscopic 
formalism used in CNT. The growth rate of a spherical fluctuation of radius R is 
written classically as Eq. (9a), with J(R) the flux of solute at the interface (for the 
sake of simplicity, we assume, without loss of generality, that the fluctuation 
consists of pure solute). 

$$\partial R/\partial t = -J(R)\,\Omega \tag{9a}$$

The flux is proportional to the gradient of diffusion potential 
($\mu_{dif} = \mu_B - \mu_A$) and is computed in the stationary regime [Eq. (9b)] in 
the spherical cell with the inner radius equal to R and the outer one equal to 
$R_{ext}$, and with the boundary conditions given by Eqs. (9b) and (9c). 

$$J(r) = -L\,\Omega^{-1}\,\nabla\mu_{dif}; \quad \partial c/\partial t = -\nabla\cdot J = 0 \tag{9b}$$

$$r = R:\ \mu_{dif}(R) = \bar\mu_{dif} + 2\sigma\Omega/R; \quad r = R_{ext}:\ \mu_B - \mu_A = \mu_{dif}(t) \tag{9c}$$

Note that the intrinsic diffusion coefficient is $D = L\,\partial\mu/\partial c$. The 
solution of the problem, in the limit $R/R_{ext} \ll 1$, is Eq. (9d), with n the number 
of solutes in a precipitate of radius $R_n$, where the value of L is computed in the 
equilibrium solid solution. 

$$\partial n/\partial t = \frac{4\pi R_n}{\Omega}\,L\,[\mu_{dif}(R_{ext}) - \mu_{dif}(R_n)] \tag{9d}$$


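Following a technique introduced by Landauer [10], we rewrite Eq. (9d) as Eq. 
(10). 

$$\partial n/\partial t = \partial n/\partial t|^{+} - \partial n/\partial t|^{-}$$

$$\partial n/\partial t|^{+/-} = \frac{4\pi R_n}{\Omega}\,L(\bar c_{\alpha})\,k_B T \exp\Big[\frac{\mu_{dif}(R_{+/-}) - \bar\mu_{dif}}{k_B T}\Big] \tag{10}$$

$$R_{+} = R_{ext}; \quad R_{-} = R_n$$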


What we have done is to arbitrarily interpret the difference, which appears on the 
right-hand side (RHS) of Eq. (9d), as a difference between a growth and a decay 
rate [Eq. (10)] and to regard each of the latter as the first-order expansion of an 
exponential close to equilibrium 
($[\mu_{dif}(R_{+/-}) - \bar\mu_{dif}]/k_B T \ll 1$). In agreement with the above 
interpretation, we expect the equilibrium distribution of fluctuations to obey Eq. 
(11). 

$$\bar C_n\,\partial n/\partial t|^{+}_{R_n} = \bar C_{n+1}\,\partial n/\partial t|^{-}_{R_{n+1}}$$

or, 

$$\ln\Big(\frac{\bar C_{n+1}}{\bar C_n}\Big)_{kin} = -(k_B T)^{-1}\Big[\bar\mu_{dif} - \mu_{dif} + \frac{2\sigma\Omega}{R_{n+1}}\Big] + \ln\frac{R_n}{R_{n+1}} \tag{11}$$

The subscript kin reminds us that the equilibrium distribution is that expected 
from Eq. (10). From Eqs. (7) and (8), the equilibrium distribution of fluctuations 
used in CNT is given by Eq. (12) 

$$\ln\Big(\frac{\bar C_{n+1}}{\bar C_n}\Big)_{CNT} = -(k_B T)^{-1}\Big[\bar\mu_{dif} - \mu_{dif} + \frac{2\sigma\Omega}{R_n}\Big] \tag{12}$$

Equations (11) and (12) are identical in the limit of large n ($3n \gg 1$). We conclude 
that the artificial decoupling of the net growth rate [Eq. (9d)] into a difference 
between a growth rate and a decay rate [Eq. (10)], together with the particular way we 
write the latter, provides us with the equilibrium distribution of fluctuations 
used in CNT. Based on Eq. (10), we can now write the master equation, which 
governs the distribution of fluctuations in the CNT: it is identical to Eq. (6) for n 
large enough, but with quite distinct expressions [Eqs. (13a) instead of (13b)] for 
the rate constants. 

$$\beta_{n,n+1} = g_n L(\bar c_{\alpha})\,k_B T \exp\Big[\frac{\mu_{dif} - \bar\mu_{dif}}{k_B T}\Big]; \quad \alpha_{n,n-1} = g_n L(\bar c_{\alpha})\,k_B T \exp\Big[\frac{2\sigma\Omega}{R_n k_B T}\Big] \tag{13a}$$

$$\beta_{n,n+1} = g_n D_1 C_1; \quad \alpha_{n+1,n} = g_n D_1 \bar C_1 \bar C_n/\bar C_{n+1} \tag{13b}$$

Note also that, unlike in CD, the master equation (6) cannot be integrated from 
n = 1 since, for small values of n, our scheme yields a wrong equilibrium 
distribution. But the link between CNT and CD is now established formally. 
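
The statement that the kinetic and CNT distributions agree only for large n can be probed numerically. The Python sketch below is an illustration under assumptions introduced here: reduced units, arbitrary values of the interfacial energy and of the supersaturation `S` (standing for the difference between the diffusion potential in the solid solution and its two-phase equilibrium value), a pure-solute precipitate so that the driving force per atom in Eq. (8) equals `-S`, and a constant L. The kinetic ratio is obtained from detailed balance between the growth and decay rates of Eq. (10); the CNT ratio is obtained from the exact difference of Eq. (8) between sizes n + 1 and n.

```python
import math

KT = 1.0       # k_B*T, reduced units
OMEGA = 1.0    # atomic volume
SIGMA = 1.0    # interfacial energy (hypothetical)
S = 0.5        # mu_dif - mu_dif_bar in the solid solution (hypothetical)

def radius(n):
    """Radius of a sphere containing n solute atoms."""
    return (3.0 * n * OMEGA / (4.0 * math.pi)) ** (1.0 / 3.0)

def geom(n):
    """Geometrical prefactor 4*pi*R_n/Omega of Eq. (10)."""
    return 4.0 * math.pi * radius(n) / OMEGA

def growth_rate(n, mobility=1.0):
    """Growth term of Eq. (10), evaluated with R_+ = R_ext."""
    return geom(n) * mobility * KT * math.exp(S / KT)

def decay_rate(n, mobility=1.0):
    """Decay term of Eq. (10), with the Gibbs-Thomson boundary value at R_n."""
    return geom(n) * mobility * KT * math.exp(
        2.0 * SIGMA * OMEGA / (radius(n) * KT))

def log_ratio_kin(n):
    """ln(C_{n+1}/C_n) from detailed balance between growth and decay."""
    return math.log(growth_rate(n) / decay_rate(n + 1))

def delta_f(n):
    """Eq. (8) written as a function of n, with f_alpha' - f_alpha = -S."""
    return -S * n + 4.0 * math.pi * radius(n) ** 2 * SIGMA

def log_ratio_cnt(n):
    """ln(C_{n+1}/C_n) from Eq. (7)."""
    return -(delta_f(n + 1) - delta_f(n)) / KT

SIZES = (10, 100, 1000, 10000)
GAPS = [abs(log_ratio_kin(n) - log_ratio_cnt(n)) for n in SIZES]
for n, gap in zip(SIZES, GAPS):
    print(f"n = {n:6d}: |kin - CNT| = {gap:.3e}")
```

The gap between the two expressions shrinks roughly as 1/n, which is why the master equation built on Eqs. (13a) should not be integrated from the smallest cluster sizes.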

We now explore Eqs. (13a) for a specific diffusion model, i.e., a specific expres- 
sion for L(c). 


8.1.4.3 A First Application 

Elaborating an atomistic theory of Onsager’s transport coefficients $L_{ij}$ is a 
long-standing issue, on which steady progress is being made [11-14]. Ideally, for a 
nucleation model to be fully self-consistent, the expression of L should be consistent 
with the thermodynamic model used to compute the diffusion potentials $\mu_{dif}$ in 
Eqs. (13a). The simplest case where the above requirement is met is the regular 
solution model, in the point approximation (Bragg-Williams) with a direct exchange 
diffusion mechanism [15]. In this model, L is written as in Eq. (14), where $\nu$ is the 
attempt frequency, z is the plane-to-plane coordination, c is the solute 
concentration in the solid solution, $\mu$ is the diffusion potential in the solid 
solution (a bar means the two-phase equilibrium value), $E_s$ is the binding energy of 
the exchanging pair to the saddle point and the $\varepsilon$'s are the usual pair 
interactions. 








pg 
L= vz(1 —6)'**()'* exp — r 
(14) 
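$$a = u/2\omega; \quad \omega = \varepsilon_{AB} - \frac{\varepsilon_{AA} + \varepsilon_{BB}}{2}; \quad u = \varepsilon_{AA} - \varepsilon_{BB}$$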


This expression for L, when introduced in Eqs. (13a), yields the kinetic parame- 
ters of the master equation and can be used, for example, to estimate the Zeldo- 
vich constant, the incubation time, etc. 

Moreover, one can easily check that in the limit of very low solubility, the rate 
constants we developed for CNT [Eqs. (13a)], together with Eq. (14) for L, do con- 
verge to the classical expression used in CD [Eqs. (13b)]. 

To summarize this section, the expressions we propose for the rates of solute 
impingement and evaporation [Eqs. (13)] provide the correct equation for the 
growth rate of precipitates and the correct equilibrium distribution of 
heterophase fluctuations, and do converge, in the dilute limit, for the direct exchange 
mechanism in the regular solution model, to the classical expressions used in 
cluster dynamics. They are not restricted, however, to that simple case; work 
remains to be done to develop the above formalism in detail for the vacancy 
diffusion mechanism in a multicomponent alloy. The point we wanted to establish is 
that the link of the CNT to the details of atomic jumps is now established, as long 
as the link between the $L_{ij}$'s [Eq. (9b)] and atomic jump frequencies is sorted out. 


8.1.5 
Kinetics of Concentration Fields 


At some scale (see below), the solid solution, together with the coherent precipi- 
tates, can be represented by a three-dimensional concentration field; the sharp 
precipitate/matrix interface is replaced by a boundary layer with a steep but 
smooth variation of the concentrations [16]. Nucleation, growth, and coarsening 
are described by the time evolution of the concentration field. To be more specific, 
a multicomponent alloy with S chemical species and vacancy-mediated diffusion 
is specified by (S + 1) concentrations: that of vacancies, $C_V$, and those of the alloy
components, $C_s$ (s = 1 to S). The scale on which nucleation and growth proceed is
much finer than the mean distance between vacancy sources and sinks; vacancies
are thus conserved. Because of the conservation of lattice sites, the (S + 1)
concentrations sum to unity, so we are left with S independent concentration fields. For the
same reason, we define S diffusion potentials, $(\mu_s - \mu_V)$, where $\mu_s$ is the chemical
potential of component s and $\mu_V$ is the chemical potential of the vacancy. The
chemical fluxes are proportional to the gradients of the diffusion potentials via the
(S × S) Onsager matrix, which is symmetric and positive definite; the fluxes are therefore
specified by S(S + 1)/2 Onsager coefficients, $L_{ij}$ (i, j = 1 to S). The flux of matter
in the lattice frame of reference is given by $J = -L\,\nabla\mu\,(\Omega kT)^{-1} = -D\,\nabla C\,\Omega^{-1}$,
where J is the column vector with elements $J_s$ (s = 1 to S), and similarly for the
diffusion potential $\mu$ and the concentration of atomic species C; $\Omega$ is an atomic
volume. The diffusion matrix is given by the product of the Onsager matrix and the
susceptibility matrix:





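$$D_{ij} = \sum_{k=1}^{S} \frac{L_{ik}}{kT}\,\frac{\partial(\mu_k - \mu_V)}{\partial C_j} \qquad (i, j = 1 \text{ to } S) \qquad (14)$$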
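
As a rough illustration of how the diffusion matrix follows from the product of the Onsager matrix and the susceptibility matrix, the Python sketch below assumes an ideal solution, which the text does not impose. In that case the diffusion potentials are $kT \ln(C_k / C_V)$ up to constants, so that the susceptibility divided by kT is $\delta_{kj}/C_k + 1/C_V$. The function name `diffusion_matrix` and all numerical values are hypothetical, chosen for readability rather than realism.

```python
def diffusion_matrix(onsager, conc):
    """D_ij = sum_k L_ik * d(mu_k - mu_V)/dC_j / kT for an ideal solution (assumed)."""
    n = len(conc)
    c_vac = 1.0 - sum(conc)  # lattice-site conservation fixes the vacancy concentration
    # Susceptibility divided by kT for an ideal solution: delta_kj / C_k + 1 / C_V
    chi = [[(1.0 / conc[k] if k == j else 0.0) + 1.0 / c_vac for j in range(n)]
           for k in range(n)]
    # Matrix product of the Onsager matrix and the reduced susceptibility matrix
    return [[sum(onsager[i][k] * chi[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


# Hypothetical two-component example (arbitrary units, symmetric Onsager matrix)
L = [[2.0e-3, 1.0e-4],
     [1.0e-4, 5.0e-4]]
C = [0.30, 0.10]
D = diffusion_matrix(L, C)
```

Note that D is in general not symmetric even though the Onsager matrix is, since the susceptibility matrix multiplies it on one side only.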


The time evolution of the concentration field, C, is given by the conservation 
equation, Eq. (15). 


$$\frac{\partial C}{\partial t} = -\nabla \cdot J \qquad (15)$$


As discussed in Chapters 7 and 10, for Eq. (15) to give the equilibrium concentra- 
tion field at infinite time, the chemical potentials must include a contribution of 
the inhomogeneity of the concentration field: to first order in inhomogeneity, the 
latter reduces to a term proportional to the square of the concentration gradient,
first introduced in this field of physics by Cahn; this term can be viewed as the
second moment of the interatomic potential [17]. 
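
A minimal one-dimensional sketch of the conservation equation, Eq. (15), with a gradient contribution to the chemical potential, may help fix ideas. It is written for a single concentration field with a scalar mobility standing in for the Onsager coefficient, a double-well free energy density $f = C^2(1 - C)^2$, a gradient energy $(\kappa/2)|\nabla C|^2$, and periodic boundaries; all of these choices, as well as the function names and parameter values, are assumptions made for illustration only.

```python
import math


def chemical_potential(c, kappa, dx):
    """Local term f'(C) plus the gradient term -kappa * laplacian(C), periodic chain."""
    n = len(c)
    mu = []
    for i in range(n):
        lap = (c[(i + 1) % n] - 2.0 * c[i] + c[i - 1]) / dx**2
        f_prime = 2.0 * c[i] * (1.0 - c[i]) * (1.0 - 2.0 * c[i])  # f = C^2 (1 - C)^2
        mu.append(f_prime - kappa * lap)
    return mu


def step(c, dt, dx=1.0, mobility=1.0, kappa=1.0):
    """One explicit Euler step of dC/dt = -div J, with J = -mobility * grad(mu)."""
    n = len(c)
    mu = chemical_potential(c, kappa, dx)
    # Flux across the bond between site i and site i + 1
    flux = [-mobility * (mu[(i + 1) % n] - mu[i]) / dx for i in range(n)]
    # Divergence: flux leaving through bond i minus flux entering through bond i - 1
    return [c[i] - dt * (flux[i] - flux[i - 1]) / dx for i in range(n)]


# Unstable solid solution with a small sinusoidal composition modulation
c0 = [0.5 + 0.01 * math.sin(2.0 * math.pi * i / 32) for i in range(64)]
c = list(c0)
for _ in range(2000):
    c = step(c, dt=0.01)
```

Because the update is written in flux form, the total solute content is conserved to rounding error, while the modulation amplitude grows, as expected inside the spinodal of the assumed free energy.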

As will be seen below, because the effect of external forcing is best described at 
the atomic scale, it is important to clarify at which scale the above concentration 
field is defined. 

The classical phase field approach defines the concentration by a coarse grain- 
ing procedure: the volume is divided into cells within which the details of the 
atomic configuration are ignored. C is defined at the coarse grain scale. Equation 
(15) can be made dimensionless, the length scale being given by the cell size. 
However, the size of the cell plays a key role whenever fluctuations are to be taken 
into account [18-20], which is the case for driven alloys. 

Another approach, sometimes called the “lattice phase field” approach, was
first proposed, to our knowledge, by Vaks and coworkers [21, 22]. Here the
concentration at each time is defined on each single site as the first moment of the
(time-dependent) probability of occupation of that site by the various species. This
approach allows a direct link to be made between Eq. (15) and the atomic jump
frequencies, whatever their origin (thermal fluctuations and/or external forcing).

As an illustration, Fig. 8.2 shows the concentration profiles in a two-phase 
model superalloy, as observed by 3DTAP and as modeled by LKMC. The
concentration here is defined by averaging over many precipitates, and by a weighted
average over several neighboring sites. With that procedure, a direct link can be
established between observations (with 3DTAP), simulations (using the LKMC technique)
at the atomic scale, and the theory of diffusion [23].

Fig. 8.2 Decomposition of Ni (5.2 at.% Al, 14.2 at.% Cr) solid solutions at 873 K:
concentration of chromium as a function of distance from the γ/γ′ iso-ordering
interface, when the mean precipitate radius ⟨R⟩ reaches 1.25 nm, as observed from
3DTAP and as simulated by LKMC, using two sets of parameters; LKMC1 reproduces all
the features of the experimentally observed kinetic pathway well, while LKMC2, which
generates the same alloy thermodynamics but distinct diffusion correlation effects in
the ternary alloy, yields much sharper interfaces; this is shown to have noticeable
effects on the early-stage decomposition microstructure [from ref. 23].


8.1.6
Conclusion 


In the absence of any macroscopic formalism for taking into account external 
forcing, one is left with modeling driven alloys from the atomic scale, where the
mechanism of forcing is understood. Any reliable modeling must be such that, 
in the absence of forcing, a nonequilibrium state will decay to the proper equilib- 
rium state via a realistic kinetic pathway. Section 8.1 was aimed at presenting (in 
the simplest case of coherent transformations) the limitations of the classical ap- 
proaches to nucleation, growth, and coarsening, when we want to make the link 
between the phenomenological kinetic coefficients they rely on and the atomic 
jump frequencies one would like to start from. Important improvements were 
achieved in the last decade; much work remains to be done. 


8.2 
Driven Alloys 


In the first part of this chapter, we have considered materials that, once brought 
into some nonequilibrium state, are relaxing toward their equilibrium state 
through diffusion-controlled processes. For the sake of simplicity here we will 
refer to these processes as intrinsic. In many practical situations, however, mate- 
rials are driven and maintained away from equilibrium by some external forcing, 
as already indicated in the Introduction. The external forcing may not only affect
the rate of the intrinsic dynamics, for instance by creating an excess of point
defects; it often also introduces additional dynamical processes, which
we will refer to as extrinsic. We refer to materials under external forcing as
“driven system materials”, or, more concisely, as “driven alloys”, since, in most
situations of interest, materials contain more than one chemical species, and the 
spatial distribution of these species is modified by the sustained external forcing. 
Under appropriate conditions, extrinsic dynamical processes compete with in- 
trinsic ones, and, as a result of this competition, the material may undergo 
nonequilibrium phase transitions and microstructural evolutions, as well as self- 
organization reactions. These evolutions are not driven by thermodynamic forces, 
and, therefore, they cannot be predicted or rationalized by equilibrium thermody- 
namics and near-equilibrium relaxation kinetics. In 1997, we published a compre- 
hensive account of a general framework specifically designed to study driven al- 
loys [24]. The two main types of forcing discussed in detail in that review were 
the sustained irradiation of alloys with energetic particles, and the processing of 
powders by high-energy ball milling. We showed that, despite the diversity of the 
external forcing situations, the response of these materials could be studied, un- 
derstood, and to some extent predicted by relying on one common approach. The 
starting point of this framework is to establish a consistent kinetic description 
that encompasses all the relevant dynamical processes, from the intrinsic to the 
extrinsic ones. The spatio-temporal evolution of these materials, with several
dynamics in parallel, can then be analyzed by applying and extending concepts and
tools developed in the physics community for the study of dynamical dissipative 
systems [10, 25-32]. 

In the past decade, our knowledge of driven alloys has grown significantly in 
terms of the range of situations where the above framework has been applied, as 
well as in terms of our fundamental understanding of the evolution of these
materials. This growth is partly reflected in the contributions to a
Viewpoint Set on “Materials under external forcing”, published in 2003 in
Scripta Materialia [33]. In the past few years, additional important results have
been published, and thus we provide here an up-to-date review of the salient fea- 
tures of driven alloys, with an emphasis on the recent experimental, theoretical, 
and computer simulation results. We start by providing examples of driven alloys 
in Section 8.2.1. We then discuss the concepts of control parameters and steady- 
state dynamical equilibrium phase diagrams in Section 8.2.2. Theoretical ap- 
proaches and simulation techniques are reviewed and illustrated in Section 
8.2.3, and in Section 8.2.4 we discuss the role of external length scales in
self-organization reactions. Finally, in Section 8.2.5 we present some practical
applications of the driven alloy framework, and we indicate some open questions that
deserve attention in the future.


8.2.1 
Examples of Driven Alloys 


The initial questions that led to the development of the driven alloy framework 
[34, 35] were motivated by puzzling experimental observations of alloys subjected 
to sustained irradiation by energetic projectiles, such as electrons, ions, and neu- 
trons, and in alloys subjected to sustained plastic deformation during fatigue. The 
intriguing point is that one material subjected to a given type of forcing can 
undergo one type of evolution or its very opposite, depending upon the exact 
forcing conditions. Typical examples, which are reviewed in ref. [24], include
precipitation-dissolution [36], order-disorder [37], and crystal-to-amorphous [38]
reactions. Later, similar paradoxical observations were reported for powders
processed by high-energy ball milling and for thin films grown from a vapor phase.
In the case of alloys under irradiation, Adda et al. [34] proposed to rationalize
these effects by introducing a generalized phase diagram, which, in addition to
thermodynamic variables, includes the irradiation flux, a specific example of
forcing intensity. More recently, it has become clear that, in addition to the forcing
rate(s), it is also important to identify and include the characteristic lengths of 
the elementary intrinsic and extrinsic processes. From that perspective, we review 
below important examples of driven alloys. 


8.2.1.1 Alloys Subjected to Sustained Irradiation 

We first focus on the case of metallic alloys under irradiation. This choice is
motivated by three factors, quite apart from the practical interest of this question.
First, this situation is the one for which the driven alloy theory is the most
advanced, since the atomic-scale physical processes involved in particle-matter
interactions are well understood. Second, our ability to perform controlled
experiments in this field makes it possible to compare experimental results critically
with predictions from analytical models and computer simulations. Third, this 
situation is also interesting from a pedagogical viewpoint since it lends itself to a 
progressive introduction of the concepts and tools useful for the study of driven 
alloys. 

The study of the elementary effects induced by the slowing down of energetic 
projectiles in solids is a mature field (see ref. [39] for a recent review). An ener- 
getic projectile, or a target atom that has been set in motion, may lose energy by 
interacting with the nuclei of the target or its electronic system (for charged par- 
ticles), or through resonant nuclear reactions. Since we are mostly concerned 
with metallic alloys here, we restrict our discussion to nuclear interactions, 
though electronic excitations can be relevant as well at very high projectile
energies, typically >1 MeV amu$^{-1}$. Nuclear energy loss, which results from elastic
collisions, and its resulting primary recoil spectrum can be calculated using
universal expressions for the nuclear stopping power, which rely on a binary collision
approximation. In crystalline targets, when the kinetic energy transferred to an
atom exceeds a threshold energy $E_d$, typically 25 eV for metals, this atom is
displaced from its initial lattice site, thus creating a vacancy. At low recoil energy,
single Frenkel pairs are produced: one interstitial atom is created near the
vacancy, so as to keep the total number of atoms constant. For higher recoil
energies, the primary knock-on atom may displace several other target atoms. In
particular, for recoil energies above ~1 keV, a cascade of displacement reactions
takes place, resulting in the production of many vacancies and interstitials. The
modified Kinchin-Pease formula, also known as the NRT formula, was
introduced to determine the number of Frenkel pairs $n_{\mathrm{FP}}$ produced in such cascades
as a function of the damage energy $E_D$, which is defined as the fraction of the
recoil energy involved in nuclear collisions [Eq. (16)].


$$n_{\mathrm{NRT}} = \frac{0.8\,E_D}{2E_d} \qquad (16)$$


However, it is now well recognized that the binary collision approximation fails to 
account properly for the interactions and recombination of defects during the few 
picoseconds of the cascade lifetime [39-41]. As a result of these many-body inter- 
actions, the actual number of Frenkel pairs produced is lower than that predicted 
by the NRT formula (see Fig. 8.3). Bacon and coworkers [42, 43] obtained a
general expression [Eq. (17)] by fitting the results of molecular dynamics simulations.


$$n_{\mathrm{FP}} = A\,(E_D)^{m} \qquad (17)$$


Here $E_D$ is expressed in keV, and $A \approx 5$ and $m \approx 0.75$ are weakly material-dependent
constants. In addition, a significant fraction of these defects may agglomerate
and form interstitial or vacancy clusters, so that the number of defects free to
migrate in the material at the end of the displacement cascade is reduced even
further. We note also that when the recoil energy exceeds a certain value, typically
≈ 50 keV for metals, a cascade splits into subcascades; provided correlation
effects between subcascades may be ignored, it is sufficient to evaluate the damage
produced by recoil energies less than that subcascade formation threshold.
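
To give a feeling for the size of the correction, the short Python sketch below evaluates the NRT estimate of Eq. (16) and the power-law fit of Eq. (17) side by side. It uses the typical values quoted in the text (a threshold energy of 25 eV, A ≈ 5, m ≈ 0.75); the function names are ours, and the numbers should be read as orders of magnitude only, since A and m depend weakly on the material.

```python
def n_nrt(damage_energy_kev, threshold_ev=25.0):
    """NRT estimate, Eq. (16): 0.8 * E_D / (2 * E_d), with E_D converted from keV to eV."""
    return 0.8 * damage_energy_kev * 1000.0 / (2.0 * threshold_ev)


def n_fp_fit(damage_energy_kev, a=5.0, m=0.75):
    """Power-law fit to MD results, Eq. (17): A * E_D**m, with E_D in keV."""
    return a * damage_energy_kev ** m


# Compare the two estimates below the subcascade formation threshold
for e_d in (1.0, 10.0, 50.0):
    ratio = n_fp_fit(e_d) / n_nrt(e_d)
    print(f"E_D = {e_d:5.1f} keV: NRT = {n_nrt(e_d):6.1f}, "
          f"fit = {n_fp_fit(e_d):6.1f}, ratio = {ratio:.2f}")
```

With these round numbers, a damage energy of 1 keV gives 16 Frenkel pairs according to the NRT formula against 5 according to the fit, and the ratio decreases further as the damage energy increases, because the fit grows sublinearly.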

In addition to the production of point defects, nuclear collisions also result 
in the forced relocation of atoms from their initial site to some final location. At 
low recoil energy, these atomic replacements occur in replacement collision se- 
quences (RCS) along dense crystallographic directions, while, at higher energies, 
they result from the dynamically correlated transport of atoms within the dis- 
placement cascade. The number of replacements per Frenkel pair produced 
ranges from a few (at low recoil energy) to more than a hundred at high energies.
These replacements play a critical role in driving alloys away from equilibrium: 
they may result in forced atomic mixing, and chemical disordering if the initial 
microstructure is ordered. Since the kinetic energy per atom in a cascade during 
most of its lifetime exceeds ~1 eV, it is much larger than typical enthalpy 
changes involved in phase transformations, ~ 0 to 0.1 eV, and, as a first approxi- 
mation, one may assume that these replacements are ballistic, i.e., unaffected by 
the thermodynamics of the irradiated alloy [44]. This simplification may break 
down in alloy systems with a very large enthalpy of mixing, $\Delta H_m$. For large and
negative $\Delta H_m$, the core of the cascade may retain some chemical short-range
order (e.g., Ni₃Al [45, 46]). For large and positive $\Delta H_m$, little or no mixing at all
is observed in molecular dynamics (MD) simulations. Within the liquid-like 
model introduced by Averback and coworkers for displacement cascades [39], lit- 
tle or no forced mixing is expected to take place in alloy systems that are immis- 
cible in the liquid state, such as Ni-Ag, Fe-Ag, Cu-Mo, and Cu-W. This rule is 
in good agreement with experimental findings, and with MD simulations [47, 
48]. Finally, in certain alloy systems the disorder induced by nuclear collisions 
leads to the formation of amorphous regions [39]. 

The atomic mixing and disordering forced by displacement cascades is charac- 
terized by a rate, the rate of forced replacements, but also by two length scales. 
Firstly, a cascade extends over a finite region, with an average diameter L ranging 
from about 1 to 10 nm. Cascade sizes can be measured indirectly by dark-field 
transmission electron microscopy, using alloys that are long-range ordered, e.g., 
Cu₃Au, and assuming that a cascade transforms fully into a chemically
disordered zone [49]. These sizes are in good agreement with those directly
determined from MD simulations [39]. We will indicate in Section 8.2.5 that L plays a
determining role in the possible self-organization of the chemical order in irradi- 
ated alloys. Secondly, within a cascade, atomic relocations are characterized by a 
distance R, or more precisely by a distribution of distances. MD simulations indi- 
cate that most atoms are relocated to first-nearest-neighbor sites. Some atoms, 
however, are relocated further away [39]. This distribution was investigated in
detail for a Cu₅₀Ag₅₀ alloy subjected to ion irradiation, with ion masses ranging
from low to high, and the distribution of relocation distances was found to follow
an exponential decay [50]. Moreover, the decay length increased from half a
nearest-neighbor distance for light-ion irradiation to about one nearest-neighbor
distance for heavy-ion irradiation. While this variation is rather modest, it will be
shown in Section 8.2.4 that it may nevertheless be sufficient to trigger
compositional patterning. L and R characterize the spatial correlations of the forced
mixing, and, in the context of dissipative systems, these length scales relate to the
structure of the “noise,” or fluctuations, introduced by the external forcing.

Fig. 8.3 Total number of vacancies (empty circles) and number of
isolated vacancies (solid circles) as a function of the primary recoil
energy from MD simulations using a generic Ni interatomic potential.
The mixed line indicates the defect production predicted by the NRT
formula [see Eq. (16)]. The solid triangles give the total number of
replaced atoms, to be read on the right-hand ordinate (after ref. [41]).

As a result of the elementary effects described above, an alloy under sustained 
irradiation is under continuous siege by many processes. Point defects and point 
defect clusters are produced, migrate through thermally activated processes, dis- 
appear by interstitial-vacancy recombination and by elimination in sinks such as 
grain or phase boundaries, surfaces, or dislocations; chemical species undergo 
forced relocation and thermally activated migration. The lifetime of displacement 
cascades is typically a few picoseconds, a time scale much shorter than those in- 
volved in thermally activated diffusion, even at high irradiation temperatures. 
This decoupling in time scale makes it possible to simplify the kinetics of the sys- 
tem and assume that the dynamics resulting from forced atomic mixing and de- 
fect production takes place in parallel with the internal dynamics of the alloy, by 
thermally activated jumps of point defects. Nearly all kinetic models for alloys 
under irradiation rely on such a superposition principle, where the total rate of 
evolution of any particular variable describing the alloy microstructure is written 
as the sum of the rates due to each contributing process. Whereas this approach 
appears to be well founded for atomistic models, one should realize that prob- 
lems can arise in continuum descriptions, since they rely on quantities averaged 
in time over many events. A first example is found in assessing the flux of point
defects which are eliminated into a sink. In this case, the rate of point defect
elimination in sinks is a function of the recombination rate between defects [51, 
52], and more generally is influenced directly by the presence of other sinks, in 
particular biased ones [53]. Another example is the possible modification of diffu- 
sion coefficients by the forced ballistic jumps [54]. When any given atom under- 
goes alternately thermally activated and ballistic jumps, at similar rates, standard 
expressions for diffusion coefficients miss certain coupling effects between the 
two types of jumps. 
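
The superposition principle can be made concrete with a textbook-style mean-field balance for point defects, in which the total rate of change of each concentration is the sum of a production term, a recombination term, and a sink term. The Python sketch below is only an illustration under stated assumptions: the form of the equations, the dimensionless parameter values, and the function names are ours, and the sink terms are taken to be independent of recombination, which is precisely the simplification that the caveats above call into question.

```python
def defect_rates(c_v, c_i, production, recomb, sink_v, sink_i):
    """Total rates written as a sum of the contributions of each process."""
    recombination = recomb * c_v * c_i
    dcv_dt = production - recombination - sink_v * c_v
    dci_dt = production - recombination - sink_i * c_i
    return dcv_dt, dci_dt


def integrate(production=1.0, recomb=1.0, sink_v=0.5, sink_i=2.0,
              dt=1.0e-3, n_steps=50_000):
    """Explicit Euler integration from a defect-free state (dimensionless units)."""
    c_v = c_i = 0.0
    for _ in range(n_steps):
        dcv_dt, dci_dt = defect_rates(c_v, c_i, production, recomb, sink_v, sink_i)
        c_v += dt * dcv_dt
        c_i += dt * dci_dt
    return c_v, c_i


c_v, c_i = integrate()
```

At steady state in this simple model, vacancies and interstitials are eliminated at sinks at equal rates, so the slower species (here the vacancy, with the smaller sink coefficient) builds up the larger concentration.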

A first effect of irradiation is to increase intrinsic atomic diffusion proportion- 
ally to the ratio of nonequilibrium vacancy concentration to its thermal equilib- 
rium value. It is important to emphasize that, in a first approximation, this dy- 
namics is controlled by the thermodynamics of the alloy, i.e., the gradient of 
chemical potential. As noted by Adda et al. [34], the presence of large point defect 
supersaturations may, however, modify the chemical potential of other species. 
Second, the presence of interstitials adds another process which contributes to 
atomic diffusion. Self-interstitials have quite small migration energies in most 
metallic systems, from 0.1 to 0.3 eV, and MD simulations indicate that small 
interstitial clusters may even migrate almost athermally [55]. In some alloy 
systems, however, large interstitial migration energies have been reported, some- 
times even approaching the vacancy migration energies [56, 57]. The thermody- 
namic potentials controlling the evolution of vacancy and interstitial fluxes are 
different, as well as their coupling to chemical fluxes. Interstitials can thus lead 
to the formation of nonequilibrium phases distinct from those stabilized by va- 
cancy fluxes. Finally, a third dynamics is the forced atomic mixing produced by 
nuclear collisions. Within the approximation of ballistic jumps, the atoms in- 
volved as well as the direction of the jumps are randomly sampled. Such a forced 
dynamics will tend to erase composition gradients, and it is thus equivalent to an 
“infinite-temperature” dynamics. Systems with infinite-temperature dynamics 
have sometimes been considered in the nonequilibrium statistical physics com- 
munity, in particular for recovering hydrodynamic equations [58], or as a generic 
case of nonequilibrium dynamical systems [59]. 
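
The homogenizing effect of ballistic jumps can be illustrated with a toy simulation in which randomly chosen nearest-neighbor pairs are exchanged without any reference to the energetics. The Python sketch below uses a one-dimensional periodic chain of two species; the chain length, the number of exchanges, and the function names are hypothetical, and a real cascade would of course relocate atoms in three dimensions and over a distribution of distances.

```python
import random


def ballistic_mixing(lattice, n_exchanges, rng):
    """Swap randomly chosen nearest-neighbor pairs, ignoring energetics."""
    n = len(lattice)
    for _ in range(n_exchanges):
        i = rng.randrange(n)                 # randomly sampled atom
        j = (i + rng.choice((-1, 1))) % n    # randomly sampled direction, periodic chain
        lattice[i], lattice[j] = lattice[j], lattice[i]
    return lattice


def half_compositions(lattice):
    """Fraction of A atoms (coded 1) in the left and right halves of the chain."""
    half = len(lattice) // 2
    return sum(lattice[:half]) / half, sum(lattice[half:]) / (len(lattice) - half)


rng = random.Random(0)
chain = [1] * 20 + [0] * 20      # fully phase-separated A|B chain
before = half_compositions(chain)
ballistic_mixing(chain, 200_000, rng)
after = half_compositions(chain)
```

Since every exchange is accepted regardless of the local environment, the dynamics conserves the number of atoms of each species while erasing the initial composition step, which is the sense in which it mimics an infinite temperature.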

Depending upon the alloy and the irradiation conditions, these various dy- 
namics may work synergistically, or they may compete with one another. Under 
sustained irradiation conditions, it is commonly observed that the composition 
and chemical order fields reach a stationary state for a sufficiently large irradia- 
tion dose. More accurately, these states are often only quasi-steady states, since 
other elements of the microstructure, such as the density and the distribution of 
dislocations, are still slowly evolving. An important question is whether, for a 
given set of irradiation conditions, the same steady state is reached regardless of 
the initial conditions. While this property holds for alloy systems at thermal
equilibrium (even though the system may be trapped in a metastable state for a
very long but finite time), there is no principle that guarantees that the property
holds for nonequilibrium dissipative systems [60, 61]. In nearly all the experi- 
mental results available to date, however, it appears that alloys under irradiation 
do reach steady states which are independent of their initial state [24, 62, 63].
One exception, reported during the electron irradiation of an ordered Ni₄Mo alloy,
may be due to the very slow coarsening of the D1ₐ long-range ordered phase at
rather low temperatures [64]. A large part of the theoretical effort presented in 
this chapter focuses on the prediction of these steady states. The practical
importance of this knowledge should not be underestimated, since one crucial question in
the nuclear industry is the assessment of the long-term stability of engineering 
materials in irradiation environments, over periods of time ranging from 50 years 
(nuclear reactor) to thousands of years (nuclear waste storage). The first such 
steady-state diagrams for alloys under irradiation were reported by Barbu and 
Martin for the Ni-Si system [65], and by Cauvin and Martin for the Al-Zn sys- 
tem [9]. 

Before presenting, in Sections 8.2.3 and 8.2.4, the techniques that can be used to
determine steady states stabilized by irradiation or other external forcing, it is
useful to introduce some simple qualitative arguments on the evolution of
materials under the three dynamics identified above. If one ignores for the time being
the interstitial-mediated dynamics, one can distinguish three regimes [33,
34].

At high temperatures, the vacancy-mediated dynamics is dominant, and the 
alloy is expected to evolve toward a steady state which is close to its equilibrium 
state. Irradiation can in fact be used to accelerate the rate of relaxation toward 
equilibrium, for instance, to achieve a high degree of chemical order [36, 66], 
one that would require prohibitively long thermal annealing in the absence of ir- 
radiation. 

At low enough temperatures, on the other hand, the evolution of the alloy mi- 
crostructure is dominated by the forced jumps, and if they are ballistic, they will 
drive any alloy toward a chemically disordered and homogeneous state. The
dynamical stabilization of a solid solution against phase separation is observed, for
instance, in Ag-Cu multilayers undergoing ion-beam mixing at low temperatures,
typically below room temperature [63]. As discussed above, however, no mixing, 
or only a little, is observed in alloy systems with large positive heats of mixing, 
such as Ag-Co, Ag-Ni, Ag-Fe, or Cu-Mo [39, 67, 68]. In alloy systems which 
undergo both decomposition and ordering at equilibrium, the dissolution and 
disordering of ordered precipitates have also often been observed during
low-temperature irradiations [37]. Recent work which combined atom probe field ion
microscopy (APFIM) with TEM has revealed that, in the Ni-Al system, disordering
may or may not precede dissolution, depending upon the irradiation conditions 
[69, 70]. This result is in agreement with an earlier prediction [71] based on a 
mean-field kinetic model [24]. Matsumura et al. [72] have also reached a similar 
conclusion using phase field modeling. 

At intermediate temperature, the two dynamics are of similar magnitude and 
their competition may drive the alloy into steady states which have no counterpart 
in the equilibrium phase diagram. In Section 8.2.4, for instance, we discuss pat- 
terning reactions which develop in this intermediate temperature regime in the 
Ag-Cu and Cu-Co systems. It should be clear also that the boundaries between 
these three regimes depend upon the irradiation conditions, and in particular
upon the displacement and replacement rates, since the competition between
ballistic and intrinsic dynamics is determined by the jump frequency of forced
jumps relative to thermal ones. 
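
The way the three regimes shift with temperature can be sketched by comparing a fixed forced jump frequency with an Arrhenius thermal jump frequency. In the Python sketch below, the attempt frequency, the effective activation energy, and the ballistic frequency are hypothetical values, and the whole temperature dependence of the thermally activated jumps is lumped into a single effective activation energy. Under irradiation this is a strong simplification, since the vacancy supersaturation itself depends on the displacement rate and on the sink density.

```python
import math

K_B = 8.617333e-5  # Boltzmann constant in eV/K


def thermal_jump_frequency(temperature, attempt_frequency=1.0e13, activation_ev=1.0):
    """Arrhenius jump frequency in 1/s; both parameters are hypothetical."""
    return attempt_frequency * math.exp(-activation_ev / (K_B * temperature))


def forcing_ratio(temperature, ballistic_frequency=1.0e-3):
    """Ratio of forced (ballistic) to thermally activated jump frequencies."""
    return ballistic_frequency / thermal_jump_frequency(temperature)


for temperature in (250.0, 350.0, 600.0):
    print(f"T = {temperature:5.0f} K: ratio = {forcing_ratio(temperature):.2e}")
```

A ratio much larger than one corresponds to the low-temperature, ballistically dominated regime, a ratio much smaller than one to the high-temperature regime, and the intermediate regime lies where the two frequencies are comparable. Raising the ballistic frequency moves that crossover to higher temperature.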

Let us now turn to situations where the contribution of ballistic jumps is negli- 
gible, which usually requires an elevated irradiation temperature. Due to the sus- 
tained production of point defects, which takes place homogeneously in the solid, 
and their subsequent elimination in localized sinks, irradiation leads to the build- 
up of permanent nonzero net fluxes of point defects. These fluxes are responsi- 
ble, in particular, for the development of the defect microstructure, including dis- 
location loops and voids. Since the present chapter is focused on alloys, we will 
not discuss these microstructural evolutions per se. Defect fluxes, however, may 
couple to chemical fluxes, and thus result in the preferential transport of certain 
chemical species to sinks, the so-called inverse Kirkendall effect. One well-known 
and technologically relevant outcome of these couplings is the phenomenon of 
radiation-induced segregation (RIS) [73-81]. A remarkable fact is that this
coupling may lead to chemical enrichments, either locally at sinks [82, 83] or
homogeneously [9], and these enrichments can be large enough to lead to the precipitation of
second phases, even when the nominal composition of the alloy is such that the
material is an undersaturated solid solution at equilibrium. The production and the
migration of interstitials also introduce a new pathway for alloy evolution. For in- 
stance, a solid solution at thermal equilibrium may develop precipitates through 
the preferential transport by interstitials of a chemical species with a negative size 
effect. These evolutions can be captured by the rate equations discussed below in 
Section 8.2.3. 

We now return to the more general case where the three dynamics operate to- 
gether. A wealth of microstructural evolutions is possible, and is indeed observed. 
While it is often possible to rationalize experimental results by invoking effects 
produced by the three types of dynamics, it is often challenging to predict these 
evolutions a priori, or to extrapolate known results to other irradiation
environments, because these evolutions are quite sensitive to the details of each
dynamics and to their interplay. We will see in particular in Section 8.2.3 that,
in the control variable space, there exist boundaries that delimit very different 
kinetics and steady states. As a result, a small variation of irradiation temperature 
or of the alloy composition may lead to dramatic changes in the microstructure 
which will evolve under irradiation. It is proposed, however, that kinetic modeling 
based on the driven alloy theory, coupled with selected experiments, has the po- 
tential to make safe predictions and extrapolations. 

While the theory of alloys driven by irradiation was initially developed for me- 
tallic alloys, many results on nonmetallic systems subjected to irradiation can be 
rationalized within the same theoretical framework. In silicon, in semiconductor 
compounds, and in oxides, irradiation can lead to the amorphization of initially 
crystalline phases [84, 85]. Even though amorphization may occur directly in dis- 
placement cascades or by accumulation of disorder [38, 86], we note that the exis- 
tence of critical temperatures above which amorphization no longer takes place 
has been reported for several systems [24]. The presence of these critical
temperatures suggests that, near these temperatures, the steady-state
microstructure results from a dynamical competition between irradiation damage and
thermal recovery [87-91]. Irradiation effects in ceramic materials have received 
renewed attention since these materials may be used for nuclear waste immo- 
bilization and in Generation-IV fission reactors. Recent experimental results on 
irradiation-induced amorphization in A₂B₂O₇ pyrochlore compounds
demonstrated the existence of such a critical temperature for amorphization [92, 93].
This critical temperature varies with the composition of the compound, in partic- 
ular the ratio of the atomic radii of the A and B cations, and with the irradiation 
conditions. Recent MD simulations of damage production in La₂Zr₂O₇
pyrochlore [94] supported the idea that amorphization results from the accumulation
of cation antisites. This ensemble of results therefore suggests that one should be 
able to predict the outcome of an irradiation in such compounds by modeling the 
dynamical competition between antisite production by nuclear collisions and 
antisite elimination by thermally activated processes. A similar approach has 
also been proposed for the case of irradiated SrTiO₃ perovskites [95].

Unexpected irradiation-induced phase transformations have also been reported 
for carbon nanostructures such as buckyballs and nanotubes (see ref. [96] for a 
review). In particular, it has been reported that electron irradiation in transmis- 
sion electron microscopes operating at 400 and 1250 keV can alter the respective 
stability of graphite and diamond phases [97, 98]. As shown in Fig. 8.4, in the pa- 
rameter space spanned by the electron dose rate and the irradiation temperature, 
irradiation at intermediate temperature induces the growth and the stabilization 
of the diamond phase over the graphite one. Key to this inversion of stability is 
the fact that the average displacement threshold energy is lower in the graphite
phase, largely because it is easy to displace atoms along the c direction due to the
sp² bonding. Zaiser and Banhart [97, 98] have demonstrated that the above
effects cannot be simply rationalized by a higher defect concentration in the
graphite phase under irradiation. Indeed such an argument, if it were to account for
the stabilization of diamond at intermediate temperature, fails to predict a
restabilization of the graphite phase at even lower temperatures. Banhart and Zaiser
have proposed a kinetic model where interstitials produced near a
graphite/diamond interface feed one phase or the other, depending upon the irradiation
conditions. Following an approach based on a master equation (see Section 8.2.3 and
Chapter 5), they calculated an effective free energy for the graphite-diamond
system and showed (see Fig. 8.4) that this effective free energy can be used
quantitatively to calculate the stability boundaries of these phases under irradiation at
steady state, in excellent agreement with experimental results.

Fig. 8.4 Experimentally determined nonequilibrium phase diagram
(irradiation intensity versus temperature) for carbon under irradiation
with 1250 keV electrons. Open circles: diamond growth; black squares:
graphite growth; full line: theoretical curve (from ref. [98]).

Surfaces of materials subjected to energetic ion beams constitute another re- 
lated example of materials driven by irradiation. Clearly, the projectile energy 
does not need to be as large as for bulk irradiation, and typical projectile energies 
range from a few electronvolts to a few kiloelectronvolts. While the low-energy 
range is used in beam-assisted deposition to provide additional mobility to ad- 
atoms and suppress the roughness of growing films, the higher energies can 
lead to morphological instabilities of initially planar surfaces. In particular, one- 
dimensional or two-dimensional ripples have been reported. It has been proposed 
to take advantage of these instabilities and self-organization reactions during the
synthesis of functional surfaces. For the sake of conciseness, we will not discuss
surfaces under irradiation further; several recent papers and reviews
are available (see, e.g., ref. [99]). It is also interesting to note that ripple formation 
and other surface instability reactions may take place unintentionally, for instance 
during the preparation by ion milling of thin foils suitable for TEM. 


8.2.1.2 Alloys Subjected to Sustained Plastic Deformation 

Plastic deformation provides another way to drive materials away from equilib- 
rium, and it is commonly found in many materials processing techniques, in- 
cluding rolling, extrusion, drawing, friction-stir welding, high-energy ball milling, 
and laser shocking (also called laser peening). In addition, materials in service 
may also be subjected to plastic deformation, for instance in fatigue and during 
frictional wear. Our focus, as in Section 8.2.1.1 above, is on the steady states 
reached in alloys subjected to sustained plastic deformation. As for irradiated ma- 
terials, there is a wealth of experimental observations demonstrating that these 
systems can be driven into nonequilibrium states (see ref. [24] for some exam- 
ples). Under sustained plastic deformation, precipitates can be disordered and 
dissolved, crystalline phases may become amorphous, and the composition and 
the degree of order may organize spontaneously into patterns. These reactions 
are in addition to other well-documented nonequilibrium evolutions which take 
place even in pure systems, such as dislocation patterning [100-104] and dy- 
namic recrystallization [105] (see also ref. [106] for additional references on pat- 
terning in driven alloys). 

In the late 1980s, Martin and Gaffet pointed out that powder materials processed
by high-energy ball milling should be regarded as driven alloys, due to
the repeated fracture and deformation of the powders during processing, all
events which force changes of the local environment of atoms, in a way reminiscent
of ballistic jumps. Several examples of phase transformations induced by
this sustained plastic deformation were discussed in detail in our previous review 
[24]. In particular, detailed analysis of the extensive results available for the 
crystal-to-amorphous transitions in Ni-Zr compounds and order-disorder transi- 
tions in B2 Fe-Al compounds induced by high-energy ball milling revealed the 
following important points. First, after extended milling times, these alloy sys- 
tems always reach a stationary state, e.g., as defined by the degree of long-range 
order, which is furthermore independent of the initial state. Secondly, the transi- 
tions do not correspond to a minimization of the equilibrium free energy of the 
materials, even by imposing some constraint on the free energy. Thirdly, there 
exist boundaries in the dynamical equilibrium phase diagrams separating these 
steady states; thus dynamical phase transitions, i.e., transitions from one steady 
state to another, can be induced by varying the temperature and milling parame- 
ters, in particular the milling intensity (see Section 8.2.2). We showed in our pre- 
vious review that these steady states and dynamical transitions, as well as their 
kinetics, could be fully rationalized by extending the framework initially intro- 
duced for alloys under irradiation. Additional results published since then have 
been rationalized successfully as well [107-111]. 

Another important parallel was brought to the materials science community’s 
attention by Rigney and coworkers [112-114]. They showed that the microstruc- 
tures found in powders processed by high-energy ball milling were very similar to 
those observed in the mechanically mixed layers which form at the contact be- 
tween bodies undergoing frictional wear, also referred to as transfer layers or 
third bodies. Phase transformations induced by wear are in fact well known, and 
one classical example is the destabilization of pearlite to the benefit of carbon- 
supersaturated bcc solid solution at the surface of rail tracks [115]. This phase 
transformation is linked to the formation of the Beilby layer, a white layer that is
revealed by etching. More recently, Le Bouar et al. have shown that, by extending 
a model developed for transitions induced by ball milling, they could predict 
quantitatively the wear rate of fast train wheels [116], as discussed in Section 8.2.5. 

More recently, the theoretical framework for alloys driven by plastic deforma- 
tion found another and unexpected area of application in the processing of or- 
ganic compounds by ball milling for pharmaceutical applications, as briefly dis- 
cussed in Section 8.2.5. 

From a fundamental viewpoint, it is helpful to compare the processes involved 
in alloys subjected to sustained plastic deformation with those discussed above 
for alloys under irradiation. Plastic deformation in crystalline systems, whether 
by dislocation slip or by twinning, induces a forced mixing of atomic species, as 
each dislocation glide shifts atoms in glide planes with respect to their initial 
neighbors. Furthermore, plastic deformation creates point defects [117], and 
thus the rate of processes controlled by the internal dynamics is accelerated in
proportion to the point defect supersaturation [118], as for alloys under irradia- 
tion. However, this forced atomic mixing presents three significant differences 
from the forced mixing induced by irradiation. First, the local shearing rate may 
depend significantly on the local microstructure of the alloys. Indeed, dislocation 
activity can be reduced by the presence of obstacles such as other dislocations, 
precipitates, grain boundaries, and interphase interfaces. One extreme situation 
is when, in a two-phase material, the imposed macroscopic deformation is ac- 
complished by the plastic deformation of one phase only, for instance when the 
second phase consists of hard, nonshearable precipitates. In that case, as in other 
cases of localized plasticity, the deformation will not force atomic mixing between 
the two phases [119]. Recent MD simulation results [120] suggest that this 
straightforward effect is the principal reason for the lack of mixing by high-energy
ball milling which is observed at low temperatures in alloy systems such
as Ag-Ni [121], Ag-Fe [122, 123], and Cu-W [124] (see ref. [119] for an extensive 
review). Now restricting our discussion to cases where plastic deformation produces
fairly homogeneous atomic mixing, the second difference from the irradiation
case is that, based on currently available data, the nonconservative glide and
crossing of dislocations produce vacancies but no interstitials [117, 125]. Recent 
molecular dynamics simulations in bcc metals [126] indicated that interstitial 
clusters can be produced, but, due to the high binding energy between intersti- 
tials and an interstitial cluster, no isolated interstitials can be re-emitted by these 
clusters. Finally, the third main difference resides in the intrinsic characteristics 
of the mixing produced by the glide of a dislocation (see Chapter 6). One glide 
event of a dislocation produces a perturbation, the sheared area, which is charac- 
terized by three lengths: the amount of shear, i.e., the modulus of the Burgers 
vector; the length of the moving segment of the dislocation; and the distance traveled
by this segment, i.e., the sheared length $L_{sh}$. While the particular values of
the latter two lengths are a function of the alloy and its microstructure, they may 
be very large. In some cases one expects the shear length to be of the order of the 
grain size. As discussed in detail in Section 8.2.4, this results in a forced atomic 
mixing whose efficiency is scale-dependent, and this property appears to play a 
key role in the spontaneous formation of compositional patterns. 


8.2.1.3 Alloys Subjected to Sustained Electrochemical Exchanges 

A third broad range of driven materials is the case of materials undergoing corro- 
sion due to chemical exchanges with their environment, e.g., a gas phase, or elec- 
trochemical exchanges with an electrolyte. While corrosion is generally and 
rightly perceived as a problem, subjecting alloys to forced electrochemical ex- 
changes with their environment can also be used to synthesize new structures, 
for instance by dealloying [127, 128]. This type of driven alloy system will not
be discussed further here, although we believe that the concepts and tools dis- 
cussed in the following sections could be transferred to materials undergoing 
electrochemical exchanges. 

8.2.2
Identification of the Relevant Control Parameters: Toward a Dynamical Equilibrium
Phase Diagram


The first important step in the study of a driven alloy is to identify the relevant 
control parameters. Consider, for instance, the case of an alloy under irradiation 
discussed in the previous section. The parameters set for an experiment include 
the mass, the energy, and the flux of the projectiles, the irradiation time, the tem- 
perature and the composition of the target, and the environment of the chamber 
— typically a high vacuum. These experimental parameters, however, are not nec- 
essarily the parameters relevant to the driven alloy framework. The mass, the en- 
ergy, and the flux of the projectiles for a given target determine the nature and 
the rate of the damage produced by nuclear collisions, i.e., the rate of production 
of freely migrating defects and of clustered defects, the rate of chemical mixing, 
the size of the displacement cascades, and the characteristic rate for atomic relo- 
cation in the cascades. It is therefore this second set of parameters which directly 
determines the spatial and temporal characteristics of the several dynamics active 
in the driven alloy. In the modeling of driven alloys simplifications are often 
made, thus leading to a smaller set of control parameters. For instance, if one focuses
on order-disorder transformations, it may be acceptable to ignore clustered
defects. Furthermore, if one is interested in steady-state properties, kinetic equa- 
tions can be simplified further by the use of reduced, often dimensionless, quan- 
tities. In the previous example of an ordered alloy under irradiation, if the domi- 
nant effects can be reduced to the disordering produced by ballistic mixing and 
the reordering promoted by thermal diffusion, a reduced forcing intensity can 
then be defined as the ratio of the ballistic jump frequency to the thermally acti- 
vated jump frequency. Once control parameters have been correctly identified, the 
outcomes of different irradiation experiments can be directly compared. 
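
The reduced forcing intensity described above can be illustrated with a minimal Python sketch. It assumes a simple Arrhenius form for the thermally activated jump frequency and ignores the acceleration of thermal jumps by the irradiation-induced point-defect supersaturation. The function names, attempt frequency, activation energy, and numerical values are hypothetical and are not taken from this chapter.

```python
import math

K_B_EV = 8.617333262e-5  # Boltzmann constant in eV/K


def thermal_jump_frequency(attempt_hz, activation_ev, temperature_k):
    """Arrhenius estimate of the thermally activated jump frequency (assumed form)."""
    return attempt_hz * math.exp(-activation_ev / (K_B_EV * temperature_k))


def reduced_forcing_intensity(ballistic_hz, attempt_hz, activation_ev, temperature_k):
    """Ratio of the ballistic jump frequency to the thermal jump frequency."""
    thermal_hz = thermal_jump_frequency(attempt_hz, activation_ev, temperature_k)
    return ballistic_hz / thermal_hz


def temperature_for_intensity(gamma, ballistic_hz, attempt_hz, activation_ev):
    """Temperature at which a given ballistic frequency gives the reduced intensity gamma."""
    return activation_ev / (K_B_EV * math.log(attempt_hz * gamma / ballistic_hz))


if __name__ == "__main__":
    # Hypothetical experiment A: low flux at 500 K.
    gamma_a = reduced_forcing_intensity(1e-3, 1e13, 1.0, 500.0)
    # Hypothetical experiment B: ten times the ballistic frequency; find the
    # temperature at which it maps onto the same reduced forcing intensity.
    temp_b = temperature_for_intensity(gamma_a, 1e-2, 1e13, 1.0)
    print("reduced forcing intensity:", gamma_a)
    print("matching temperature for experiment B (K):", temp_b)
```

Within this simplified picture, two experiments with different fluxes and temperatures are expected to reach comparable steady states when they share the same reduced forcing intensity.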

As discussed in Section 8.2.1.1, alloys under irradiation often reach a steady or 
a quasisteady state (see Fig. 8.5 for an example). By analogy with equilibrium 
phase diagrams, it is of fundamental and practical interest to construct the map 
of these steady states as a function of the control parameters. We refer to these 
maps as dynamical equilibrium phase diagrams, to emphasize the fact that these 
steady states are dynamical states stabilized by the external forcing. By construc- 
tion, for a given alloy system, a dynamical phase diagram contains the equilib- 
rium phase diagram, since when the forcing intensity goes to zero the stationary 
state reached is the equilibrium state. One of the main goals of the study of 
driven alloys is to develop tools, analytical ones if possible, that make it possible 
to construct dynamical equilibrium phase diagrams. We will illustrate in Section 
8.2.3 how this goal can be achieved in simple cases, for instance for microstruc- 
tures fully described by their composition field. In the most general case, how- 
ever, there are fundamental obstacles, such as the lack of detailed balance, and 
one is often reduced to using computer simulations to construct dynamical equi- 
librium phase diagrams. 

Fig. 8.5 Comparison of (a) the evolution of Co-rich particle sizes and
(b) normalized magnetizations in Cu90Co10 irradiated at 200 °C with
1 MeV Kr ions as a function of the dose for three sets of thin films:
as-grown by co-sputtering, randomized by pre-irradiation at room
temperature, and preannealed at elevated temperature. Magnetic
measurements (SQUID) were performed at T = 100 K (from ref. [197]).


The characterization of the physical processes introduced by the external forc- 
ing is less advanced in the case of alloys subjected to sustained plastic deforma- 
tion than it is for alloys under irradiation. This results partly from the fact that 
the local rate of deformation can couple with the microstructure, whereas the 
rate of defect production by irradiation is largely independent of that microstruc- 
ture. Nevertheless, one natural variable to characterize the intensity of the forcing 
is the plastic strain rate. It is often difficult, however, to determine this rate, in 
particular for powders processed by ball milling [129, 130]. Analyses of the me- 
chanical system consisting of the powders, the milling balls, and the milling con- 
tainer were performed using instrumented mills, direct visualization, and model- 
ing [131, 132]. They have provided useful estimates of the frequency and velocity 

Fig. 8.6 Steady-state structure of Ni10Zr7 as a function of milling
conditions as defined by the transfer of momentum at impact and the
impact frequency (see the text for definition). Filled and open squares
correspond to fully amorphous and partly amorphous steady states,
respectively. The boundary separating the two types of steady-state
microstructures fits well a line of constant value for $M_b V_{\max} f$ (from ref.
[133]).


of impacts between milling tools and powder particles, as a function of milling 
parameters such as number, size, and mass of balls, ball-to-powder mass ratio, 
and ratio of velocity of vial to velocity of main disc in planetary mills. These parameters,
however, are not independent as far as the degree of amorphization or the
degree of chemical order is concerned. An alternative approach developed by 
Martin and coworkers has been to perform parametric studies to assess the effect 
of the experimental parameters on a phase transformation, e.g., crystal-to- 
amorphous [133] or order-disorder [134], and to define a milling intensity. One 
can then propose and assess reduced variables which can rationalize the observed 
phase transformations consistently. A point of debate in the community has been 
to identify whether the milling intensity is related to the kinetic energy or to the 
momentum transferred per impact per unit mass of powder. Chen et al. [133] for 
amorphization reactions in the Ni-Zr system (see Fig. 8.6), and Pochet et al. 
[134] for order-disorder reactions in the Fe-Al system, used an instrumented vi- 
bratory ball mill and concluded that the correct specific milling intensity is in fact 
related to the momentum transferred and is defined by Eq. (18), where $M_b$ and
$M_p$ are the ball and powder masses, respectively, $V_{\max}$ the maximum velocity of
the frame, and $f$ the impact frequency.


$$ I = \frac{M_b V_{\max} f}{M_p} \tag{18} $$


For the Ni10Zr7 compound, for instance, full amorphization is achieved when the
specific milling intensity exceeds a threshold value of 510 m s^-2, regardless of
the particular combination of values for $M_b$, $M_p$, $V_{\max}$, and $f$. The practical value
of the experimental dynamical equilibrium phase diagrams is that they can be 
used to anticipate the outcome of other experiments. A practical limitation is 
that it can be quite time-consuming to construct these phase diagrams relying 
solely on experiments, in particular if one varies the composition of the alloy. An 
additional limitation is that results obtained with different mills, e.g., shaker, vi- 
bratory, and planetary mills, may differ somewhat due to variations in the me- 
chanical processes involved. It is nevertheless remarkable that the milling inten- 
sity as defined by Eq. (18) has been successfully extended to predict quantitatively 
the wear rate of fast train wheels [116]. 
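
As a small worked illustration of Eq. (18), the Python sketch below evaluates the specific milling intensity and compares it with the 510 m s^-2 threshold quoted above for Ni10Zr7. The function names and the milling parameters in the example are hypothetical values chosen only to show that different parameter combinations can give the same intensity.

```python
# Threshold quoted in the text for full amorphization of Ni10Zr7 (m s^-2).
AMORPHIZATION_THRESHOLD = 510.0


def specific_milling_intensity(ball_mass_kg, max_velocity_m_s, impact_freq_hz, powder_mass_kg):
    """Eq. (18): momentum transferred per unit time and per unit mass of powder."""
    if powder_mass_kg <= 0:
        raise ValueError("powder mass must be positive")
    return ball_mass_kg * max_velocity_m_s * impact_freq_hz / powder_mass_kg


def is_fully_amorphous(intensity, threshold=AMORPHIZATION_THRESHOLD):
    """True when the intensity exceeds the amorphization threshold."""
    return intensity > threshold


if __name__ == "__main__":
    # Two hypothetical settings: a light ball moving fast and a heavy ball moving slowly.
    light_fast = specific_milling_intensity(0.5, 0.30, 20.0, 0.005)
    heavy_slow = specific_milling_intensity(1.0, 0.15, 20.0, 0.005)
    print(light_fast, heavy_slow, is_fully_amorphous(light_fast))
```

Both hypothetical settings give the same intensity, which is the sense in which the steady state is said to be independent of the particular combination of ball mass, powder mass, frame velocity, and impact frequency.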

It is worth making a connection between dynamical equilibrium phase dia- 
grams and the so-called Ashby maps. Ashby and coworkers have introduced very 
useful mechanism-based maps for materials subjected to creep, sintering, and 
dry frictional wear [135-137]. These maps indicate, for a given set of reduced 
variables, the dominant mechanism in the kinetic evolution of the variable used 
to monitor the evolution of the materials, i.e., the creep rate, the sintering rate, 
and the wear rate in the above examples. Ashby’s maps, however, do not locate 
transitions of these evolution rates. The purpose of dynamical equilibrium phase 
diagrams is to identify these transitions precisely. The consequences of the 
existence of dynamical phase transitions should not be underestimated. When 
one performs accelerated tests to assess the resistance of a material to a certain 
external forcing, for instance, a pressure vessel steel in a nuclear reactor or
a metallic waste container subjected to corrosion, the extrapolation of results
obtained from short-time experiments at high forcing rates to an environment
where the material would be subjected to a much lower forcing rate, but for a
considerably longer period of time, can be valid only as long as one does not cross a
dynamical phase transition on going from the high to the low forcing rate.
Indeed, if such a dynamical transition takes place, the system at low forcing rate
would be evolving toward a different steady state, and the accelerated experiments 
would be of no predictive value. 

It will be recalled in Section 8.2.3 that, under fairly general conditions for time- 
independent external forcing, if the microstate probability distribution of a driven 
alloy reaches a steady-state distribution, this steady-state distribution is unique. 
There is however no guarantee that the microstate probability distribution will 
reach a steady state, or that the macroscopic variables used to characterize a 
driven alloy will reach steady state. In fact, it has been known for several decades 
that dissipative systems may reach time-dependent states such as oscillatory and 
chaotic ones [30, 138]. Oscillatory states had not been reported for the driven al- 
loys considered in this chapter until recently. El-Eskandarany and coworkers, 
however, have observed oscillatory regimes in the amorphization-crystallization 
reactions in Co75Ti25 [139], Co50Ti50 [140], Al50Zr50 [141], and Cu33Zr67 [142].
Based on the experimental procedure used in these experiments, and on a careful 
analysis of the milled powders using XRD, DSC, SEM and TEM, these authors 
have excluded the possibility that the oscillatory behavior is due to contamination
or temperature variations. A possible explanation for the cyclic response
is that the damage created by one collision is a function of the state of the
material. The corresponding situation in alloys under irradiation would be a sys- 
tem where the displacement cross-sections vary with the state of the material, so 
that, for a constant ion flux, various phases would in fact be irradiated at vari- 
ous displacement rates. As discussed in Section 8.2.1, this is the case for the irra- 
diation of carbon, due to a threshold displacement energy for graphite lower than 
that for diamond. Very recently, Johnson and coworkers introduced a kinetic 
model which aims at identifying the general conditions required for producing 
cyclic responses during ball milling [143]. 

Until now, the forcing intensity has been assumed to be a constant. There are 
practical situations, however, where this intensity varies in time. This is clearly 
the case for materials subjected to stress or thermal fatigue. The same is true for 
nuclear reactor components whenever the power is adjusted to the demand on 
the electric grids, or whenever a reactor is stopped and restarted. New reactor
designs, for instance the pebble-bed nuclear reactor, offer the possibility to do so on
a rather short time scale, of the order of one minute. Similarly, fusion reactors
will operate in a pulsed mode, at short time scales. From a fundamental perspective,
one cannot assume that it is sufficient to use the average forcing intensity to
describe these situations. The kinetic equations describing the evolution of the 
material variables, for instance its composition field, are no longer autonomous 
differential equations. Very little basic work has been devoted to this question in 
the context of alloys under irradiation. Rauh and coworkers [144] calculated sink 
strengths under pulsed irradiation conditions and reported that, in particular for 
free surfaces, the instantaneous sink strength can differ significantly from its av- 
erage value. More generally, the rate theory of defect accumulation (see below) 
should be used with caution in order to assess the effect of pulsed irradiation; in- 
deed, the former rests on a time- and space-averaging procedure which is not pre- 
cisely defined; the applicability of the rate theory depends on the respective values 
of the duration of pulses, of the time lag between pulses, and on the lifetime of 
the cascade-produced defects [145]. 


8.2.3 
Theoretical Approaches and Simulation Techniques 


The evolution of an alloy subjected to a sustained external forcing is not driven by 
thermodynamic forces. During this evolution, the free energy of the alloy may in 
fact decrease or increase, as demonstrated experimentally in ball milling experi- 
ments (see Section 8.2.1). In the past, some authors proposed adding a constraint 
on the free energy to account for the contributions of the defects and the struc- 
tural or chemical disorder present in a nonequilibrium system. However, this 
approach fails to rationalize even simple irradiation-induced phase transforma- 
tions [146]. Due to the continuous production and annealing of disorder and de- 
fects in a driven alloy, only a kinetic model can provide a sound description. That 
kinetic model, furthermore, should include the relevant kinetic processes and 
their coupling. We review below the various methods which are available to 
model and simulate these processes, from the atomistic to the continuum level. 

8.2.3.1 Molecular Dynamics Simulations

Molecular dynamics (MD), which is discussed in detail in Chapter 12, is a power- 
ful method to identify the short-term response of an ensemble of atoms to a given 
external forcing. MD simulations have provided unique information on the ele- 
mentary processes involved during forcing events, in particular for the damage 
produced by irradiation in solids [39] (see, for instance, Fig. 8.3), and more 
recently for the plastic deformation in materials subjected to large stresses. 
Thanks to the development of realistic semiempirical interatomic potentials (see 
Chapter 11), and to the explosion in computing power, systems containing up to 

Fig. 8.7 MD calculations of atomic relocations produced by the
collision cascades initiated by the indicated ions on a Ag50Cu50 solid
solution. Top: absolute histograms of relocation distances; bottom:
probability densities of atomic relocations. Also shown are fits to the
data based on exponential decays: 0.07 Å^-1 exp[-r/(3.08 Å)] for the
case of Ne, Ar, and Kr, and 0.08 Å^-1 exp[-r/(1.44 Å)] for the case of He
(from ref. [50]).

several million atoms can be simulated for several nanoseconds of physical
time. These simulated times, in general, do not make it possible to reach steady 
states which are partly determined by thermally activated diffusion. These limita- 
tions can be alleviated by simulating small systems, and by using high tempera- 
ture or high forcing rate, as long as the extrapolation of these results to more re- 
alistic conditions is valid. In the case of alloys under irradiation, in addition to 
assessing the number of Frenkel pairs and atomic replacements produced by a 
cascade of a given energy [see Fig. 8.3 and Eq. (17)], MD simulations have
been employed to reveal the distribution of relocation distances as a function of 
the mass and energy of the projectiles. Figure 8.7 shows the number density of 
relocation as a function of the relocation distance for 1 MeV Kr, 500 keV Ar, 270 
keV Ne, and 62 keV He ions in a Ag50Cu50 solid solution [50]. The energies of the
ions are chosen so that they produce the same projected range for the projectiles,
≈ 260 nm. As expected, the heavier the ion, the more relocations are produced.
For short relocation distances, the number density displays oscillations which cor- 
respond to the neighbor distances allowed by the crystal structure. At longer dis- 
tances, however, the distribution follows an exponential decay, with a decay 
length R ≈ 3.08 Å for the three heavy ions and R ≈ 1.44 Å for He. The fact that
an exponential decay, rather than a Gaussian one, reproduces well the distribution
for relocation distances greater than 5 Å suggests that these relocations result
from ballistic events, as opposed to a diffusion-like process. 
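
The distinction between an exponential and a Gaussian tail can be checked on a semilog plot, where only the exponential gives a straight line. The Python sketch below is a hedged illustration of that test: it fits a line to ln(density) versus distance and reports the decay length and the worst residual. The synthetic exponential data use the fit parameters quoted in the caption of Fig. 8.7 for the heavy ions; the Gaussian width and the sampled distances are assumptions made purely for comparison.

```python
import math


def semilog_fit(distances, densities):
    """Least-squares line through (r, ln p); returns (amplitude, decay_length)."""
    logs = [math.log(p) for p in densities]
    n = len(distances)
    r_mean = sum(distances) / n
    l_mean = sum(logs) / n
    sxx = sum((r - r_mean) ** 2 for r in distances)
    sxy = sum((r - r_mean) * (l - l_mean) for r, l in zip(distances, logs))
    slope = sxy / sxx
    intercept = l_mean - slope * r_mean
    return math.exp(intercept), -1.0 / slope


def max_semilog_residual(distances, densities):
    """Largest deviation of ln p from the fitted straight line."""
    amplitude, decay_length = semilog_fit(distances, densities)
    return max(
        abs(math.log(p) - (math.log(amplitude) - r / decay_length))
        for r, p in zip(distances, densities)
    )


# Distances beyond 5 angstroms, where the text reports the exponential regime.
DISTANCES = [5.0 + 0.5 * k for k in range(31)]
# Synthetic exponential tail built from the heavy-ion fit in the caption of Fig. 8.7.
EXP_TAIL = [0.07 * math.exp(-r / 3.08) for r in DISTANCES]
# Synthetic Gaussian tail with an assumed width of 3 angstroms, for comparison only.
GAUSS_TAIL = [0.07 * math.exp(-r * r / (2.0 * 3.0 ** 2)) for r in DISTANCES]

if __name__ == "__main__":
    print("exponential fit:", semilog_fit(DISTANCES, EXP_TAIL))
    print("residual, exponential tail:", max_semilog_residual(DISTANCES, EXP_TAIL))
    print("residual, Gaussian tail:", max_semilog_residual(DISTANCES, GAUSS_TAIL))
```

A near-zero residual for the exponential data and a large one for the Gaussian data is the signature used here to separate ballistic relocations from a diffusion-like process.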


8.2.3.2 Microscopic Master Equation 

Kinetic Monte Carlo (KMC) simulations provide a powerful way to assess the ef- 
fect of the various parameters identified by MD on the long-term evolution of a 
driven alloy. KMC methods take advantage of transition rate theory for thermally 
activated processes to calculate directly the frequency of events which have been 
identified as being relevant for the alloy kinetics. In order to appreciate better the 
foundations of KMC methods, it is useful first to review a general microscopic 
description of an alloy where atoms can migrate because of the presence of several
dynamical processes. For simplicity, consider a binary alloy system where
$\alpha$ = A, B species are distributed on lattice sites $i = 1, \ldots, N$. We denote the $2^N$
microconfigurations by the set of N occupation numbers, $\{n_i^\alpha\}$, where $n_i^\alpha = 1$ if the site
$i$ is occupied by an $\alpha$ atom and $n_i^\alpha = 0$ otherwise; point defects are not included in
this presentation. Consider now an ensemble of equivalent systems, and define
the probability of finding one system in a given microconfiguration at time t,
$P(\{n_i^\alpha\}, t)$, as the fraction of systems found in that state at time t. Restricting
ourselves to Markovian processes, i.e., processes that do not depend on the past
history, the temporal evolution of the microconfiguration probability is given by a
master equation (19).


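$$ \frac{dP(\{n_i^\alpha\}, t)}{dt} = \sum_{\{n_i^\alpha\}'} \left[ P(\{n_i^\alpha\}', t)\, W(\{n_i^\alpha\}' \to \{n_i^\alpha\}) - P(\{n_i^\alpha\}, t)\, W(\{n_i^\alpha\} \to \{n_i^\alpha\}') \right] \tag{19} $$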

The summation is performed over all the states $\{n_i^\alpha\}'$ which can be reached from
$\{n_i^\alpha\}$ by one event. $W(\{n_i^\alpha\} \to \{n_i^\alpha\}')$ is the transition rate from state $\{n_i^\alpha\}$ to
$\{n_i^\alpha\}'$. This transition rate includes all the processes that can induce such a
transition. If we assume that forced transitions are taking place independently of the
thermally activated ones, we can write Eq. (20), where the “th” and “b” subscripts
correspond to thermal and ballistic events, respectively.


$$ W(\{n_i^\alpha\} \to \{n_i^\alpha\}') = W_{th}(\{n_i^\alpha\} \to \{n_i^\alpha\}') + W_b(\{n_i^\alpha\} \to \{n_i^\alpha\}') \tag{20} $$


Following transition rate theory, the thermal rate is expressed as Eq. (21), where
$E(\{n_i^\alpha\} \to \{n_i^\alpha\}')$ is the activation energy for that transition.





$$ W_{th}(\{n_i^\alpha\} \to \{n_i^\alpha\}') = \nu \exp[-\beta E(\{n_i^\alpha\} \to \{n_i^\alpha\}')] \tag{21} $$


In contrast to the activation energy, the pre-exponential factor $\nu$ displays only a
weak dependence on the configuration (see Chapter 5), and for simplicity it is
assumed here to be constant. In the simple case where all forced transitions
correspond to ballistic exchanges of first-nearest-neighbor atoms, the forced transition
rate in Eq. (20) reduces to $\Gamma_b$, the ballistic jump frequency. More complex
expressions may be required to take into account the specifics of a given forcing
situation, for instance the relocation distance, as discussed in detail in Section 8.2.4.

In order to identify the steady states reached by a driven alloy, one solves for
$dP(\{n_i^\alpha\}, t)/dt = 0$ in Eq. (19). In the absence of forcing, microscopic reversibility
implies that detailed balance [Eq. (22)] is satisfied at equilibrium for all $\{n_i^\alpha\}$ and
$\{n_i^\alpha\}'$ microconfigurations, where the subscript “e” refers to the equilibrium
probability distribution.


$$ P_e(\{n_i^\alpha\})\, W(\{n_i^\alpha\} \to \{n_i^\alpha\}') = P_e(\{n_i^\alpha\}')\, W(\{n_i^\alpha\}' \to \{n_i^\alpha\}) \tag{22} $$


This property makes it straightforward to calculate the probability of any state
relative to that of a reference state, and thus it yields the explicit solution [Eq. (23)]
for the equilibrium probability distribution, where $E(\{n_i^\alpha\})$ is the internal energy
of the state $\{n_i^\alpha\}$, and the normalization factor $Z$ is the partition function.


$$ P_e(\{n_i^\alpha\}) = Z^{-1} \exp[-\beta E(\{n_i^\alpha\})] \tag{23} $$


In the presence of external forcing, however, microscopic reversibility is lost, and 
detailed balance does not have to be satisfied. There is in fact no guarantee that, 
for a dissipative system, a steady-state probability distribution exists. Lebowitz 
et al. [60], however, demonstrated under quite general conditions that if one 
such solution exists, it is unique. One approach which has sometimes been
proposed is to assume that this solution exists, and to write it as Eq. (24), where
$H_{\mathrm{eff}}$ is an effective Hamiltonian [147, 148].


$$P_s(\{n_i^\alpha\}) = Z_{eff}^{-1} \exp[-\beta H_{eff}(\{n_i^\alpha\})] \tag{24}$$

The problem is then shifted to the determination of these effective interactions,
which, unlike Ising-type interactions, are a function of temperature and forcing
parameters. While there is no general method to solve this problem exactly, effective
interactions can be determined by an inverse KMC method [149], and approximate
solutions can be obtained using a mean-field approximation (see below).


8.2.3.3 Kinetic Monte Carlo Simulations 

The microscopic master equation formalism provides a clear foundation for KMC 
simulations. As discussed in Chapter 12, one can formally integrate the master 
equation, and, starting from an ensemble of systems all in the same microscopic 
state, the probability for a given system to have remained in that state until time t 
is given by Eq. (25), 


$$P_r(\Delta t) = \exp[-\Delta t\, \sigma(\{n_i^\alpha\})] \tag{25}$$

where $\sigma(\{n_i^\alpha\})$ is defined as the sum of the rates of all the possible transitions
from state $\{n_i^\alpha\}$ to any other state, as a result of thermally activated and forced
processes [Eq. (26)].

$$\sigma(\{n_i^\alpha\}) = \sum_{\{n_i^\alpha\}'} W(\{n_i^\alpha\} \to \{n_i^\alpha\}') \tag{26}$$

As seen from Eq. (25), the average residence time of the system in its initial state
is given by $\tau(\{n_i^\alpha\}) = 1/\sigma(\{n_i^\alpha\})$. The probability of escaping along a given transition
is given by the transition rate of that particular transition normalized by
$\sigma(\{n_i^\alpha\})$. The so-called residence time algorithm takes full advantage of these relationships
(see Chapter 12 for specific details).
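The two relationships above (exponentially distributed residence time, escape probability proportional to the rate) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the implementation described in Chapter 12: the function name `residence_time_step` and the flat list of rates are hypothetical, and thermal and forced rates are simply listed together.

```python
import math
import random


def residence_time_step(rates, rng=random):
    """One step of a residence-time (rejection-free) KMC scheme.

    rates: non-negative transition rates out of the current state
           (thermally activated and forced transitions listed together).
    Returns (index of the selected transition, time increment).
    """
    total = sum(rates)  # total escape rate from the current state
    if total <= 0.0:
        raise ValueError("no transition available from this state")

    # Select a transition with probability rate / total.
    target = rng.random() * total
    cumulative = 0.0
    chosen = len(rates) - 1  # fallback in case of floating-point round-off
    for k, rate in enumerate(rates):
        cumulative += rate
        if target < cumulative:
            chosen = k
            break

    # Residence time drawn from an exponential law with mean 1 / total.
    dt = -math.log(1.0 - rng.random()) / total
    return chosen, dt
```

For example, with two competing transitions of rates 1 and 3 (arbitrary units), the second one should be selected about three times out of four, and the mean time increment should approach 1/4.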

As discussed in Chapter 12, this residence time algorithm is equivalent to the
so-called Bortz-Kalos-Lebowitz algorithm, and to various algorithms introduced
in different contexts. The main advantages of these KMC algorithms are
firstly that they are computationally efficient, in particular at low temperature
and when the system contains only few defects (vacancies) which are responsible 
for thermal diffusion. Secondly, they rely naturally on the physical time for build- 
ing the evolution of the system. As a result, these algorithms are easily extended 
to the case of driven systems, where several dynamical processes are acting in 
parallel, as in Eq. (20). The price that one pays in using KMC rather than MD is 
that only the atomistic processes identified a priori will take place during a simu- 
lation. Voter and coworkers [150-152], and Henkelman and Jonsson [153] have 
recently developed hybrid methods where MD is used to uncover all possible 
transitions, and this information is then fed into a KMC algorithm. Another lim- 
itation of nearly all current KMC simulations is that atoms are constrained to re- 
side on the sites of a rigid lattice. This excludes the modeling of amorphous 
phases. Even for crystalline materials it can be a severe limitation, for instance
when stress, dislocations, or grain boundaries play a significant role in the evolution
of the driven alloy. Attempts have been made to include lattice relaxations in
KMC algorithms [154, 155], but they limit significantly the physical times that 
can be reached in the simulations. Indeed, in a concentrated binary alloy, there 
already exists an impossibly large number of distinct atomic configurations 
around a vacancy, and the situation is of course worse for multicomponent alloys. 
Furthermore, the direct calculation of activation energies for relaxed configura- 
tions is typically 10° to 10° times longer than for the case of an alloy on a rigid 
lattice. Various interpolation schemes have been proposed where one first builds 
a small database of activation energies calculated exactly, on relaxed configura- 
tions, and uses this data set to predict the activation energies of all other configu- 
rations. Le Bouar and Soisson [156] employed effective pair interaction energies 
successfully to predict activation energies for vacancy migration in a dilute Fe-Cu 
alloy described by embedded atom model (EAM) interatomic potentials. For
concentrated alloys, Sastry et al. [157] showed that, for two-dimensional systems, 
genetic algorithms can be used successfully to predict activation energies by sym- 
bolic regression of a small subset of exactly calculated activation energies. In 
some favorable situations, for instance in Al$_{1-x}$Li$_x$ alloys [158], it appears that
the migration energies are mostly dependent on the type of diffusing atoms, and 
nearly independent of the chemical environment at the saddle point. It is then 
possible to calculate these few barriers, for instance using ab-initio methods, and 
use them as input for the KMC algorithm. Nevertheless, significant work remains 
to be done before general large-scale and long-time KMC simulations can be per- 
formed on relaxed lattices. 

Kinetic Monte Carlo simulations are also employed to evolve the microstruc- 
ture of alloys under irradiation based on a continuum-space description. The evo- 
lution can be based on “events,” such as the elimination of a point defect on a 
defect cluster, as in the JERK code [159-161], or on the diffusion and reaction of 
objects, such as point defects and defect clusters, as in the BIGMAC code [162, 
163]. These codes are quite powerful for elemental systems, but their extension 
to dilute and concentrated alloys remains a challenge because, in general, one 
may expect complex interactions between the chemistry-related variables, compo- 
sition and chemical order, and the defect-related variables, such as point defects, 
defect clusters, dislocations, and grain boundaries. 

Continuum methods are also widely employed. For spatially homogeneous sys- 
tems, deterministic methods such as rate equations have been used in the past, 
for instance to calculate the evolution of point defects under irradiation [164, 
165], or to model the evolution of the long-range order parameter of chemically 
ordered alloys under irradiation [64]. A limitation of these techniques is that 
they lack fluctuations. These can be recovered by using stochastic variables, for 
instance solving the master equation [Eq. (19)] [166, 167] or its Kubo Ansatz 
[168], or using the path probability method (PPM): see Chapter 10. For inhomo- 
geneous systems, one can still rely on rate equations, or use phase field models 
(PFMs; see Chapters 10 and 12). A key difficulty in these continuum methods is
in the definition of the continuum variables, which are obtained by some time
and space averaging of the atomic-scale, discrete variables. In the absence of a 
clear physical picture for this averaging process, PFM simulations lack absolute 
length scales and time scales. Recent developments have been proposed that 
make it possible to define an absolute length scale (see Chapter 10) and time 
scale [18]. Finally we note that while the new phase field crystal model [169], 
where the field variable is defined as a time-averaged atomic density, retains a 
clear definition of time scale and length scale, its computational efficiency over 
KMC simulations remains to be explored. 

From a broader perspective, while it appears clearly that the need to simulate 
large-scale systems over long physical time scales will make it necessary to rely 
on continuum methods, an important challenge is to retain in these models the 
key physical effects which can influence the long-term evolution of driven sys- 
tems. In particular, special attention needs to be paid to the thermodynamic and 
kinetic coupling which may exist between point defects, chemical species, and 
elements of the microstructure. In addition, one cannot ignore fluctuations in 
driven systems. In particular, the extrapolation of results obtained for one alloy 
irradiated with one type of ion and energy to other particles or other energies re- 
quires kinetic models that include the effect of the recoil spectrum on damage 
production. 


8.2.3.4 Kinetics of Concentration Fields under Irradiation [24] 

We may provide a continuum description for a solid solution under irradiation
(the simplest example of a driven alloy) by completing the formalism summarized
in Section 8.1.5 above. Indeed, Eq. (15) should be complemented by the conservation
equations for vacancies and interstitials. In a solid solution with uniform
concentration, the latter are expressed as Eq. (27), where $K_d$ ($d = i$ for
interstitials, $v$ for vacancies) is the defect production rate (with a linear dependence
on the irradiation flux), $K_{iv}$ is the rate constant for the bimolecular recombination
reaction between a vacancy and an interstitial, and $K_{df}$ ($d = i, v$) is the
rate constant for the monomolecular elimination reaction of a vacancy or an interstitial
on a fixed sink $f$. The latter elimination has been homogenized (smeared
out) over the full volume. All such rate constants are dependent on solute concentration
a priori.

$$\partial_t C_d = K_d - K_{iv} C_i C_v - K_{df} C_d; \quad d = i, v \tag{27}$$
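To make the structure of these rate equations concrete, here is a minimal Python sketch that integrates the two coupled balance equations (production, bimolecular recombination, elimination at sinks) with an explicit Euler scheme. All numerical values are hypothetical and in arbitrary consistent units, and the production rate is assumed to be the same for interstitials and vacancies (defects created in pairs); neither choice comes from the text.

```python
def defect_rates(c_i, c_v, k_prod, k_iv, k_if, k_vf):
    """Right-hand sides of the rate equations for interstitials and vacancies."""
    recombination = k_iv * c_i * c_v
    return (k_prod - recombination - k_if * c_i,
            k_prod - recombination - k_vf * c_v)


def integrate_defects(k_prod, k_iv, k_if, k_vf, dt, n_steps):
    """Explicit Euler integration starting from a defect-free crystal."""
    c_i = c_v = 0.0
    for _ in range(n_steps):
        rate_i, rate_v = defect_rates(c_i, c_v, k_prod, k_iv, k_if, k_vf)
        c_i += dt * rate_i
        c_v += dt * rate_v
    return c_i, c_v


# Hypothetical parameters: fast interstitial elimination, slow vacancy elimination.
PARAMS = dict(k_prod=1e-6, k_iv=1e4, k_if=10.0, k_vf=0.1)

c_i, c_v = integrate_defects(dt=0.01, n_steps=50000, **PARAMS)
print(c_i, c_v)
```

At steady state the two equations share the same production and recombination terms, so the sink terms must balance: the product of the interstitial sink rate constant and interstitial concentration equals the corresponding product for vacancies. The slower defect (here the vacancy) therefore accumulates to a much higher concentration. The explicit scheme is only adequate because the time step is small compared with the fastest relaxation time; a stiff solver would be preferable for realistic parameters.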


We restrict our treatment to a binary solid solution (see footnote 6), together with interstitials
and vacancies. Assuming that the overall number of lattice sites remains
constant (e.g., there is no “swelling” resulting from a net accumulation of vacancies),
we are left with three independent concentration fields; we choose the interstitial,
vacancy, and solute (hereafter labeled s) concentrations as independent
variables.


6) Extending the formalism to multicomponent alloys is still an open challenge.

In a non-uniform solid solution, the conservation equations are Eqs. (28).

$$\begin{aligned}
\partial_t C_i &= K_i - K_{iv} C_i C_v - K_{if} C_i - \nabla \cdot J_i \\
\partial_t C_v &= K_v - K_{iv} C_i C_v - K_{vf} C_v - \nabla \cdot J_v \\
\partial_t C_s &= -\nabla \cdot J_s \\
J_\alpha &= -\sum_\beta D_{\alpha\beta} \nabla C_\beta; \quad \alpha, \beta = i, v, s
\end{aligned} \tag{28}$$


The diffusion coefficients can be expressed, as a first approximation (see below),
as Eqs. (29), where $D^{th}$ is the diffusion coefficient which results from the thermally
activated jumps of point defects (as in Section 8.1), while $D^{bal}$ is the “ballistic diffusion
coefficient,” which results from the collisions of atoms with the irradiating
particles. The latter tends to erase concentration gradients, while the former
erases the gradients of chemical potentials; as discussed in Section 8.1, $D^{th}$ is
the product of Onsager’s matrix and the susceptibility matrix, $f_{\alpha\beta}$ [Eq. (30)].

$$\begin{aligned}
D_{\alpha\beta} &= D^{th}_{\alpha\beta} + D^{bal}_{\alpha\beta} \\
D^{th}_{\alpha\beta} &= \sum_\gamma L'_{\alpha\gamma} f_{\gamma\beta}
\end{aligned} \tag{29}$$

$$f_{\alpha\beta} = \frac{\partial(\mu_\alpha - \mu_{solvent})}{\partial C_\beta}; \quad \alpha, \beta = i, v, s \tag{30}$$


Note that the Onsager coefficients are L’ rather than L, as before. Indeed, the sol- 
ute diffusion parameters are proportional to the defect concentrations, which are 
higher under irradiation compared to thermal equilibrium; also, solute now dif- 
fuses via both vacancy and dumbbell mechanisms. In the latter case, two atoms 
share one lattice site, forming a dumbbell. While a dumbbell may rotate around 
its center, its migration requires a rotation-translation mechanism, which leaves 
one atom of the dumbbell on its initial lattice site and transports the second atom 
to a neighboring site where it forms a new dumbbell with the atom found at that 
site.

As shown by Eqs. (28) and (29), the kinetics of concentration fields under irradiation
differs from that discussed in Section 8.1.5 by two distinct contributions:
• ballistic diffusion promotes random mixing;
• defect fluxes, which are sustained by irradiation, drive a
solute flux (the so-called “inverse Kirkendall effect”), which
appears in Eq. (28) via the solute-defect cross-diffusion
coefficients.


From Eqs. (28) and (29), we can identify three regimes, depending on whether 
ballistic diffusion dominates, the inverse Kirkendall effect dominates, or both effects
are of the same order of magnitude.

Ballistic mixing dominant

We can neglect defect fluxes and we are left with a single
conservation equation, Eq. (31), where $\delta F/\delta C_s$ stands for the functional derivative
of the free energy functional with respect to the concentration profile $C_s$.

$$\partial_t C_s = -\nabla \cdot J_s; \quad J_s = -L' \nabla \frac{\delta F}{\delta C_s} - D^{bal} \nabla C_s = -\left( L' \frac{\delta^2 F}{\delta C_s^2} + D^{bal} \right) \nabla C_s \tag{31}$$

As is well known, in the absence of irradiation the equilibrium concentration profile
(flat for a uniform solid solution, S-shaped for a two-phase state) corresponds
to an extremum of the free energy functional. The latter equation can be written
formally as Eq. (32).

$$J_s = -L' \nabla \frac{\delta F_{eff}}{\delta C_s} \tag{32}$$

$F_{eff}$ has some of the properties of a free energy functional: the stationary concentration
profiles correspond to extrema of $F_{eff}$. The local stability of a stationary solution
is given by the curvature of the functional. However, the relative stability of
competing stationary states cannot be assessed from $F_{eff}$. Such a functional is indeed
a Lyapunov function of a deterministic problem; to assess the full stability of
a dynamic regime, we need a correct description of the noise it is subjected to
(see below).

In the simple case of the regular solution model, with some simplifying assumptions,
it is a simple matter to show that Eq. (33) applies.

$$\begin{aligned}
F_{eff} &= \int \left[ f_{eff}(C_s) + \kappa \left( \frac{\partial C_s}{\partial x} \right)^2 \right] dx \\
f_{eff}(C_s) &= U(C_s) - T_{eff}\, S(C_s)
\end{aligned} \tag{33}$$

$U$ and $S$ are the contributions of a slab of matter between $x$ and $x + dx$ (with concentration
$C_s$) to the internal energy and to the entropy, respectively; $\kappa$ is the gradient
energy coefficient (see Chapter 10). In the regular solution model, the latter
is linked to the second moment of the ordering potential.

The effective temperature $T_{eff}$ is given by Eq. (34), where $D'_{ch}$ is the enhanced
chemical diffusion coefficient.

$$T_{eff} = T \left( 1 + \frac{D^{bal}}{D'_{ch}} \right) \tag{34}$$

The details of the demonstration are given in ref. [44].

As a first guess, one may say that the equilibrium state of a solid solution
under irradiation is what it would achieve in the absence of irradiation at a
temperature, $T_{eff}$, higher than the physical temperature $T$ by a factor of
$(1 + D^{bal}/D'_{ch})$. As a consequence, the effective temperature is very high in the
case of low-temperature irradiations ($D'_{ch}$ is very low): the latter will promote second
phase re-solution. On the other hand, for high-temperature low-flux irradiation,
ballistic mixing will not affect the respective stability of the precipitates with
respect to the solid solution ($D'_{ch}$ is large and $D^{bal}$ is small). Such a simple criterion
proves very useful as a first guess, in many circumstances.
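The effective-temperature criterion is easy to evaluate numerically. The Python sketch below assumes the form T_eff = T (1 + D_bal / D_ch) and, purely for illustration, replaces the irradiation-enhanced chemical diffusion coefficient by a simple Arrhenius law; the prefactor, activation energy, and ballistic diffusion coefficient are hypothetical numbers, not values from the text.

```python
import math

K_B = 8.617333e-5  # Boltzmann constant in eV/K


def chemical_diffusivity(temperature, d0, e_act):
    """Arrhenius stand-in for the enhanced chemical diffusion coefficient."""
    return d0 * math.exp(-e_act / (K_B * temperature))


def effective_temperature(temperature, d_bal, d0, e_act):
    """Effective temperature T * (1 + D_bal / D_ch) for ballistic mixing."""
    d_ch = chemical_diffusivity(temperature, d0, e_act)
    return temperature * (1.0 + d_bal / d_ch)


# Hypothetical values: D0 and D_bal in cm^2/s, activation energy in eV.
D0, E_ACT, D_BAL = 1e-2, 1.0, 1e-16

for t in (300.0, 400.0, 600.0, 800.0):
    print(t, effective_temperature(t, D_BAL, D0, E_ACT))
```

With these numbers the ratio T_eff / T is very large at 300 K and indistinguishable from unity at 800 K, which reproduces the two limits described above: re-solution at low temperature and negligible ballistic effects at high temperature and low flux.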


Inverse Kirkendall effect dominant

Ballistic mixing, and therefore ballistic fluxes,
can be neglected, but we must consider in detail all the couplings between solute
and defect fluxes. We find an entirely new class of phenomena, well identified experimentally:
these include irradiation-induced segregation and irradiation-induced
precipitation in undersaturated solid solutions. Indeed, when the phenomenological
coefficients in Eq. (28) are dependent on solute concentration, permanent
point defect fluxes may be triggered by irradiation due to solute concentration
heterogeneity in an undersaturated solid solution. There are many possible mechanisms
for that. As an example, point defects may be produced uniformly ($K_i$, $K_v$
do not depend on $C_s$), but they recombine preferentially in solute-enriched (or
-depleted) regions, due to enhanced defect mobility (which enters $K_{iv}$) in the latter
regions: a uniform defect production together with a localized enhanced recombination
sustains point defect fluxes to those regions where recombination is enhanced.
Even if $K_{iv}$, $K_i$, and $K_v$ do not depend on $C_s$, a local solute overconcentration
may trap point defects: the recombination rate, $K_{iv} C_i C_v$, will be locally
enhanced. Hence, it is no surprise that fluxes of point defects may be triggered
under irradiation by solute concentration heterogeneity. Provided the latter defect
fluxes couple to the solute flux with the proper sign, they will trigger an uphill
diffusion of the solute, which may overcome the thermodynamic force that
tends to erase solute concentration heterogeneities in the undersaturated solid solution.
Such an effect is therefore the opposite of ballistic effects: it may promote
solute segregation, even precipitation, in an undersaturated solid solution.

The above effects can be modeled using a linear stability analysis in the following
way. Consider a solid solution with a uniform solute concentration, $\bar{C}_s$; under
irradiation, according to Eq. (28) the system accepts a uniform defect concentration,
$\bar{C}_i$, $\bar{C}_v$, with zero flux, since all concentrations are uniform in space
($\nabla \bar{C}_\alpha = 0$, $\alpha = i, v, s$). Now we assume that the uniform solute concentration is
slightly perturbed by a concentration oscillation with amplitude $c_s^0$ and wave vector
$q$, giving Eq. (35).

$$C_s(x, t) = \bar{C}_s + c_s; \quad c_s = c_s^0(t) \cos qx \tag{35}$$

The concentration gradient will initiate defect and solute fluxes and changes in
concentrations at a rate given by Eq. (28). Thus we write Eq. (36).

$$C_d = \bar{C}_d + c_d; \quad d = i, v \tag{36}$$

We note that $C_i$, $C_v$ vary rapidly compared to $C_s$, since the solute mobility, unlike
the defect mobility, scales with the point defect concentrations, which are much
less than unity. We can therefore assume that the fast-relaxing defect concentration
fields, $C_i$, $C_v$, adjust instantaneously to the slowly varying solute concentration
field, $C_s$, a technique called adiabatic elimination of fast variables. As a consequence,
we split the concentration space $(i, v, s)$ into two subspaces $(i, v)$ and $s$,
each governed by its own diffusion equations. The defect fluxes are given by Eq.
(37), where the bold letters stand for column vectors (lower case, e.g., $\mathbf{c}_d$) and
rank two tensors (upper case, e.g., $\mathbf{D}_d$).


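$$\mathbf{j}_d = -\mathbf{D}_d \nabla \mathbf{c}_d - \mathbf{d}_{ds} \nabla c_s$$

$$\mathbf{j}_d = \begin{pmatrix} J_i \\ J_v \end{pmatrix}; \quad \mathbf{D}_d = \begin{pmatrix} D_{ii} & D_{iv} \\ D_{vi} & D_{vv} \end{pmatrix}; \quad \mathbf{c}_d = \begin{pmatrix} c_i \\ c_v \end{pmatrix}; \quad \mathbf{d}_{ds} = \begin{pmatrix} D_{is} \\ D_{vs} \end{pmatrix} \tag{37}$$

Equation (38) gives the solute flux.

$$J_s = -\mathbf{d}_{sd} \cdot \nabla \mathbf{c}_d - D_{ss} \nabla c_s; \quad \mathbf{d}_{sd} = (D_{si} \;\; D_{sv}) \tag{38}$$

The elimination of fast variables consists in computing the stationary solution
of Eq. (37) with $c_s$ assumed to be independent of time. We restrict it to a one-dimensional
problem, so that $\nabla \cdot = \partial_x$; assuming $\partial_t \mathbf{c}_d = 0$, Eq. (28) gives Eq. (39).

$$\partial_x J_d = K_d - K_{iv} C_i C_v - K_{df} C_d; \quad d = i, v \tag{39}$$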


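Since $c_s$ is a small quantity, a first-order expansion around the uniform stationary
state yields Eq. (40).

$$\begin{aligned}
\partial_x \mathbf{j}_d &= \boldsymbol{\delta}\, c_s - \mathbf{K}^2 \mathbf{c}_d \\
\boldsymbol{\delta} &= \begin{pmatrix} \delta_i \\ \delta_v \end{pmatrix}; \quad \delta_d = \frac{\partial}{\partial C_s} (K_d - K_{iv} C_i C_v - K_{df} C_d); \quad d = i, v \\
\mathbf{K}^2 &= \begin{pmatrix} K_{iv} C_v + K_{if} & K_{iv} C_i \\ K_{iv} C_v & K_{iv} C_i + K_{vf} \end{pmatrix}
\end{aligned} \tag{40}$$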


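After some manipulation, we obtain Eq. (41), where $\mathbf{I}$ is the identity matrix, and
all terms in $\boldsymbol{\sigma}$, $\boldsymbol{\delta}$, $\mathbf{K}^2$ are evaluated for the stationary uniform state, i.e., with all the
concentrations given their uniform stationary value, $\bar{C}_\alpha$, $\alpha = i, v, s$.

$$\begin{aligned}
J_s &= -\nabla c_s \left\{ (D_{ss} - \mathbf{d}_{sd} \cdot \boldsymbol{\sigma}) + \mathbf{d}_{sd} \cdot \left( \mathbf{D}_d^{-1} [\mathbf{K}^2 \mathbf{D}_d^{-1} + q^2 \mathbf{I}]^{-1} \right) (\boldsymbol{\delta} + \mathbf{K}^2 \boldsymbol{\sigma}) \right\} \\
\boldsymbol{\sigma} &= \mathbf{D}_d^{-1} \mathbf{d}_{ds} = \frac{1}{D_{vv} D_{ii} - D_{iv} D_{vi}} \begin{pmatrix} D_{is} D_{vv} - D_{vs} D_{iv} \\ D_{vs} D_{ii} - D_{is} D_{vi} \end{pmatrix}
\end{aligned} \tag{41}$$

The first term in the bracket { } in Eq. (41) represents the interdiffusion coefficient
in the absence of irradiation, assuming point defects are conservative. To
make the connection with the classical treatment of diffusion, and with Eq. (31),
we write the latter interdiffusion flux as Eq. (42).

$$J_s^{th} = -\nabla c_s (D_{ss} - \mathbf{d}_{sd} \cdot \boldsymbol{\sigma}) = -L' f_{ss} \nabla c_s; \quad f_{ss} = \frac{\delta^2 F}{\delta C_s^2} \tag{42}$$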

Equation (41) can be cast in the same form as Eq. (42), with $f_{ss}$ substituted by $f_{ss}^{eff}$
[Eq. (43)].

$$f_{ss}^{eff} = f_{ss} + \frac{\mathbf{d}_{sd}}{L'} \cdot \left( \mathbf{D}_d^{-1} [\mathbf{K}^2 \mathbf{D}_d^{-1} + q^2 \mathbf{I}]^{-1} \right) (\boldsymbol{\delta} + \mathbf{K}^2 \boldsymbol{\sigma}) \tag{43}$$

Double integration of Eq. (43) with respect to $C_s$ yields an effective free energy
functional, $F^{eff}$, for the solid solution under irradiation. $F^{eff}$ is a function of the solute
concentration, of the temperature, and also of the irradiation conditions (which
determine the defect concentrations entering the $D_{\alpha\beta}$), of the microstructure
(sink density, which enters $K_{df}$ in $\mathbf{K}^2$), and of the wavenumber, $q$, of the composition
modulation. The fact that the wavenumber enters the irradiation-induced
correction to the free energy implies that the interfacial effective free energy is
distinct from the thermodynamic one.
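The adiabatic elimination can be checked numerically. The Python sketch below assumes, as a reading of Eqs. (37) to (41), that the defect fluxes obey j_d = -D_d grad(c_d) - d_ds grad(c_s), that the linearized defect balance reads d(j_d)/dx = delta c_s - K2 c_d, and that all perturbations vary as cos(qx). Under these assumptions the coefficient multiplying -grad(c_s) in the solute flux can be computed either in the factorized form of Eq. (41) or by direct elimination of the defect amplitudes, and the two must agree. All matrix entries are hypothetical numbers in arbitrary units, and the helper names are illustrative only.

```python
def inv2(m):
    """Inverse of a 2x2 matrix given as nested lists."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]


def matmul2(m, n):
    """Product of two 2x2 matrices."""
    return [[sum(m[i][k] * n[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


def matvec2(m, v):
    """Product of a 2x2 matrix with a 2-vector."""
    return [m[i][0] * v[0] + m[i][1] * v[1] for i in range(2)]


def dot2(u, v):
    return u[0] * v[0] + u[1] * v[1]


def factorized_coefficient(q, D_d, d_ds, d_sd, D_ss, K2, delta):
    """Bracket of Eq. (41): (D_ss - d_sd.sigma) plus the irradiation correction."""
    D_inv = inv2(D_d)
    sigma = matvec2(D_inv, d_ds)
    A = matmul2(K2, D_inv)          # K2 D_d^-1
    A[0][0] += q * q                # ... + q^2 I
    A[1][1] += q * q
    K2_sigma = matvec2(K2, sigma)
    rhs = [delta[0] + K2_sigma[0], delta[1] + K2_sigma[1]]
    correction = dot2(d_sd, matvec2(matmul2(D_inv, inv2(A)), rhs))
    return (D_ss - dot2(d_sd, sigma)) + correction


def direct_coefficient(q, D_d, d_ds, d_sd, D_ss, K2, delta):
    """Same coefficient from c_d = (q^2 D_d + K2)^-1 (delta - q^2 d_ds) c_s."""
    M = [[q * q * D_d[i][j] + K2[i][j] for j in range(2)] for i in range(2)]
    rhs = [delta[i] - q * q * d_ds[i] for i in range(2)]
    return D_ss + dot2(d_sd, matvec2(inv2(M), rhs))


# Hypothetical coefficients (arbitrary units).
COEFFS = dict(D_d=[[2.0, 0.1], [0.05, 1.0]], d_ds=[0.3, -0.2], d_sd=[0.4, 0.1],
              D_ss=0.5, K2=[[1.5, 0.2], [0.3, 0.8]], delta=[0.05, -0.02])

for q in (0.1, 0.7, 5.0):
    print(q, factorized_coefficient(q, **COEFFS), direct_coefficient(q, **COEFFS))
```

At large wavenumber the irradiation correction vanishes and the coefficient tends to D_ss - d_sd.sigma, the interdiffusion coefficient for conservative defects; at small wavenumber the correction depends on the sink and recombination terms collected in K2, which is why the wavenumber enters the effective free energy.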

Equation (43) allows for a simple discussion of the effect of irradiation on the
stability of the solid solution with respect to unmixing. Setting $f_{ss}^{eff}$ to zero defines
the spinodal surface in the control parameter space: temperature, composition,
and irradiation flux (which scales $\bar{C}_d$, $d = i, v$, via the defect production rates,
$K_d$, which are proportional to the irradiation flux). Depending on the signs of the
cross-diffusion coefficients, which themselves depend on the signs of the solute-defect
interactions [via $\mathbf{d}_{sd}$ in Eq. (42)], and of the solute concentration dependence
of defect mobilities and defect production rate (the $\boldsymbol{\delta}$ matrix), the spinodal
line may be shifted up or down by irradiation: an upward (or downward, respectively)
shift implies irradiation-induced precipitation (or re-solution, respectively).
A quantitative evaluation of the foregoing effects is intricate but has been done 
with a simple diffusion model [170]. 

In summary, the irradiation-induced inverse Kirkendall effect [the second
term on the RHS in Eq. (41)] may either stabilize or destabilize a solid solution
with respect to unmixing. The possibility for irradiation-induced formation of
large-amplitude concentration fluctuations in undersaturated solid solutions
is clearly demonstrated by this model. Such fluctuations have been reported, after
irradiation, in austenitic steels close to the Invar composition [171] (see Chapter
14b).


Ballistic mixing and inverse Kirkendall effect in competition

Provided we only retain
the solute diffusion coefficient in the ballistic diffusion matrix [$D^{bal}$ in Eq.
(29)], it is a simple matter to obtain Eq. (44).

$$f_{ss}^{eff} = f_{ss} + \frac{D^{bal}}{L'} + \frac{\mathbf{d}_{sd}}{L'} \cdot \left( \mathbf{D}_d^{-1} [\mathbf{K}^2 \mathbf{D}_d^{-1} + q^2 \mathbf{I}]^{-1} \right) (\boldsymbol{\delta} + \mathbf{K}^2 \boldsymbol{\sigma}) \tag{44}$$

The physical meaning of the three terms on the RHS of Eq. (44) is as follows.
• The first term defines the instability limit of the solid
solution in the absence of forcing: $f_{ss} = 0$ is the equation for
the spinodal line.
• The second term (always positive) is the contribution of
random ballistic mixing to the stabilization of the solid
solution: indeed, adding a positive contribution to $f_{ss}$ shifts
the inflexion point of $F$ (i.e., the spinodal limit) to higher
concentrations, or lower temperatures; according to Eq. (34),
Eq. (45) applies.

$$T_{spin}^{irr} = \frac{T_{spin}}{1 + D^{bal}/D'_{ch}} \tag{45}$$

• The third term results from the point defect fluxes sustained
by the external forcing; this term does not reduce to the
contribution of the point defects to the internal energy and
entropy of the alloy (an effect taken into account by $f_{ss}$). On
top of the point defect supersaturation, the external forcing
sustains defect fluxes, because defects may be eliminated
away from their site of generation. Such is the case whenever
the defect properties (formation energy, migration energy,
production yield) vary with the solute concentration (as
reflected in Eq. (44) by a nonvanishing $\mathbf{K}^2$ matrix). The
forcing-induced defect fluxes ($\mathbf{D}_d$ matrix) may drive solute up-
or downhill diffusion ($\mathbf{d}_{sd}$), which destabilizes or stabilizes,
respectively, the uniform solid solution.


The above formalism was first proposed in the mid-1990s, and many improve- 
ments have been made, e.g., in the formulation of ballistic mixing, or in the de- 
velopment of modeling techniques. But this general formalism deserves more 
work, in particular to address multicomponent alloys, precipitation of ordered 
compounds, fluctuations in solid solutions under irradiation, etc. 


8.2.3.5 Nucleation Theory under Irradiation [9, 172] 
Section 8.2.3.4 dealt with the effects of irradiation on the spinodal line, but did 
not address the question of the solvus line under irradiation. A large body of liter- 
ature is available on the nucleation of defect clusters under irradiation; since the 
defect concentration under irradiation is very small (but the defect supersatura- 
tion is large), such phenomena are best described by cluster dynamics [173, 
174]. This topic is beyond the scope of this chapter. 

Here we focus on solid solutions and their solvus line under irradiation. Setting
aside defect agglomeration, we ask whether the nonconservative nature
of irradiation-induced point defects may alter the solubility limit, much in the
same way that it alters the instability limit (spinodal line) as discussed in Section
8.2.3.4 (see footnote 7). We restrict our discussion, as before, to irradiation-induced coherent
unmixing, as was observed, e.g., in dilute solid solutions of zinc in aluminum under
high-energy electron irradiation (Cauvin et al. [9]; see footnote 8). Since we restrict our
consideration to dilute solutions, cluster dynamics and nucleation theory stricto
sensu as defined in Sections 8.1.3 and 8.1.4 above can be used equivalently. We
represent the solid solution as a gas of solute clusters, embedded in an extremely
dilute gas of point defects (vacancies and self-interstitials). Point defects may be
trapped at the cluster, because of the solute-defect binding energy. For simplicity,
we assume that a vacancy and a self-interstitial recombine instantaneously when
trapped on the same solute cluster: as a consequence, a cluster made of $n$ solute
atoms contains a small number of defects ($\pm d$, + for interstitials, - for vacancies).
The reaction paths by which an $(n, \pm d)$ cluster may evolve are sketched in
Fig. 8.8: on top of reversible paths, which would prevail if the point defect were
a conservative species (the broken double arrows in Fig. 8.8), irreversible reactions
(the solid single arrows in Fig. 8.8) drive the cluster to higher solute contents.
As an example, whenever a solute-vacancy pair reaches a solute cluster
where a dumbbell is trapped, the vacancy-interstitial recombination deposits
one solute at the cluster, and at the same time destroys the defects which would
allow the extra solute to escape from the cluster.

Such reaction paths can be included in the master equation for the cluster size
distribution (Eq. (6), Section 8.1.3). Without going into detail, two new parameters
play a key role: the respective trapping probability ($p_+/p_-$) of defects at the
solute atoms, and their respective arrival rates at solute clusters ($\beta_+/\beta_-$), where
the cluster size dependence of the latter factorizes out. With some simplifying assumptions,
it is found that the solubility under irradiation is reduced by a factor
$B$ [Eq. (46)].

$$C^{irr} = B\, C^{therm}; \quad B = \begin{cases} (1 + \beta_-/\beta_+)^{-1} & p_+ > p_- \\ (1 + \beta_+/\beta_-)^{-1} & p_+ < p_- \\ 1/2 & p_+ = p_- \end{cases} \tag{46}$$





Note that $\beta_+/\beta_- = D_s^i/D_s^v$, the ratio of the solute diffusion coefficients via the
interstitial and the vacancy mechanism, respectively. As shown by Eq. (46), the reduction
of solubility is more pronounced if solute atoms diffuse to the cluster predominantly
via the defect which is less trapped at the cluster. Under such a condition,
most solute arrival at the cluster is accompanied by the destruction of the
defect which promoted solute migration: the solute atom is temporarily trapped
at the cluster. Such a simple model explains, at least qualitatively, the known
cases of irradiation-induced coherent precipitation, as well as the irradiation flux
dependence of the solvus, in Al(Zn) solid solutions under high-energy electron
irradiation [9].

Fig. 8.8 A priori possible evolutions of a cluster in the (solute × defect) space: the
broken double arrows show (some of) the reversible condensation-evaporation
processes which operate in the absence of irradiation in a solid solution where solute
diffusion proceeds either by vacancy (lower part of the chart) or by interstitial (upper
part). The solid single arrows show (some of the) new processes which are active under
irradiation, resulting from the irreversible vacancy-interstitial recombination. Among
the latter, some drive the cluster to a larger solute content, n, and a smaller number of
trapped defects, d, thus delaying solute re-emission to the surrounding (see the text).

7) Several attempts have been made to assess the contribution of the point defect
supersaturation to the free energy function of solid solutions. At least for metals, the
shift of the solvus curve resulting from such a contribution is negligible compared to what
is observed experimentally.

8) Incoherent precipitation under irradiation has been addressed by Russell et al. [172]:
assuming that the precipitate is oversized, vacancy elimination at the interface relaxes
the strain field built up by the growth of the precipitate. A “linked flux” analysis
of the growth or decay rate of a precipitate made of n solute atoms and a net number
of vacancies, v, is performed and leads to an irradiation-dependent critical nucleus
size.


Rate theory of solute-defect clustering under irradiation

The master equation for
the time evolution of the size distribution of solute clusters [Eq. (6) of Section
8.1.3] can be extended to clusters made of solute and vacancies, or of solute and
interstitials, in the limit of very dilute systems. A full numerical treatment of
such an equation is not yet available, but work is in progress [175].


8.2.4 Self-Organization in Driven Alloys: Role of Length Scales of the External Forcing


As discussed in Section 8.2.1, the external forcing may introduce new length 
scales. Displacement cascades produced by large recoil energies in an irradiated 
alloy can be characterized by two lengths: the spatial extension of the cascade, $L$,
and the average atomic relocation distance within a cascade, $R$. In solids subjected
to plastic deformation, individual shearing events are characterized by the
area swept by moving Burgers dislocations in crystals, or by Somigliana disloca- 
tions in amorphous solids [176, 177]. We show in Sections 8.2.4.1-8.2.4.3 that, 
whenever there is a competition between thermal and forced dynamics, self- 
organization of the microstructure may take place provided that the different dy- 
namics operate at different length scales. 


8.2.4.1 Compositional Patterning under Irradiation 

Materials under irradiation are dissipative systems and, as such, they are capable 
of undergoing self-organization. In 1972 Nelson, Hudson, and Mazey reported on 
the evolution of precipitate size under irradiation [178]. A Ni-Al alloy containing 
L1$_2$-type Ni$_3$Al precipitates embedded in a Ni-rich matrix was irradiated with
Ni ions at a flux producing $\approx 10^{-2}$ dpa s$^{-1}$, at various temperatures. Room-temperature
irradiation led to the disordering and dissolution of the precipitates,
owing to the predominance of the ballistic mixing over thermally activated diffu- 
sion at low homologous temperature. Irradiation at 550 °C, however, led to the 
stabilization of an unexpected microstructure composed of nanoscale Al-rich 
ordered precipitates, apparently randomly distributed in the disordered Al-lean 
matrix. This self-organized microstructure, with nanoscale precipitates, was also 
stabilized by irradiation under the same conditions but starting from a quenched 
solid solution instead of a two-phase microstructure. Very similar observations 
were recently reported by Schmitz and coworkers [179], who irradiated a two-phase
Ni-12 at.% Al alloy with 300 keV Ni ions, producing a displacement rate
of $\approx 10^{-3}$ dpa s$^{-1}$, at 550 °C. These authors, taking advantage of improvements
in electron microscopy, measured the evolution of the average ordered precipitate 
diameter as a function of the irradiation dose. For the above conditions, they 
reported that this diameter, 5 nm initially, decreased and reached a steady-state 
value of 2 nm. These recent observations seem to indicate that the nanoscale 
microstructure is not simply the result of irradiation-induced segregation and 
precipitation on microstructural defects such as dislocations. An important prac- 
tical and fundamental problem is thus to determine the origin and the character- 
istics of the self-organization of precipitates in irradiated alloys. 

In order to rationalize their observations, Nelson et al. proposed a model in 
which irradiation-induced dissolution competes with the thermally activated 
growth of the precipitates, assuming that the precipitate number density remains 
constant [178]. These authors made the ad hoc assumption that the irradiation- 
induced forced chemical mixing contributes only to the transport of solute atoms 
from the precipitates to the matrix, neglecting the contribution of the forced mix- 
ing to the transport of solute atoms from the matrix to the precipitates. Under 
such an assumption, the dissolution rate of a precipitate is proportional to its in- 
terfacial area, i.e., its radius squared, instead of being proportional to its radius 
when both forward and backward forced solute fluxes are included and treated 
as a diffusion process. Neglecting the backward forced solute flux, an assumption 
that has no particular justification, thus leads to a pathological model. Indeed, the
dissolution and thermal growth components in Nelson’s model have different
precipitate-radius dependence, and the model predicts that precipitates will al- 
ways reach a stationary size under irradiation. Furthermore, there is no limit to 
this steady-state size. We will show below that the approximation made by Nelson 
and coworkers is in fact equivalent to assuming that the relocation distance of 
ballistic mixing is arbitrarily large. This is in contradiction to all the MD simula- 
tion results, which demonstrate that most of the forced mixing takes place be- 
tween nearest-neighbor (NN) atoms [39]. If we assume for the moment that the 
forced mixing is restricted to first-NN atoms, it is then equivalent to a forced dif- 
fusion. The dissolution rate and the growth rate would now have the same linear 
dependence on the precipitate radius. Such a model would predict that precipi- 
tates would either grow to a macroscopic size at elevated temperature or low 
forced mixing rate, or shrink and dissolve at low temperature or high forced mix- 
ing rate. This model, which is based on a physical picture very similar to the one 
used by Martin in his derivation of the effective temperature criterion [44], does 
not predict that irradiation can lead to the dynamical stabilization of nanoscale 
precipitates. 
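
The difference between the two radius dependences can be illustrated with a minimal numerical sketch. It assumes, purely for illustration, that the precipitate volume changes at a rate G r - K r^2 in a Nelson-type model (growth linear in r, dissolution proportional to the interfacial area) and at a rate (G - K) r when both contributions are linear in r. The coefficients G and K and all function names are hypothetical and are not taken from refs. [178] or [44].

```python
import math


def nelson_like_rate(r, growth, dissolution):
    """dV/dt with growth ~ r and dissolution ~ r^2 (hypothetical coefficients)."""
    return growth * r - dissolution * r ** 2


def forced_diffusion_rate(r, growth, dissolution):
    """dV/dt when growth and forced dissolution are both linear in r."""
    return (growth - dissolution) * r


def integrate_radius(rate, r0, growth, dissolution, dt=0.01, steps=200_000):
    """Explicit Euler integration of dr/dt = (dV/dt) / (4 pi r^2).

    Returns 0.0 as soon as the precipitate has effectively dissolved.
    """
    r = r0
    for _ in range(steps):
        r += dt * rate(r, growth, dissolution) / (4.0 * math.pi * r * r)
        if r <= 1e-3:
            return 0.0
    return r


if __name__ == "__main__":
    # Nelson-type model: both starting radii relax toward r = G / K.
    for r0 in (0.5, 5.0):
        print("Nelson-type, r0 =", r0, "->", integrate_radius(nelson_like_rate, r0, 1.0, 0.5))
    # Forced-diffusion model: dissolution wins (G < K) or growth wins (G > K).
    print("forced diffusion, G < K ->", integrate_radius(forced_diffusion_rate, 1.0, 1.0, 2.0))
    print("forced diffusion, G > K ->", integrate_radius(forced_diffusion_rate, 1.0, 2.0, 1.0))
```

In this sketch the Nelson-type balance always has a stable fixed point at r = G/K, which can be made arbitrarily large by lowering K, whereas the forced-diffusion balance has no finite steady-state radius unless G = K exactly.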

A key element in the resolution of this paradox is recognition that, while most 
of the forced mixing takes place between nearest-neighbor atoms, some forced 
mixing extends beyond that distance. MD simulations carried out in a Ag-Cu 
solid solution indicate that the histogram of relocation distances can be well
approximated by an exponential decay. The decay length, R, increases from ≈ 1.4 Å
for light-ion irradiation (He) to ≈ 3.1 Å for heavy-ion irradiation (Ne, Ar, Kr) (see
Fig. 8.7). As a result of this finite-range mixing, at scales of the order of or smaller
than the mixing distance the process cannot be reduced to a forced diffusion. The
challenge is then to study the evolution of the composition field of an alloy under 
irradiation when the forced mixing takes place over a finite characteristic distance 
R. We present below several continuum models, as well as KMC simulations, 
demonstrating that R plays a critical role in triggering the self-organization of 
the composition field, as well as in limiting the maximum length scale of this 
self-organization. While the results discussed below draw largely from Enrique’s 
PhD Thesis [180], early simulation results of Haider [181] and modeling results 
of Vaks and Kamyshenko [148] and Pavlovitch [182] emphasized the importance 
of the finite-range character of the forced mixing on the possible self-organization 
of the composition field. 

We consider a binary alloy $\mathrm{A}_{1-c}\mathrm{B}_c$ with a miscibility gap at low temperature,
at a temperature and composition such that, at equilibrium, the alloy
would be decomposed into two phases, an A-rich and a B-rich solid solution.
KMC simulations reveal that, for short relocation distances (typically, when the 
forced mixing is restricted to near neighbors), the alloy reaches a steady state 
that is a solid solution when the relative rate of ballistic to thermal jumps is large; 
at lower relative forcing rates, the steady state is a macroscopically decomposed 
structure, with some solubility enhancement due to the ballistic mixing. For sufficiently
large relocation distances, however, a third steady state is found at intermediate
relative forcing intensity: it corresponds to a decomposed state, albeit at a
finite, mesoscopic, scale (see Fig. 8.9). The characteristic scale of these compositional
patterns increases as the relative forcing intensity decreases. However, a
maximum size for these compositional patterns exists, and a further reduction
of the forcing intensity then leads to the stabilization of the macroscopically decomposed
steady state.

Fig. 8.9 Dynamical equilibrium phase diagram for an A₅₀B₅₀ alloy that undergoes
phase separation at thermodynamic equilibrium. $\gamma = \Gamma_b/M$ is a dimensionless
forcing intensity, and R is the average atomic relocation distance. These variables are
expressed in terms of the coefficients A, B, C, entering the free energy functional (see Eq.
(48), with $f = -A\psi^2 + B\psi^4$). The transition lines are calculated from the effective free
energy given in Eq. (54). Insets: typical microstructures simulated by KMC simulations,
observed at steady state here in a (111) plane (from ref. [184]).

In order to better understand the origin of this compositional patterning reac- 
tion, and the conditions required for it, a continuum kinetic model was proposed 
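[149, 183, 184]. The temporal evolution of the local deviation from the nominal
composition, $\psi(x) = c(x) - \bar{c}$, is composed of two terms [Eq. (47)].

$$\frac{\partial \psi}{\partial t} = M \nabla^2 \left( \frac{\delta\, \Omega F}{\delta \psi} \right) - \Gamma_b \left( \psi - \langle \psi \rangle_R \right) \qquad (47)$$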


The first term corresponds to the thermally activated dynamics for the conserved
order parameter $\psi$. M is the thermal atomic mobility and is related to the
chemical diffusion coefficient through $M = c(1-c)D/k_BT$, $\Omega$ stands for the
atomic volume, and F is the free energy of the alloy, which we express using
the Cahn-Hilliard form, Eq. (48), where f is the free energy density of a homogeneous
alloy and C the gradient energy term.

$$F = \int \left[ f(\psi) + C |\nabla \psi|^2 \right] dV \qquad (48)$$


The second term in Eq. (47) expresses the forced mixing induced by ballistic
jumps, which occur at a frequency $\Gamma_b$, with relocation distances distributed
according to the normalized weight function $w_R$. Owing to conservation of the
chemical species, the ballistic rate of change of the order parameter is proportional
to the difference between the local composition and the nonlocal average
denoted by the brackets and defined by Eq. (49).

$$\langle \psi \rangle_R = \int w_R(x - x')\, \psi(x')\, dx' \qquad (49)$$


When the average relocation distance is of the order of the length used in the 
coarse graining to define the continuum variable c(x), the contribution of the bal- 
listic mixing reduces to Eq. (50). 


$$\left( \frac{\partial \psi}{\partial t} \right)_b = \Gamma_b \nabla^2 \psi \qquad (50)$$


In this specific case, one recovers the model introduced by Martin in 1984 [44] to 
discuss the effect of ballistic jumps on phase stability in alloys under irradiation. 
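
The reduction of the nonlocal ballistic term to a diffusion-like term can be checked numerically for a single Fourier mode. The sketch below assumes a one-dimensional, normalized exponential weight function w_R(x) = exp(-|x|/R) / (2R) and a test profile psi(x) = cos(kx); the function names, the integration window, and the numerical values are illustrative choices, not part of the original model description.

```python
import math


def kernel(x, R):
    """Normalized 1D exponential relocation kernel (assumed form)."""
    return math.exp(-abs(x) / R) / (2.0 * R)


def nonlocal_average_of_cosine(k, R, x=0.0, half_width=40.0, n=40_000):
    """Midpoint-rule estimate of <psi>_R(x) for psi(x) = cos(k x).

    The integration window is [x - half_width*R, x + half_width*R]; with an
    even n the kernel cusp at x' = x falls on a cell edge.
    """
    h = 2.0 * half_width * R / n
    start = x - half_width * R
    total = 0.0
    for i in range(n):
        xp = start + (i + 0.5) * h
        total += kernel(x - xp, R) * math.cos(k * xp)
    return total * h


def mixing_factor(k, R):
    """(psi - <psi>_R) / psi evaluated at x = 0 for the cosine mode."""
    return 1.0 - nonlocal_average_of_cosine(k, R)


if __name__ == "__main__":
    R = 3.1  # decay length in angstroms, heavy-ion value quoted in the text
    for kR in (0.05, 0.5, 2.0):
        k = kR / R
        # For kR << 1 the factor approaches (kR)^2, i.e. a Laplacian acting on psi.
        print(kR, mixing_factor(k, R), kR ** 2)
```

For kR much smaller than 1 the factor is close to (kR)^2, which is the Fourier signature of a diffusion operator; for kR of order 1 or larger it saturates toward 1, so the ballistic term can no longer be written as a forced diffusion.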


Linear stability analysis It is often interesting from a practical and fundamental 
viewpoint to start with the control parameters which stabilize a homogeneous 
steady state, and then to carry out a linear stability analysis to determine the do- 
main of local stability of this homogeneous solution as the control parameters are 
varied [185]. In ref. [183], such a linear stability analysis is performed for a one- 
dimensional equiatomic system, with a specific choice for the distribution of bal- 
listic jumps [Eq. (51)]. 


$$w_R(x) = \frac{1}{2R} \exp\left( -\frac{|x|}{R} \right) \qquad (51)$$


Distributions of ballistic jumps for Ag-Cu alloys, measured by MD simulations,
are indeed fitted well using such a decay function. Furthermore, such a function
lends itself very well to analytical calculations. For small perturbations of the
form $e^{\omega t + ikx}$ around a homogeneous composition profile $c(x) = \bar{c} = 0.5$, i.e.,
$\psi(x) = 0$ everywhere, we can linearize Eq. (47). Transforming to Fourier space,
denoted by the carets, we obtain Eq. (52), where the amplification factor $\omega(k)$ is related
to the wave vector of the perturbation through the dispersion equation Eq. (53);
$f''$ is the second derivative, with respect to composition, of the free energy density
in the homogeneous alloy.

$$\frac{\partial \hat{\psi}(k)}{\partial t} = \omega(k)\, \hat{\psi}(k) \qquad (52)$$

$$\frac{\omega(k)}{M} = -\left( f'' + 2Ck^2 \right) k^2 - \gamma\, \frac{R^2 k^2}{1 + R^2 k^2} \qquad (53)$$

Fig. 8.10 Linear stability analysis of the kinetic model given by Eqs. (47)
and (48). The solid solution is unstable with respect to fluctuations of
wavenumber k when the amplification factor $\omega(k)$, normalized by the
thermal mobility M, is positive (from ref. [184]).


Within a regular solution model (see Chapter 10), $f'' = u'' + k_BT/(\bar{c}(1-\bar{c}))$,
where u is the internal energy contribution to the free energy density f. A dimensionless
forcing parameter $\gamma = \Gamma_b/M$ has been introduced in Eq. (53).

Figure 8.10 displays a set of dispersion curves for various $\gamma$ values, assuming
that the irradiation temperature is such that $f''(\bar{c}) < 0$, i.e., inside the equilibrium
miscibility gap. Figure 8.10 also illustrates the main results of this linear
stability analysis:

1. For large forcing intensity ($\gamma > \gamma_2$), the homogeneous steady
state is always locally stable since $\omega < 0$ for all wavenumbers k.

2. For forcing intensity satisfying $\gamma_1 < \gamma < \gamma_2$, the homogeneous
steady state is unstable for wave vectors such that
$k_1 < k < k_2$.

3. For low forcing ($\gamma < \gamma_1$), the homogeneous steady state is
unstable in a well-defined range of wave vectors ($0 < k < k_2$).


Cases 1 and 3 resemble what would be found for an alloy above and below its spi- 
nodal instability in the absence of irradiation. The remarkable and new regime is 
found in case 2. The fact that the amplification factor remains negative in the 
limit of long wavelength ($k \to 0$) suggests that irradiation may stabilize a microstructure
which has decomposed, albeit not at the macroscopic scale. It is found
that there is a critical value for the relocation distance R, $R_c$, below which this second
case ceases to exist, and the alloy goes directly from case 1 to case 3.
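
The three regimes can be explored with a short numerical sketch. It assumes that the dispersion relation has the form omega(k)/M = -(f'' + 2Ck^2) k^2 - gamma R^2 k^2 / (1 + R^2 k^2), which is what linearizing the kinetic model with an exponential relocation kernel gives. The closed-form thresholds in `thresholds` follow from that assumed form by elementary algebra and are offered as a consistency check, not as expressions quoted from refs. [183, 184]; all function names and parameter values are hypothetical.

```python
import math


def growth_rate(k, f2, C, R, gamma):
    """omega(k)/M for the assumed dispersion relation (f2 stands for f'')."""
    q = k * k
    return -q * (f2 + 2.0 * C * q) - gamma * q * R * R / (1.0 + q * R * R)


def thresholds(f2, C, R):
    """Return (gamma_1, gamma_2, R_c) for f2 < 0, derived from the assumed form."""
    a = -f2
    gamma_1 = a / R ** 2                               # sign change of omega at k -> 0
    gamma_2 = (a + 2.0 * C / R ** 2) ** 2 / (8.0 * C)  # maximum of omega/k^2 reaches zero
    R_c = math.sqrt(2.0 * C / a)                       # gamma_1 = gamma_2 at R = R_c
    return gamma_1, gamma_2, R_c


def classify(f2, C, R, gamma, n=5000):
    """Classify the homogeneous state by scanning omega(k) on a grid of k values."""
    k_max = 2.0 * math.sqrt(-f2 / (2.0 * C))  # omega < 0 beyond sqrt(-f2 / 2C)
    ks = [k_max * (i + 1) / n for i in range(n)]
    unstable = [k for k in ks if growth_rate(k, f2, C, R, gamma) > 0.0]
    if not unstable:
        return "homogeneous"   # case 1: stable for all k
    if unstable[0] == ks[0]:
        return "macroscopic"   # case 3: unstable down to k -> 0
    return "patterning"        # case 2: unstable only for k_1 < k < k_2


if __name__ == "__main__":
    f2, C = -2.0, 1.0  # e.g. Landau form f = -A psi^2 + B psi^4 with A = 1 at psi = 0
    for R in (2.0, 0.5):
        print("R =", R, "thresholds:", thresholds(f2, C, R))
        for gamma in (0.3, 0.6, 1.0, 7.0, 10.0):
            print("   gamma =", gamma, "->", classify(f2, C, R, gamma))
```

With these illustrative parameters the patterning window exists for R = 2 but not for R = 0.5, which lies below the critical relocation distance of the sketch.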


Global stability analysis neglecting fluctuations In order to determine the global 
stability of the various steady states encountered so far, one needs to go beyond a 
linear stability analysis. If one first neglects fluctuations, one can define an effective
free energy functional, $E\{\psi(x)\}$, for the steady states under irradiation by imposing
Eq. (54).

$$\frac{\partial \psi}{\partial t} = M \nabla^2 \left( \frac{\delta\, \Omega F}{\delta \psi} \right) - \Gamma_b \left( \psi - \langle \psi \rangle_R \right) = M \nabla^2 \left( \frac{\delta\, \Omega E}{\delta \psi} \right) \qquad (54)$$





For the above choice of distribution of relocation distances, $w_R$, the effective free
energy E can be calculated analytically, and it can then be used to build a dynamical
equilibrium phase diagram [183] if one assumes that the most stable steady
state corresponds to the deepest minimum in E. The resulting diagram is shown
in Fig. 8.9. In this particular case, the homogeneous free energy density was written
using a Landau form, $f = -A\psi^2 + B\psi^4$. The relocation distance R and the
forcing intensity $\gamma$ are then naturally expressed in $\sqrt{C/A}$ and $A^2/C$ units, respectively.
This analysis confirms the existence of the three steady states suggested by
the linear stability analysis, and it also confirms the existence of a threshold relo- 
cation distance R, for compositional patterning to be induced by ballistic jumps. 
These predictions are in excellent agreement with the results of KMC simulations 
(insets in Fig. 8.9). 

A clear weakness of the above analysis, however, is that we have neglected fluc- 
tuations, whereas it is well known that the respective stability of competing steady 
states is controlled by fluctuations [61, 27, 166]. As a result, the effective free energy
defined above displays several deficiencies [180, 186]. In the regime $R \to 0$,
in particular, one does not recover the free energy of the alloy at the effective temperature
introduced by Martin (see Section 8.34). We now extend the analysis so
as to include fluctuations, thus transforming Eq. (54) into a Langevin equation, 
closely following the procedure proposed in ref. [186]. 


Langevin equation with thermal dynamics only We briefly recall here the proce- 
dure introduced by Cook [187] to introduce thermal fluctuations. In the absence 
of irradiation, the deterministic Eq. (54) is transformed into a Langevin equation 
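by adding a random noise term $\xi_{th}(x,t)$, with suitably defined statistical properties
(to be specified below), in Eq. (55).

$$\left( \frac{\partial \psi}{\partial t} \right)_{th} = M \nabla^2 \left( \frac{\delta F}{\delta \psi} \right) + \nabla^2 \xi_{th} \qquad (55)$$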


For small fluctuations around $c(x) = \bar{c}$, we can again linearize Eq. (55) and transform
it to Fourier space [Eq. (56)].

$$\left( \frac{\partial \hat{\psi}}{\partial t} \right)_{th} = -\alpha_{th}(k)\, \hat{\psi} - k^2 \hat{\xi}_{th} \qquad (56)$$

The amplification factor, now defined with a negative sign for convenience, is
given by $\alpha_{th}(k) = M k^2 ( f''(\bar{c}) + 2Ck^2 )$. The noise term is assumed to be uncorrelated
in time, and has a spatial structure given by the quantity $Q_{th}(k)$ [Eq. (57)].

$$\langle k^4\, \hat{\xi}_{th}(k,t)\, \hat{\xi}_{th}^{*}(k,t') \rangle = Q_{th}(k)\, \delta(t - t') \qquad (57)$$

The statistical quantity $Q_{th}(k)$ can be obtained by calculating the equilibrium
structure factor $S_{eq}(k) = \langle |\hat{\psi}(k, t \to \infty)|^2 \rangle$. Following a standard procedure to
solve Eq. (56), Cook obtains Eq. (58).

$$S_{eq}(k) = \frac{Q_{th}(k)}{2 \alpha_{th}(k)} \qquad (58)$$

This structure factor can also be evaluated through a fluctuation-dissipation relationship,
Eq. (59) [187].

$$S_{eq}(k) = \frac{k_B T}{f''(\bar{c}) + 2Ck^2} \qquad (59)$$

Therefore the functional dependence of $Q_{th}(k)$ which reproduces the structure
factor properly is given by Eq. (60).

$$Q_{th}(k) = 2 k_B T M k^2 \qquad (60)$$
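
The relation between the noise strength and the steady-state structure factor, S = Q / (2 alpha), holds for any linear Langevin equation with white noise, and it can be checked by direct simulation of a single mode. The sketch below treats one real-valued mode obeying d(psi)/dt = -alpha psi + noise, with a noise correlation Q delta(t - t'), using an Euler-Maruyama scheme. The function name, the step size, and the ensemble size are illustrative assumptions.

```python
import math
import random


def simulate_structure_factor(alpha, Q, dt=0.01, n_steps=500, n_modes=2000, seed=0):
    """Ensemble estimate of <psi^2> at late times for d(psi)/dt = -alpha*psi + noise.

    The noise has correlation Q * delta(t - t'), so each Euler-Maruyama step adds
    a Gaussian increment of variance Q * dt.
    """
    rng = random.Random(seed)
    sigma = math.sqrt(Q * dt)
    total = 0.0
    for _ in range(n_modes):
        psi = 0.0
        for _ in range(n_steps):
            psi += -alpha * psi * dt + sigma * rng.gauss(0.0, 1.0)
        total += psi * psi
    return total / n_modes


if __name__ == "__main__":
    alpha, Q = 1.0, 2.0
    print("simulated :", simulate_structure_factor(alpha, Q))
    print("Q/(2alpha):", Q / (2.0 * alpha))
```

The simulated value agrees with Q / (2 alpha) to within the statistical error of the ensemble, which is the property used to fix the noise amplitude from a known structure factor.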





Langevin equation with ballistic dynamics only Following a similar strategy, we 
can write an equation including fluctuations for the irradiation-induced mixing 
dynamics [Eq. (61)]. 





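$$\left( \frac{\partial \psi}{\partial t} \right)_b = -\Gamma_b \left( \psi - \int w_R(x - x')\, \psi(x')\, dx' \right) + \nabla^2 \xi_b \qquad (61)$$

This equation is easily transformed into an ordinary differential equation in
Fourier space, i.e., Eq. (62), where the ballistic amplification factor is given by Eq.
(63).

$$\left( \frac{\partial \hat{\psi}}{\partial t} \right)_b = -\alpha_b(k)\, \hat{\psi} - k^2 \hat{\xi}_b \qquad (62)$$

$$\alpha_b(k) = \Gamma_b \left( 1 - \hat{w}_R(k) \right) \qquad (63)$$

Now the steady-state structure factor $S_b(k)$ which we expect from the ballistic dynamics
alone is that of a random solid solution, i.e., $S_b(k) = \bar{c}(1 - \bar{c})$. Applying
Cook’s solution to this case, we find that the spatial correlations of the ballistic
fluctuations must be given by Eq. (64).

$$Q_b(k) = 2 \bar{c}(1 - \bar{c})\, \alpha_b(k) \qquad (64)$$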


Langevin equation with two dynamics in parallel For the model with competing 
dynamics with fluctuations, we add the terms describing the deterministic evolution
of the composition, as in Eq. (54), as well as the noise terms. We assumed
earlier that the two deterministic dynamics act in parallel, i.e., independently.
Within this assumption, the corresponding fluctuations are uncorrelated and
therefore additive [Eq. (65)].

$$\frac{\partial \hat{\psi}}{\partial t} = -\left( \alpha_{th}(k) + \alpha_b(k) \right) \hat{\psi} - k^2 \left( \hat{\xi}_{th} + \hat{\xi}_b \right) \qquad (65)$$

Equation (65) yields Eq. (66) for the steady-state structure factor.

$$S(k) = \frac{Q_{th}(k) + Q_b(k)}{2 \left( \alpha_{th}(k) + \alpha_b(k) \right)} \qquad (66)$$


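After simple algebraic manipulations (dividing the numerator and the denominator of
Eq. (66) by $2Mk^2$, and using $M = c(1-c)D/k_BT$ and $f'' = u'' + k_BT/(c(1-c))$ at $c = \bar{c}$),
we obtain Eq. (67), which is the central result of this model.

$$S(k) = \frac{k_B T \left( 1 + \dfrac{\alpha_b}{D k^2} \right)}{2Ck^2 + u''(\bar{c}) + \dfrac{k_B T}{\bar{c}(1 - \bar{c})} \left( 1 + \dfrac{\alpha_b}{D k^2} \right)} \qquad (67)$$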





We will now consider specifically two cases of ballistic dynamics. 


Case 1: Short-range ballistic jumps In this case, as discussed above, the dynamics 
can be approximated by a diffusional process with a diffusion coefficient $D_b$, and
thus $\alpha_b(k) = D_b k^2$. Plugging this equation into Eq. (67) yields an equation that
admits a straightforward interpretation in terms of Martin’s effective temperature
criterion: the effective temperature $T_{eff} = T(1 + D_b/D)$ describes both the effective
driving force and the effective fluctuations of the alloy under irradiation at
steady state. We should also note that in Martin’s derivation of the effective temperature
criterion, the ratio $D_b/D$ is assumed to be independent of composition.
It thus implies that the fluctuations also are independent of composition. There- 
fore, for short-range ballistic jumps, Martin’s nonequilibrium potential has the 
desired properties of a nonequilibrium potential, and the common tangent rule 
is a valid construction to determine the globally stable steady states. 
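
The effective temperature interpretation for short-range jumps can be verified numerically. The sketch below assumes that the steady-state structure factor has the form S(k) = kT (1 + a_b/(D k^2)) / (2Ck^2 + u'' + kT (1 + a_b/(D k^2)) / (c(1-c))), with a_b(k) the ballistic amplification factor, and compares it with the equilibrium structure factor evaluated at T(1 + D_b/D). Function names and parameter values are hypothetical.

```python
def driven_structure_factor(k, kT, u2, C, c, D, a_b):
    """Steady-state S(k) under irradiation (assumed form); a_b is a callable a_b(k)."""
    rescale = 1.0 + a_b(k) / (D * k * k)
    return kT * rescale / (2.0 * C * k * k + u2 + kT * rescale / (c * (1.0 - c)))


def equilibrium_structure_factor(k, kT, u2, C, c):
    """Equilibrium S(k) = kT / (f'' + 2 C k^2) with f'' = u'' + kT / (c (1 - c))."""
    return kT / (2.0 * C * k * k + u2 + kT / (c * (1.0 - c)))


if __name__ == "__main__":
    kT, u2, C, c = 1.0, -2.0, 1.0, 0.5   # illustrative values, arbitrary units
    D, D_b = 4.0, 2.0                    # thermal and ballistic diffusion coefficients
    kT_eff = kT * (1.0 + D_b / D)        # effective temperature criterion
    for k in (0.2, 0.7, 1.5):
        driven = driven_structure_factor(k, kT, u2, C, c, D, lambda q: D_b * q * q)
        print(k, driven, equilibrium_structure_factor(k, kT_eff, u2, C, c))
```

Because the rescaling factor 1 + D_b/D does not depend on k when a_b = D_b k^2, the two columns coincide at every wavenumber, which is the content of the effective temperature criterion in this sketch.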


Case 2: Finite-range ballistic jumps Following MD results [50], we choose $w_R$ to
be an exponential decay with decay length R. Its Fourier transform is then given
by $\hat{w}_R(k) = 1/(1 + k^2R^2)$, and thus $\alpha_b(k) = \Gamma_b k^2R^2/(1 + k^2R^2)$. Plugging this
equation into Eq. (67) demonstrates that the same rescaling applies to the temperature
in the numerator, which corresponds to the amplitude of the fluctuations,
and in the denominator, which corresponds to the second derivative of the nonequilibrium
potential. The concept of effective temperature, however, is no longer
appropriate since the effective temperature would be a function of the wavenumber
k. A better approach is to divide both the numerator and the denominator in
Eq. (67) by the rescaling factor of the temperature. It can then be shown [186]
that one can re-interpret Eq. (67) for that particular case as the structure factor,
evaluated at the physical temperature T, of an alloy system with effective pairwise
interactions defined by Eq. (68).

$$\hat{v}^{\,eff}(k) = \frac{\hat{v}(k)}{1 + \dfrac{R^2 \Gamma_b}{D}\, \dfrac{1}{1 + k^2R^2}} \qquad (68)$$


A similar equation was originally derived by Vaks and Kamyshenko [148], albeit 
from a discrete description. An analysis of Eq. (68) shows that finite-range ballis- 
tic mixing is equivalent to the introduction of effective medium-range interac- 
tions which are repulsive between like atoms. Patterning can then be rationalized 
as the result of the competition between the short-range attractive physical inter- 
actions and the medium-range repulsive effective interactions. It is worth noting 
that competition between short-range attractive and medium-range repulsive in- 
teractions in equilibrium systems, such as block copolymers with dipolar mo- 
ments [188-191], also leads to patterning and to the stabilization of mesoscopic 
phases. In the patterning regime, the steady-state structure factor S(k) peaks at a
finite k value, $k_p$. The microstructure has thus developed compositional patterns
at a preferred length scale, which is proportional to $1/k_p$.
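
The appearance of a peak at finite wavenumber can be reproduced with a minimal sketch. It assumes the same form of the steady-state structure factor as in the short-range case, now with a_b(k) = Gamma_b k^2 R^2 / (1 + k^2 R^2), and locates the maximum of S(k) on a grid. All parameter values are illustrative (arbitrary units), chosen so that the homogeneous state remains linearly stable, and the function names are hypothetical.

```python
def ballistic_factor(k, gamma_b, R):
    """a_b(k) for an exponential relocation kernel with decay length R (assumed form)."""
    return gamma_b * (k * R) ** 2 / (1.0 + (k * R) ** 2)


def structure_factor(k, kT, u2, C, c, D, gamma_b, R):
    """Steady-state S(k) with finite-range ballistic mixing (assumed form)."""
    rescale = 1.0 + ballistic_factor(k, gamma_b, R) / (D * k * k)
    return kT * rescale / (2.0 * C * k * k + u2 + kT * rescale / (c * (1.0 - c)))


def peak_wavenumber(ks, **params):
    """Grid point at which S(k) is largest."""
    return max(ks, key=lambda k: structure_factor(k, **params))


if __name__ == "__main__":
    ks = [0.01 * (i + 1) for i in range(300)]
    long_range = dict(kT=1.0, u2=-6.0, C=1.0, c=0.5, D=4.0, gamma_b=1.0, R=2.0)
    short_range = dict(long_range, gamma_b=10.0, R=0.5)
    for label, p in (("R = 2.0", long_range), ("R = 0.5", short_range)):
        k_p = peak_wavenumber(ks, **p)
        print(label, "peak at k =", k_p, "S =", structure_factor(k_p, **p))
```

With the longer relocation distance the maximum sits at a finite wavenumber, signalling a preferred length scale; with the shorter one S(k) decreases monotonically from the smallest k on the grid, as for an ordinary solid solution above its spinodal.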


Comparison between model predictions and experiments The above models offer
a clear rationalization of the origin of compositional patterning under irradiation.
They also set a limit to the scale of decomposition. The largest decomposition
scale, reached on the $\gamma_1$ boundary in Fig. 8.9, is equal to $4\pi R$. In the limit
where R is arbitrarily large, as assumed implicitly by Nelson and coworkers, the
steady-state scale of decomposition, and the size of the precipitates, can thus be
arbitrarily large. In the limit $R \to \infty$, it is in fact possible to show directly that
the forced ballistic exchanges can be seen as producing electrostatic-like repulsive 
effective interactions between like atoms [182]. It is well established that such re- 
pulsive long-range interactions, when competing with short-range attractive inter- 
actions, lead to patterning [149]. 
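
As a quick order-of-magnitude check, the upper bound on the decomposition scale can be evaluated for the relocation decay lengths quoted earlier from MD simulations (about 1.4 Å for He and about 3.1 Å for heavier ions). The sketch takes the largest decomposition scale to be 4 pi R; the function name is hypothetical.

```python
import math


def max_pattern_scale(R):
    """Largest decomposition scale, taken here to be 4 * pi * R (same unit as R)."""
    return 4.0 * math.pi * R


if __name__ == "__main__":
    for ions, R_angstrom in (("He", 1.4), ("Ne, Ar, Kr", 3.1)):
        scale_nm = max_pattern_scale(R_angstrom) / 10.0  # 10 angstroms per nm
        print(ions, "-> upper bound of about", round(scale_nm, 1), "nm")
```

Under this assumption the bound is roughly 2 to 4 nm, that is, the same nanometer range as the precipitate sizes reported experimentally.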

Heinig and Strobel studied the possible compositional patterning of precipi- 
tates during ion-beam implantation and ion-beam mixing [192, 193]. They intro- 
duced a kinetic model for the evolution of the precipitate size which takes into 
account both the forward and backward solute fluxes forced by irradiation. They 
used an effective surface energy to rationalize the size reached by precipitates at 
steady state. While this effective surface energy is positive at low ballistic jump 
frequency or at high temperature, it reaches a negative value for a range of pre- 
cipitate size at greater ballistic jump frequency or at low temperature. The flux or 
the temperature at which the effective surface energy goes to zero is interpreted 
as the onset of inverse ripening, or, in other terms, compositional patterning. 
While Heinig and Strobel’s model bears some similarity to the models presented 
in this section, it relies on several approximations and consequently the results
are not valid when $R \to 0$ or when the irradiation flux is large. In particular, the
model predicts that compositional patterning can take place when R is arbitrarily 
small, in contradiction to KMC simulations [184]. 

Returning to metallic systems, the observations reported by Nelson and co- 
workers and by Schmitz and coworkers appear to be qualitatively consistent with 
the predictions from the above models and summarized in the dynamical equilib- 
rium phase diagram, Fig. 8.9. The decomposition scale observed experimentally, 
several nanometers, seems however to be at the higher end of the spectrum ex- 
pected from the analytical model. It is unclear whether the presence of chemical 
order in the precipitates could have influenced the patterning reaction. In Section 
8.2.4.2, we will briefly review recent work indicating that the other characteristic 
length introduced by displacement cascades, their size L, can in fact trigger a pat- 
terning of the chemical order in alloys. In off-stoichiometric alloys, this pattern- 
ing of order produces a compositional patterning also, and additional experimen- 
tal work is needed to establish whether the patterning reported by Nelson and 
coworkers and by Schmitz and coworkers is triggered by the relocation distance 
R, by the cascade size L, or by both. 

Averback and coworkers have recently performed experiments designed specif- 
ically to test the main predictions of Enrique’s model. They have studied alloy sys- 
tems which are immiscible in the solid state only, e.g., Ag—Cu [194, 195] and 
Co-Cu [196], or in the solid and in the liquid states such as Ag—Co [197]. The 
results obtained for the former systems are consistent with Enrique’s model pre- 
dictions. In the Co—Cu case in particular, 1 MeV ion irradiation can induce a sta- 
bilization of the size of Co-rich precipitates to a value that increases with the irra- 
diation temperature, up to a threshold temperature of 350 °C, above which 
precipitates appear to coarsen continuously. In the case of alloys that are immis- 
cible in both the solid and the liquid state, however, no saturation of the size 
of the precipitates is observed. Averback and coworkers rationalized this result 
by noting that displacement cascades in highly immiscible alloys produce little 
or no chemical mixing [39]. Quantitative analysis of the coarsening rate in this 
alloy under irradiation led the authors to propose that, in fact, the mechanism 
of precipitate coarsening did not involve atomic diffusion, but rather particle 
coalescence. 


8.2.4.2 Patterning of Chemical Order under Irradiation 

Turning now to the case of alloy systems which undergo chemical ordering, as 
recalled in the introduction (Section 8.2.1.1) it is well documented that irra- 
diation at low temperature or high flux can disorder partly or fully an ordered 
alloy, while irradiation at elevated temperature or low flux can induce chemical 
ordering. Recent KMC simulations and analytical modeling, however, have pre- 
dicted that irradiation can also drive a chemically ordered alloy into a steady state 
whose microstructure consists of well-ordered domains of finite size, typically in 
the nanometer range [198-200]. The stabilization of this new steady state, which 
we refer to as patterning of order, requires three conditions. Firstly, the irradiation
temperature should be below the equilibrium order-disorder transition
temperature, $T_c$. Secondly, the irradiation flux should be such that the rate of disordering
imposed by irradiation is similar to the rate of reordering promoted by
the thermally activated migration of point defects. A third condition was uncovered
by atomistic simulations: the size of the disordered zones resulting from
the dense displacement cascades produced by the energetic ions has to exceed a
threshold value, $L_c$. This value is such that, during the thermal annealing of a disordered
zone, several new ordered domains form and grow in that volume. The
continuous disordering of existing ordered domains and the continuous creation
of small new domains lead to the destabilization of pre-existing macroscopic ordered
domains, and to the dynamical stabilization of finite-size ordered domains
at steady state. In the case of smaller cascade sizes ($L < L_c$), as well as for large
but dilute cascades, reordering proceeds instead by migration of the boundary be- 
tween the ordered matrix and the disordered zone. It does not produce new do- 
mains, therefore, and no patterning of order is observed. It has been proposed 
that the state of patterning of order can be defined by analyzing quantitatively 
the intensity and the nature of the fluctuations of order using a scaling approach 
[200]. In the absence of patterning, fluctuations of order obey a scaling that is 
consistent with the effective temperature criterion. This scaling, however, breaks 
down as the cascade size is increased, and the fluctuations of order then obey a 
different scaling. This change in scaling can be used to define rigorously the 
steady state of patterning of order, and to build dynamical equilibrium phase diagrams.
One example is given in Fig. 8.11 for an A₅₀B₅₀ alloy developing an L1₀
ordered structure.

Fig. 8.11 Dynamical equilibrium phase diagram for an A₅₀B₅₀ alloy that develops L1₀
long-range order (LRO) at thermodynamic equilibrium, as a function of the ballistic
exchange frequency between nearest-neighbor atoms, $\Gamma_b$, and the number of pairs
exchanged at once in each cascade, b (or the cascade size L, calculated using the lattice
parameter of FePt). The KMC simulations are performed at a constant temperature
$T = 0.09$ eV $= 0.6\,T_c$. The threshold for patterning of order is around b = 100
(from ref. [201]).

It is important to note that, according to the above simulations and models, the
largest average size of the ordered domains at steady state is close to that of the
disordered zones. Since these zones can extend up to 10 nm, according to dark-field
TEM results [49] and MD simulations [39, 45, 46], it appears that the patterning
of order could reach larger dimensions than the compositional patterns
induced by a finite relocation distance R. From an application viewpoint, it has
been proposed that one could take advantage of the dynamical stabilization of
nanoscale ordered domains under irradiation to synthesize nanostructures optimized
for mechanical or magnetic applications that require the presence of
ordered domains at the nanoscale with a high degree of chemical order [201,
202]. Such microstructures are difficult to achieve by conventional thermal processing
since ordering is then accompanied by domain growth [203]. Systematic
experiments need to be carried out to test the model predictions on patterning of
order, and, if this patterning reaction is indeed observed, to evaluate its usefulness
for the synthesis of functional materials.


8.2.4.3 Compositional Patterning under Plastic Deformation 

Alloys subjected to sustained plastic deformation can undergo nonequilibrium 
phase transformations and microstructural evolutions. As pointed out in the early 
1980s [35], these evolutions shared many similarities with those observed in al- 
loys subjected to irradiation. In particular, plastic deformation at low temperature 
can dissolve pre-existing precipitates, while at elevated temperature it may induce 
the formation of equilibrium or even nonequilibrium precipitates. In the mid 
1990s, generic atomistic computer simulations indicated furthermore that sus- 
tained plastic deformation can lead to the dynamical stabilization of composi- 
tional patterns [204], and of patterns of order [205]. The scale of these steady-state 
patterns was predicted to increase continuously as the temperature or the shear- 
ing frequency is increased. While these predictions have received experimental 
support from systematic studies employing high-energy ball milling, it is only re- 
cently that a theoretical approach has been introduced to offer a clear insight into 
the origin of plasticity-induced compositional patterning [120, 206]. 

This analysis, which parallels the standard approach to study of the mixing of 
passive markers transported by convective and possibly turbulent flow in fluids, 
focuses on the mean-squared relative displacement (MSRD) of pairs of atoms. In 
contrast to the case where the two atoms of a pair undergo uncorrelated migra- 
tion, for instance due to thermally activated diffusion, the MSRD produced by 
plastic deformation is a function of the separation distance, R, of the atom pair 
considered. By analogy with the works of Taylor and Richardson for turbulent 
flows (see ref. [207] for a recent review), one can define an effective diffusion coefficient,
$D_{sh}(R)$, by Eq. (69), where L is the glide length of dislocations, and is
thus bounded by the grain size, b the magnitude of the Burgers vector, and $\Gamma_{sh}$
the atomic jump frequency due to shearing events (a factor 2 appears because
the two atoms comprising a pair may jump).


b? RL?

The R-dependence of the effective diffusion coefficient comes from the fact that the
probability of shearing a pair of atoms separated by a distance R increases linearly
with that separation distance. Equation (69) is derived by assuming that plastic
deformation is homogeneous and isotropic, and that the mobile dislocation segments
and the glide distance of these segments are larger than R. Equation (69) thus applies only to
separation distances R < L. For separation distances R > L, the length R becomes
irrelevant, and the effective diffusion coefficient becomes independent of R [Eq.
(70)].


273 


Dep (R) oc Ta ERST (70) 


The R dependence of the effective diffusion coefficient for R < L leads to a superdiffusive mixing by plastic
deformation. It also provides a simple and powerful rationalization for the stabi- 
lization of compositional patterns. The outcome of the competition between the 
mixing forced by deformation and the decomposition promoted by thermody- 
namic forces is now scale-dependent. At small length scales, thermal decomposi- 
tion dominates provided that temperature is high enough, and the microstructure 
is thus decomposed; at larger length scales, the forced mixing dominates, and 
thus the microstructure, at these scales, is homogeneous. Such a microstructure 
corresponds in fact to the mesoscale patterns observed in KMC simulations [204]. 
Recent 3DTAP characterization of ball-milled Ag—Cu alloys [206] offers support 
for this framework; in particular, in agreement with an earlier prediction, it is ob- 
served that the length scale of the patterns increases continuously as the temper- 
ature of the materials during plastic deformation is increased. It is furthermore 
proposed that the crossover length scale, which is obtained by equating the forced 
mixing diffusion coefficient to the thermal one, should correspond to the length 
scale of the compositional patterns. More work is necessary to evaluate systemat- 
ically these predictions, which rely on a very simplified description of the alloy 
system undergoing plastic deformation. In particular, this description ignores 
stress effects, interactions between dislocations, and any possible feedback be- 
tween the microstructure and the plastic deformation. These limitations can be 
overcome by using MD, although one then needs to extrapolate the results from 
the high strain rates required in MD simulations to the range of experimental 
strain rates, and from the high temperatures required to allow for thermally acti- 
vated diffusion to take place over the simulation times to the experimental tem- 
perature range. 
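The scale-dependent competition described above can be illustrated numerically. The sketch below is only a toy model: it assumes a forced-mixing coefficient that grows linearly with R up to the cut-off L and then saturates, and an Arrhenius form for the thermal diffusion coefficient. The function names and every parameter value (the slope `a`, the cut-off `l_cut`, `d0`, `e_act`) are hypothetical and are not taken from the text.

```python
import math

K_B = 8.617e-5  # Boltzmann constant in eV/K


def forced_diffusion(r, a, l_cut):
    """Forced-mixing coefficient: linear in r for r < l_cut, constant beyond.

    'a' is a hypothetical slope (m/s) lumping together the shearing
    frequency and the geometrical factors.
    """
    return a * min(r, l_cut)


def thermal_diffusion(temp, d0, e_act):
    """Arrhenius thermal diffusion coefficient (m^2/s); d0 and e_act assumed."""
    return d0 * math.exp(-e_act / (K_B * temp))


def crossover_length(temp, a, l_cut, d0, e_act):
    """Length at which the forced and thermal coefficients are equal.

    Returns None when thermal diffusion exceeds the saturated forced value,
    i.e. when no crossover exists below the cut-off.
    """
    r_c = thermal_diffusion(temp, d0, e_act) / a
    return r_c if r_c < l_cut else None


if __name__ == "__main__":
    params = dict(a=1e-12, l_cut=1e-6, d0=1e-5, e_act=1.0)  # illustrative only
    for temp in (325.0, 350.0, 375.0):
        print(temp, crossover_length(temp, **params))
```

With these illustrative numbers the crossover length grows with temperature, which is the qualitative trend reported for the pattern length scale; the absolute values carry no physical meaning.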


8.2.5 
Practical Applications and Extensions 


8.2.5.1 Tribochemical Reactions 
It is well documented that high-carbon steels, when subjected to repeated loading, 
develop a so-called “white phase”, a supersaturated solution of carbon in 
iron, resulting from the dissolution of the cementite, which is brittle. Such is the 
case in ball bearings, in railway tracks, and in the wheels of express trains. As 
discussed in Section 8.2.1, alloys under repeated mechanical loading are one ex- 
ample of driven alloys, much in the same way as alloys under ball milling. How 
should the forcing intensity, e.g., for a train wheel, be defined? 

For ball milling, on the basis of experimental measurements, the forcing intensity 
I_bm was identified as the momentum transferred from the ball to unit mass of 
powder per unit time [see Eq. (18)]. This intensity can also be interpreted 
as the force imposed on the material multiplied by the fraction of time for which 
the material is being loaded: indeed, V = acceleration × duration of impact, so 
that M_b V = force × duration of impact, and f/M_p = frequency at which a given 
grain of powder is being strained. For a train wheel, we can estimate from the 
theory of elastic contact between a cylinder (the wheel) and a plane (the track), 
the size of the zone affected by the load [208]. The duration of loading is that 
size divided by the circumferential velocity of the wheel, and the frequency of 
loading is the frequency of rotation of the wheel. As a consequence, the forcing 
intensity of the material at the periphery of the wheel is written as Eq. (71), where 
F is the load on the contact, l is the length of the contact between the track and 
the wheel, E* is an effective Young’s modulus, and R is the wheel radius [116]. 


I_w = \left[ \frac{4F^3}{\pi^3 E^* l R} \right]^{1/2}    (71)


As can be seen, for a perfectly circular wheel the forcing intensity does not de- 
pend on the velocity. Assuming that the rate of wear of a wheel is proportional 
to the intensity, we conclude that a perfectly circular wheel remains circular over 
the course of time. 
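The velocity independence can be checked with a short calculation. The sketch below assumes a Hertz line contact between a cylinder and a plane, with contact half-width a = [4FR/(π l E*)]^(1/2), and takes the loaded zone to be the full contact width 2a; both choices, the function names, and the numerical values are assumptions made for illustration.

```python
import math


def contact_half_width(load, length, e_star, radius):
    """Hertz half-width of a cylinder-on-plane line contact (m)."""
    return math.sqrt(4.0 * load * radius / (math.pi * length * e_star))


def forcing_intensity(load, length, e_star, radius, speed):
    """Load multiplied by the fraction of time a surface element is loaded."""
    zone = 2.0 * contact_half_width(load, length, e_star, radius)
    duration = zone / speed                       # time under load per turn (s)
    frequency = speed / (2.0 * math.pi * radius)  # turns per second
    return load * duration * frequency


if __name__ == "__main__":
    # Hypothetical values: load (N), contact length (m), E* (Pa), radius (m)
    args = (1.0e5, 0.01, 1.1e11, 0.46)
    for speed in (10.0, 40.0, 80.0):  # circumferential velocity in m/s
        print(speed, forcing_intensity(*args, speed))
```

The speed cancels between the duration and the frequency of loading, so the three printed intensities coincide, and under these assumptions the result reduces to [4F³/(π³ E* l R)]^(1/2).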

This is no longer the case if the wheel has local changes of curvature, as most 
real wheels do: a wheel may, for instance, present two local maxima of curvature 
(in an elliptical shape) or three or more. In such a situation, because of inertial 
effects, the load on the contact depends both on the location on the wheel and 
on the velocity. For a given symmetry of the wheel (two-, three-, fourfold), the 
higher the velocity, the higher the intensity. 

An interesting property is revealed by this analysis: for a given symmetry of the 
wheel, the location of the maxima of forcing intensity depends on the velocity. At 
lower velocities, the forcing intensity is a maximum at the points of maximum 
curvature; above a crossover velocity, the reverse is true: the intensity is a maxi- 
mum in the flatter regions. Assuming the rate of wear increases with the forcing 
intensity, we identify a critical velocity below which self-healing of the defects in 
curvature will dominate; above that velocity, the defects will amplify: with typical 
values for French TGV trains, the predicted velocity is 55 km h⁻¹ (or 165 km h⁻¹) 
for threefold (or twofold) symmetry. 

If we assume that the rate of wear is proportional to the forcing intensity, it is 
easily found that the amplitude of the oscillation of the wheel radius increases 
exponentially with the mileage; this is observed to be true on actual trains. 

The practical implications of this approach are numerous: developing an appro- 
priate miniaturized wear test, developing cost-effective control techniques for the 
circularity of the wheels, optimizing the policy for remachining of the wheels, etc. 


8.2.5.2 Pharmaceutical Compounds Synthesized by Mechanical Activation 
Grinding and micronization are techniques commonly employed to reduce the 
size of particles of powder materials in the pharmaceutical, agrochemical, and 
dye industries. More recently, however, it has been realized that grinding can in- 
duce phase transformations in such compounds. Descamps and coworkers, in 
particular, have studied systematically the phase transformations induced by con- 
trolled high-energy ball milling in several organic compounds such as trehalose, 
lactose, D-sorbitol, and indomethacin (IMC) [209-212]. Using a planetary ball 
mill, these authors report that, regardless of the initial state, one and only one 
steady state is reached for a given milling temperature and rotation disc speed, 
Ω. Furthermore, they observe that lowering the milling temperature at constant 
Ω, or increasing Ω at constant temperature, favors the stabilization of an amorphous 
phase at steady state. Such effects can thus be rationalized, at least qualitatively, 
by invoking the effective temperature criterion (see Section 8.2.4). In the 
case of indomethacin, however, the equilibrium crystalline phase γ is first transformed 
to an amorphous phase, before crystallizing into a phase that is only 
metastable at equilibrium. 

This sequence of transformations is reminiscent of results reported for some 
metallic [134] and ceramic materials. Within the effective temperature approach, 
the rationalization of this sequence and of the stabilization of a phase that has no 
stability domain in the equilibrium phase diagram would imply, at least, that one 
should define different effective temperatures for different structures, and that 
the effective temperature of the metastable crystalline phase would be much 
lower than that of the γ phase. On a phenomenological basis, since the same conditions 
of mechanical processing can produce different types or amounts of damage 
in different phases for the same overall chemical composition, and since the 
annealing kinetics of that damage can be quite different, it may not be too sur- 
prising that the dynamical stability of different compounds should be given by 
different effective temperatures. This approach, however, raises practical ques- 
tions since it would imply that one can no longer predict phase stability during 
milling by simple inspection of the equilibrium phase diagram. More fundamen- 
tally, it would require that the various phases in competition share a parent phase 
which can serve as a common reference, here the amorphous phase, for compar- 
ison of the values of the effective temperatures to be meaningful. We note that 
the more recent analysis of alloys driven by plastic deformation indicates that 
the concept of effective interactions provides a sounder approach than that of the 
effective temperature. The difficulties noted above are no longer problematic if 
one uses this concept of effective interactions. From a practical viewpoint, how- 
ever, this concept is somewhat difficult to apply since there is no simple formula 
for rescaling of the physical interactions which would describe the steady states 
stabilized by mechanical activation. 


8.3 
Outlook 


Constructing a phenomenology of driven materials, i.e., an extended database 
and a theoretical framework to rationalize the latter, is of both academic and prac- 
tical interest. The diversity and the complexity of the responses of materials to ex- 
ternal forcing remain to be revealed; identifying the control parameters, finding 
the proper way to define the forcing intensity, and identifying the new character- 
istic lengths introduced by the external forcing, are fascinating challenges with 
direct practical implications (process optimization, prediction of time evolution 
of materials properties, alloy design, etc.). On the theoretical side, the develop- 
ment of models for driven materials often challenges the existing descriptions of 
the kinetics of the intrinsic thermal relaxation (i.e., that which occurs in the ab- 
sence of forcing) of nonequilibrium states. It also forces one to understand better 
the characteristics of the external dynamics, and, in particular, the effect of space 
and time correlations in the latter. Since the physics of forcing is better under- 
stood at the atomic scale, we are facing the need to make better links between 
the macroscopic theories and the atomistic processes. 

Much work remains to be done, which requires, altogether, systematic accumulation 
of experimental data over a broad range of forcing conditions and materials, 
computer experiments on model systems, and the development of appropriate 
theoretical approaches. 


9 
Change of Alloy Properties under Dimensional 
Restrictions 


Hirotaro Mori and Jung-Goo Lee 


9.1 
Introduction 


Small particles in the size range from a few to several nanometers have attracted 
considerable attention from both scientific and technological viewpoints in recent 
years. The reason is that these nanometer-sized particles (hereafter NPs) often 
exhibit structures and properties which are significantly different from those of 
the corresponding bulk materials and therefore have potential for use as advanced 
materials with new electronic, magnetic, optical, and thermal properties [1, 
2]. These NPs consist of 10² to 10⁴ atoms and the surface-to-volume ratio becomes 
remarkably large with decreasing size of the NP (Table 9.1). The surface 
atoms have some broken bonds and are in a high-energy state compared with 
the inner atoms. 
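The rapid growth of the surface fraction with decreasing size can be estimated with a simple geometric model. The sketch below treats the particle as a sphere filled with close-packed atoms and counts as surface atoms those lying within one atomic diameter of the surface. The atomic diameter of 0.29 nm, the packing fraction of 0.74, and the function name are assumptions chosen for illustration; the result is a rough estimate, not a reproduction of tabulated values.

```python
def atom_counts(diameter_nm, atom_nm=0.29, packing=0.74):
    """Rough total and surface atom counts for a spherical particle.

    Surface atoms are those in the outermost shell of thickness atom_nm.
    Returns (total, surface, surface fraction).
    """
    total = packing * (diameter_nm / atom_nm) ** 3
    core_diameter = max(diameter_nm - 2.0 * atom_nm, 0.0)
    core = packing * (core_diameter / atom_nm) ** 3
    surface = total - core
    return total, surface, surface / total


if __name__ == "__main__":
    for d in (10.0, 5.0, 2.0, 1.0):
        total, surface, fraction = atom_counts(d)
        print(f"{d:4.0f} nm  total ~{total:7.0f}  surface ~{surface:6.0f}  "
              f"fraction ~{100.0 * fraction:3.0f}%")
```

The total atom count scales as the cube of the diameter, while the surface fraction rises from a minor share at 10 nm to nearly all atoms at 1 nm.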

The surface atoms therefore make significant contributions to the unique prop- 
erties of NPs. This is called the “surface effect.” There is another effect in relation 
to the finite size of particles. When the particle size becomes as small as the 
wavelength of electrons, the energy state of the electrons can be changed; this 
results in unique, measurable physical properties in NPs, and is known as the 
“quantum size effect” [3]. To get a fundamental understanding of these effects on 
the properties of NPs, intensive research has been carried out. It is well known, 
for example, that the stable structure of indium undergoes a change from body- 
centered tetragonal (bct) for bulk to face-centered cubic (fcc) when the size of the 
particles is reduced to several nanometers [4]. It is also well established that such 
phase transition temperatures as the melting temperature (Tm) are significantly 
reduced with decreasing size of particles [5-7]. All these examples are concerned 
with phase transitions in nanometer-sized pure substances, and in related experi- 
ments the temperature (T) and size (D) of particles were employed as experimen- 
tal parameters. On the other hand, studies on alloy NPs examined as a function 
of T, D, and composition (X), are quite limited [8-11], although in recent years 
much attention has been paid to alloy NPs as candidates for new functional materials 
[12, 13]. This limitation of studies on alloy NPs is mainly due to the fact 
that it was rather difficult to control and measure the three parameters, T, D, 
and X, at the same time in a particle. 

Table 9.1 The number of total atoms, the number of surface atoms, and 
the ratio of surface atoms in nanometer-sized particles. 

The diameter    The number        The number of    The ratio of 
of particle     of total atoms    surface atoms    surface atoms (%) 

10 nm           30000             6000             20 
5 nm            4000              1600             40 
2 nm            250               200              80 
1 nm            30                ~29              ~99 

However, recent remarkable progress in transmission electron microscopy 
(TEM) enables us to study not only the structure, but also the chemical composi- 
tion of an isolated nanometer-sized target material at a fixed temperature. With 
the use of this technique it now becomes possible to examine such alloy proper- 
ties as the alloy phase formation in isolated NPs as a function of T, D, and X of 
the system. This situation opens a wide, unexplored research field on the struc- 
tural stability of nanometer-sized condensed matter in two- (or multi-)component 
systems. 

In this chapter, an overview on recent work on alloy phase formation in NPs 
will be presented, with emphasis placed on studies carried out by in-situ TEM in 
combination with thermodynamic calculations. Factors affecting the phase equi- 
librium of alloy NPs are described. 


9.2 
Instrumentation for in-situ Observation of Phase Transformation of Nanometer- 
Sized Alloy Particles 


It is well recognized that TEM is one of the most powerful experimental tools for 
the characterization of NPs, especially for microstructural (or phase) analysis, and 
much work concerning NPs has been carried out using TEM. However, special 
attention has to be paid to keeping the surface of NPs as clean as possible during 
the experiments, because most properties of NPs are very sensitive to surface 
cleanliness. 

For this reason, such in-situ experiments with TEM, in which both production 
and observation of NPs are possible in the same vacuum chamber without expos- 
ing particles to any undesired atmosphere, have been developed [14, 15]. Two 
types of evaporation systems have been designed. In one type, a double-source 
evaporator is installed directly inside the TEM, as shown in Fig. 9.1. It consists 
of two tungsten filaments (three or four are possible if required). 
Fig. 9.1 Hitachi H-800 type 200 kV electron microscope as modified for 
analyzing nanoparticles [14]: (a) column; (b) schematic cross-sectional 
illustration of a double-source evaporator installed in the specimen 
chamber (in the square in (a)). 


The distance between the filaments and a TEM sample holder is approximately 
100 mm. An amorphous carbon film was mainly used as a supporting film, 
mounted on a TEM grid. Using this evaporator, one of the source materials was 
first evaporated from one filament onto the supporting film kept at a certain tem- 
perature, and NPs of the material were produced on the film. Next, the other 
source material was evaporated from the second filament onto the same film 
kept at another fixed temperature. The filaments were heated by the Joule effect. 
Alloy phase formation in the NPs associated with the deposition of the second 
source material was studied by taking bright-field images (BFIs) and dark-field 
images (DFIs) and selected area electron diffraction (SAED) patterns. For quanti- 
tative analyses of the intensity of Debye-Scherrer rings, imaging plates (IPs) were 
employed instead of conventional films. A heating or a cooling holder was used to 
control the temperature of the amorphous carbon film. The TEM was equipped 
with a turbo-molecular pumping system to achieve a base pressure of around 
3 x 10° Pa in the specimen chamber. 

In the other type, a side-entry TEM sample holder equipped with a miniature 
double-source evaporator has been developed; actually, it is an improved version 
of the hot holder designed by Kamino and Saka [16], and is shown schematically 
in Fig. 9.2: it consists essentially of three spiral-shaped tungsten filaments. The 
middle filament is attached with a flake of graphite used as the supporting sub- 
strate, and the outer two filaments with source materials. Before experiments, the 
graphite flake was baked at 1070 K for 60 s by feeding an appropriate amount of 
current into the middle filament. After being baked, the graphite substrate was 
cooled to a certain temperature. Subsequently, one source material was evapo- 
rated from the second filament onto the graphite substrate, where pure NPs 
were produced. The other source material was then evaporated from the third filament 
onto the pure NPs. Microstructural changes in an isolated particle associated 
with alloying of solute atoms were studied in situ in the microscope, using 
a supersensitive television camera (Gatan 622SC) with a time resolution of 30 
frames s⁻¹. Images were recorded on videotape through a video recorder system 
connected to the television camera. Some photographs were reproduced from the 
videotapes through an image processor (Image Sigma) operating in an accumulation 
mode (three frames were accumulated for one photograph). The base pressure 
in the microscope was around 5 × 10⁻⁷ Pa. The chemical composition of alloy 
NPs was examined with energy-dispersive X-ray spectroscopy (EDS), if necessary. 

Fig. 9.2 The tip of a side-entry TEM holder equipped with a miniature 
double-source evaporator (see the text for details). 


9.3 
Depression of the Eutectic Temperature and its Relevant Phenomena 


9.3.1 
Atomic Diffusivity in Nanometer-Sized Particles 


The atomic diffusivity in NPs is a subject of fundamental importance, since it 
plays a key role in determining the kinetics of all diffusional phase transfor- 
mations in NPs. Experimental studies on self-diffusion and on interdiffusion in 
NPs in the literature will be described briefly. 

Studies on the self-diffusion in NPs of pure substances are rather limited. Dick 
et al. investigated in an indirect manner the size-dependent self-diffusion coefficient 
of gold in gold NPs [17]. First, the melting point of gold NPs was determined 
as a function of particle size by a combination of differential thermal 
analysis and thermal gravimetric analysis techniques. Then, based upon the 
size-dependent melting point data, the size-dependent self-diffusion coefficient 
was calculated under the following assumptions: (a) the diffusion coefficient 
exhibits a Boltzmann-Arrhenius dependence on temperature; (b) at Tm, the diffusion 
coefficient is the same for all gold particles, independently of the size; and 
(c) the activation enthalpy of diffusion is proportional to the average number of 
bonds that need to be broken in the melting process. It has been shown that the 
self-diffusion coefficient obtained increases significantly with decreasing size of 
particles [17]. For example, the coefficient was reported to be 10⁻²⁸ m² s⁻¹ for a 
2 nm-sized particle at room temperature (RT) [17], which is far higher than the 
bulk diffusion coefficient at RT, i.e., around 3.6 × 10⁻³⁶ m² s⁻¹ [18]. The 
assumptions employed merit further discussion. Unfortunately, direct experimental 
evidence for an enhanced self-diffusion in NPs of pure substances is still 
lacking. 
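The logic of assumptions (a) and (b) can be made explicit with a small calculation. If the Arrhenius prefactor is additionally taken to be independent of size (an assumption made here, not stated in the text), then requiring the same diffusion coefficient at Tm for every particle size implies that the activation energy scales with the size-dependent melting point. The sketch below uses that scaling; the prefactor, the bulk activation energy, and the 1000 K melting point assigned to a small particle are hypothetical illustrative values, and only the bulk melting point of gold (1337 K) is a known quantity.

```python
import math

K_B = 8.617e-5  # Boltzmann constant in eV/K


def size_dependent_diffusivity(temp, tm_size, tm_bulk, d0, e_bulk):
    """Arrhenius diffusivity with activation energy scaled by Tm(size)/Tm(bulk).

    The scaling follows from demanding an identical diffusivity at the
    melting point for all sizes, with a size-independent prefactor d0.
    """
    e_size = e_bulk * tm_size / tm_bulk
    return d0 * math.exp(-e_size / (K_B * temp))


if __name__ == "__main__":
    tm_bulk = 1337.0           # melting point of bulk gold (K)
    d0, e_bulk = 1.0e-5, 1.8   # hypothetical prefactor (m^2/s) and energy (eV)
    for tm_size in (1337.0, 1200.0, 1000.0):  # hypothetical size-dependent Tm
        d_rt = size_dependent_diffusivity(300.0, tm_size, tm_bulk, d0, e_bulk)
        print(tm_size, d_rt)
```

Even a moderate depression of the melting point raises the room-temperature diffusivity by many orders of magnitude in this model, because the change enters through the exponent.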

On the other hand, studies on the interdiffusion (or alloying) in NPs are not 
rare. For example, Mori et al. have examined the size-dependent alloying of cop- 
per into gold NPs by electron microscopy [19, 20], and the following observations 
were made: 

• when vapor-deposited copper atoms came in contact with 
gold NPs of diameter approximately 4 nm, supported on an 
amorphous carbon film, they quickly (in less than 20 s) 
dissolved into gold NPs to form a Au-Cu solid solution even 
at RT 

• in gold NPs of diameter approximately 10 nm, alloying of 
copper took place only in a shell-shaped surface region of 
individual NPs; no alloying occurred at the central core 
region of NPs at RT 

• in gold particles of approximately 30 nm diameter such 
alloying of copper atoms was not detected at RT. 


It has been confirmed that the rapid alloying in approximately 4 nm-sized par- 
ticles took place not via a melting process, but via a solid-state process [20]. It has 
also been confirmed that such rapid alloying is not an artifact originating from a 
temperature rise due to electron beam heating, heat of condensation, heat of mix- 
ing, or impingement of flying solute atoms with a kinetic energy of the order of 
kT, but is an intrinsic property of NPs [21]. These observations clearly indicate 
that the interdiffusion (or alloying) becomes markedly enhanced with decreasing 
size of particles. 

A simple relationship, x = (Dt)^{1/2}, can be used as a rough estimate for the 
diffusion coefficient D, where t is the time needed to achieve appreciable diffusion 
of solute atoms over a distance x [22]. With x = 2.0 nm (half the diameter 
of 4 nmsized gold NPs) and t= 20 s, a value of 8 x 107" m? st is obtained 
for D. This value gives an estimate for the lower limit of the diffusivity of cop- 
per in 4 nm-sized gold NPs (or of the copper atom mixing rate with gold), since 
the alloying took place in less than 20 s. On the other hand, the diffusion coefficient 
of copper in bulk gold, D_Cu-Au, can be written (in m² s⁻¹) as D_Cu-Au = 
(1.2 × 10⁻⁷) exp(−14 300/T), where T is the absolute temperature [22]. Extrapolation of 
this relationship to room temperature gives a value of 2.4 × 10⁻²⁸ m² s⁻¹ for 
D_Cu-Au at 300 K. These calculations suggest that the observed copper diffusivity 
in 4 nm-sized gold NPs is many orders of magnitude faster than that in bulk 
gold. It has been confirmed experimentally that the enhanced alloying takes place 
not only in the solid-solution-forming alloy systems (e.g., Au-Cu) but also in the 
compound-forming alloy systems [21]. 
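The extrapolation of the bulk diffusion coefficient to room temperature is a one-line Arrhenius evaluation. The sketch below reads the prefactor as 1.2 × 10⁻⁷ m² s⁻¹ and the exponent as −14 300/T with T in kelvin; the function and parameter names are illustrative.

```python
import math


def d_cu_in_bulk_au(temp_k, d0=1.2e-7, q_over_k=14300.0):
    """Diffusion coefficient of Cu in bulk Au (m^2/s), Arrhenius form.

    d0 is the prefactor in m^2/s; q_over_k is the activation energy
    divided by the Boltzmann constant, in kelvin.
    """
    return d0 * math.exp(-q_over_k / temp_k)


if __name__ == "__main__":
    print(d_cu_in_bulk_au(300.0))  # about 2.4e-28 m^2/s
```

Because the temperature enters through the exponent, small changes in T shift the result by orders of magnitude, which is why the extrapolated bulk value lies so far below any diffusivity that could produce alloying within seconds.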

The rapid alloying (or interdiffusion) mentioned above is concerned with the 
atom mixing in NPs supported on a solid substrate, and is investigated by 
electron microscopy. It is of interest that enhanced atom mixing has also been 
observed in core-shell-type Au-Ag NPs in a completely different experiment. 
Shibata et al. have studied size-dependent alloy formation of silver-coated gold 
NPs in aqueous solution at ambient temperature using X-ray absorption fine 
structure spectroscopy (XAFS) [23]. Various Au core sizes and Ag shell thick- 
nesses were prepared using radiolytic wet techniques. The authors observed 
remarkable size dependence in the room-temperature interdiffusion of the two 
metals: for the very small particles (<4.6 nm initial Au-core size), evidence for 
the interdiffusion between the Au core and the Ag shell was obtained, whereas 
for the larger particles the core/shell boundary was maintained to within one 
monolayer, indicating the absence of interdiffusion. It was proposed that defects 
at the boundary may play an important role in the enhancement of the atom mi- 
gration (or atom mixing) in small NPs [23]. 

Since similar enhanced atom mixing has been obtained in completely different 
kinds of experiments, it seems safe to consider that size-dependent alloying of 
two metals does take place rather generally in NPs and a marked enhancement 
of intermixing (for example, room-temperature alloying of Cu into Au) has been 
achieved in NPs. The mechanism of the enhanced atom intermixing remains a 
subject for future studies. 


9.3.2 
Eutectic Temperature in Nanometer-Sized Alloy Particles 


It is well established that the Tm of pure substances is reduced with decreasing 
size of particles. Similarly, the eutectic temperature (Teu) decreases with decreasing 
size of alloy particles. It should be noted that in the eutectic temperature 
suppression, not only the surface energy but also the interfacial energy plays an 
essential role. In the following, the extent of the eutectic temperature suppression 
will be described, together with the phase equilibrium induced by such deep 
suppression of Teu, which is characteristic of alloy NPs in the eutectic system. 

Figure 9.3 shows a typical sequence of the alloying process of indium atoms 
into an approximately 5.2 nm-sized tin particle at RT [15]. These pictures were 
taken from a video recording. The three numbers inserted in each micrograph 
indicate the time in units of minutes, seconds, and sixtieths of a second. 

Fig. 9.3 In-situ observation of the alloying process of indium into an 
approximately 5.2 nm-sized tin particle. A crystalline-to-liquid (C-L) 
transition took place during deposition of indium onto the tin particle 
kept at room temperature (compare (c) with (e)). The time is indicated 
on each micrograph in units of minutes, seconds, and sixtieths of 
seconds. 


Figure 9.3(a) shows an as-produced, pure tin particle on a graphite substrate 
(bottom). The 0.291 nm-spaced fringes seen in this particle are the (020) lattice 
fringes of β-Sn. Figures 9.3(b)-(e) show the same particle during indium-atom 
deposition. In spite of the indium deposition, the particle remains a single 
crystal, as shown in Figs. 9.3(b) and (c), which indicates the formation of a solid 
solution in the particle. The diameter of the particle has increased from approxi- 
mately 5.2 nm (Fig. 9.3a) to 6.7 nm (Fig. 9.3c) during indium deposition. From 
this size increment, the indium concentration in the solid solution can be esti- 
mated at approximately 53 at.% In. With continued deposition of indium, the par- 
ticle underwent a crystalline-to-liquid (C-L) phase transition, as shown in Figs. 
9.3(c) and (e): that is, all the lattice fringes in the particle disappeared abruptly 
and only a uniform contrast typical of the liquid state appeared (Fig. 9.3e). The 
time interval between Figs. 9.3(c) and (e) was 4 s, indicating that the C-L phase 
transition took place very quickly. Figure 9.3(f) shows the same particle (liquid 
droplet) after the power of the tungsten filament attached to the indium had 
been turned off. It is evident from Fig. 9.3(f) that the C-L phase transition in 
Fig. 9.3(c) to (e) did not result from a temperature increase induced by radiation 
from the neighboring filament that was heated for indium deposition onto 
the tin NP. In view of the fact that the Teu of bulk In-Sn alloys is 
393 K, it seems safe to conclude that the melting temperature of In-Sn alloy NPs 
with compositions close to the eutectic one is below RT. In other words, a depression 
of Teu by as much as 100 K took place in association with the particle-size 
reduction. 

Fig. 9.4 A sequence from a video recording of the alloying process of 
tin atoms into a nanometer-sized gold particle. A crystalline-to- 
amorphous (C-A) transition took place during deposition of tin onto 
the gold particle. The time is indicated as in Fig. 9.3. 


Figure 9.4 shows a typical sequence of an alloying process of tin atoms into an 
approximately 6 nm-sized gold particle at RT. In Fig. 9.4(a) the as-produced pure 
gold particle lies on a graphite substrate (top) [24]. Figures 9.4(b)-(f) show the 
same particle under tin-atom deposition. In Figs. 9.4(b)-(d), the (111) lattice 
fringes of the gold particle become more and more distorted and segmented 
with an increasing amount of tin, and eventually the salt-and-pepper contrast ap- 
pears as a characteristic reminiscent of an amorphous structure (Figs. 9.4e,f). 

The observations shown in Fig. 9.4 indicate that tin atoms vapor-deposited onto 
a gold NP rapidly dissolve into the gold particle, which is consistent with the re- 
sults on In-Sn in Fig. 9.3. A difference is that the alloyed gold particle transforms 
into an amorphous phase. 

In an attempt to study the thermal stability of the amorphous phase such 
as that shown in Figs. 9.4(e) and (f), additional experiments on annealing of 
Au-36 at.% Sn alloy particles were carried out. Figure 9.5 shows a sequence of 
structural changes in an Au-36 at.% Sn alloy particle during heating and subse- 
quent cooling [25]. In Fig. 9.5(a) the as-produced amorphous alloy particle is sit- 
ting on a graphite substrate (bottom). (As shown in Fig. 9.4, Au-Sn alloy NPs 
formed at RT are in an amorphous state, when the composition is in a range of 
30-40 at.% Sn [24, 25].) With increasing temperature, the salt-and-pepper contrast, 
which is characteristic of the amorphous phase, starts to fluctuate; that is, at 
about 473 K individual black and bright spots in the salt-and-pepper contrast begin 
to move to and fro slowly in the particle, suggesting that atoms in the particle 
have started a long-range migration in a noncrystalline state (Figs. 9.5b,c). With 
continued heating, the particle melted, and in the molten state only a uniform 
contrast typical of the liquid state appeared (Fig. 9.5d). It should be noted here 
that before melting no traces of crystallization were observed. Upon cooling to 
RT, the liquid particle solidified again into the amorphous phase (Fig. 9.5e). 
From the results in Fig. 9.5, it is evident that the amorphous phase which appears 
over a composition range near the eutectic composition possesses an extremely 
high thermal stability so that upon heating it melts without crystallization and 
the melt solidifies directly into an amorphous solid upon cooling. 

As shown above, when the particle diameter was approximately 6 nm, a liquid 
phase and a thermodynamically stable solid amorphous phase were formed at RT 
in particles in the In-Sn and the Au-Sn system, respectively. We speak here of 
a thermodynamically stable amorphous phase, because the above result in the 
Au-Sn system suggests that the Gibbs free energy of the amorphous phase is 
lower than that of a crystalline counterpart, at least at temperatures near and 
above RT. This situation is a consequence of the large depression of Teu across 
the glass transition temperature (Tg) induced by the size reduction, which to the 
best of our knowledge never occurs in bulk materials. 

It is known that there is an empirical rule that Tg is located at a temperature 
near two-thirds of the melting temperature of a bulk material [26, 27]. According 
to this rule, Tg is expected to be located at a temperature higher than RT in the 
Au-Sn system, and lower than RT in the In-Sn system. Tg is considered 
to be directly related to the cohesive energy of the material [28], and is 
thought to be less sensitive to the size of particles in the size region around 10 
nm. In the case when the three temperatures, Teu, Tg, and RT (where the observation 
is carried out), lie in such an order as Tg > RT > Teu in NPs in the Au-Sn 
system, it is postulated that a crystalline-to-amorphous (C-A) phase change is 
induced by simply adding solute atoms (i.e., Sn atoms) to crystalline NPs of pure 
gold. This postulate is in agreement with what is shown above in Fig. 9.4. In the 
case, however, when the two temperatures Tg and RT lie in such an order as 
RT > Tg in NPs in the In-Sn system, it is postulated that a C-L phase transition 
would be induced instead by simply adding solute atoms (i.e., In atoms) to 
crystalline NPs of pure tin. Again, this postulate is in agreement with what is shown 
above in Fig. 9.3. It should be emphasized here that neither the amorphous phase 
in the former system nor the liquid phase in the latter system is the equilibrium 
phase at RT in bulk materials. These phases can be present at RT as the equilibrium 
phase only when the size of the system is in the nanometer range. One of 
the reasons for the enhanced suppression of Teu with size reduction may come 
from the fact that the relative contribution of interfacial energy associated with a 
distinct hetero-interface to the total Gibbs free energy of a crystalline two-phase 
particle increases with decreasing size of the particle, and therefore such an inter- 
face destabilizes a crystalline, two-phase structure relative to a noncrystalline, 
single-phase structure. In this manner, a noncrystalline phase becomes more sta- 
ble than a mixture of crystalline phases. 


9.3.3 
Structural Instability 


It is established that NPs often exhibit a structural instability under observation 
in a TEM; three different explanations have been proposed. The first was suggested 
by Iijima and Ichihashi [29, 30], who proposed that a particle and a local 
area of the insulating substrate might deviate temporarily from electrical neutral- 
ity under electron-beam irradiation and the coulombic forces would then induce 
structural fluctuations [30]. In accordance with this suggestion, it has been re- 
ported that the charging effect becomes weak when an electrically conductive sub- 
strate such as amorphous carbon is used [29]. In the second explanation, Ajayan 
and Marks [31, 32] proposed that the free energy barrier between different atomic 
configurations in a NP might be so small that the thermal activation is large 
enough even at RT to allow random fluctuations to occur among different con- 
figurations; this is called the quasi-molten state [32]. It has been confirmed ex- 
perimentally that once initiated, the particle remained in a quasi-molten state 
even after the electron-beam intensity is reduced to almost zero [32]. The quasi- 
melting model will be particularly suited to explaining such an observation. How- 
ever, it is emphasized that for a stable particle on the substrate to transform into 
the quasi-molten state, decoupling of the particle from the substrate and reduc- 
tion of the contact area between the particle and the substrate are required [32]. 
The third explanation was suggested by Mitome et al. [33]. Structural fluctuations 
in approximately 3 nm-sized gold particles occurred only at temperatures above 
about 773 K and the temperature became lower the smaller the particle was [33]. 
They also reported that under electron irradiation at an intensity of 10-15 A cm⁻² 
the particle temperature increased by about 20 K at most [33]. Based upon these 
observations, they suggested that the fluctuations are caused by a thermal effect 
[33]. 

Fig. 9.6 A typical alloying process sequence of bismuth atoms into a nanometer-sized 
tin particle, taken from a video recording: (a) as-produced pure tin particle on a 
graphite substrate; (b), (c) the same particle after bismuth deposition; (d) the same 
particle after additional bismuth deposition, with a two-phase microstructure; (e), (f) 
the same particle after turning off the power of the tungsten filament attached with 
the bismuth. The arrows in (d)-(f) indicate the position of a Bi/Sn hetero-interface, 
and the time is indicated as in Fig. 9.3. 



Figure 9.6 presents a typical sequence of alloying process of bismuth atoms 
into a tin NP [34]. Figure 9.6(a) shows an as-produced pure tin particle on a 
graphite substrate (bottom). The pure tin particle was a single crystal with a di- 
ameter of approximately 8 nm, in which structural fluctuations were completely 
absent. Figures 9.6(b) and (c) show the same particle after bismuth deposition. 
The particle size increased from ca. 8 to ca. 9.5 nm by bismuth deposition but 
the particle remained a single crystal, which indicates that a solid solution was 
formed in it. The bismuth concentration in the solid solution, as estimated from 
the size increment, was approximately 40 at.% Bi. In this solid solution particle, 
fluctuations in the particle shape took place at a rate of approximately once per 
1-3 s. The change in shape was always accompanied by a change in orientation. 
Therefore, the fluctuations observed here were orientational fluctuations of the 
crystallite, as seen from a comparison of Figs. 9.6(b) and (c). It should be noted 
that the fluctuations became more frequent with increasing bismuth concentra- 
tion in the solid solution. With continued bismuth deposition, the structure of 
the particle changed from a single crystalline solid solution to a mixture of two 
solid phases, as shown in Fig. 9.6(d). The particle in Fig. 9.6(d) was approximately 
12 nm in diameter. The bismuth concentration of this particle evaluated from the 
size increment was approximately 70 at.% Bi. It was also confirmed by an analy- 
sis of the Fourier transform pattern taken from each area in Fig. 9.6(d) that the 
particle was composed of a bismuth grain and a tin grain [34]. (Arrows in Figs. 
9.6(d)-(f) indicate the position of a Bi/Sn hetero-interface.) This is the same 
phase relationship as in a bulk alloy with the corresponding composition at RT 
[35]. It can therefore be concluded that the phase relationship in an alloy particle 
of approximate diameter 12 nm is essentially the same as that in a bulk alloy in 
the Bi-Sn system. Figures 9.6(e) and (f) show the particle as in Fig. 9.6(d) after 
the power of the tungsten filament carrying the bismuth had been turned off. 
For the 12 nm-sized alloy particle with a two-phase microstructure, the shape 
and position of the Bi-Sn hetero-interface changed continually with high fre- 
quency (i.e., a few times per second), in contrast to the solid solution particle 
shown in Figs. 9.6(a)-(c). It should be noted here that the evaporation of bismuth 
from the source filament had been stopped right before the micrograph in Fig. 
9.6(d) was taken. The structural fluctuations observed here are therefore not in- 
duced by the arrival of bismuth atoms on the particle. 
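
The composition estimate from the size increment can be illustrated with a minimal sketch. It assumes a spherical particle, additive molar volumes, and handbook-style molar volumes for β-Sn and Bi (16.29 and 21.31 cm³ mol⁻¹); none of these assumptions is stated in the text, so the sketch need not reproduce the quoted concentrations exactly.

```python
# Molar volumes in cm^3/mol: assumed handbook-style values, not taken from the text.
V_SN = 16.29  # beta-Sn
V_BI = 21.31  # Bi


def bi_fraction_from_growth(d_initial_nm, d_final_nm, v_sn=V_SN, v_bi=V_BI):
    """Atomic fraction of Bi in a particle that grew from d_initial to d_final.

    Assumptions (hypothetical model): the particle is a sphere of pure Sn
    before deposition, all added volume is Bi, and molar volumes are additive.
    """
    if d_final_nm < d_initial_nm:
        raise ValueError("final diameter must not be smaller than initial diameter")
    volume_ratio = (d_final_nm / d_initial_nm) ** 3
    n_sn = 1.0 / v_sn                   # moles of Sn per unit initial volume
    n_bi = (volume_ratio - 1.0) / v_bi  # moles of Bi in the added volume
    return n_bi / (n_bi + n_sn)


if __name__ == "__main__":
    for d_final in (9.5, 12.0):
        x_bi = bi_fraction_from_growth(8.0, d_final)
        print(f"8 nm -> {d_final} nm: x(Bi) = {x_bi:.2f}")
```

For the size changes reported above (8 to 9.5 nm and 8 to 12 nm), this simple model returns values of the same order as the quoted concentrations; a non-spherical particle shape on the substrate would shift the result.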

It is evident from the observations in Fig. 9.6 that even at the fixed temperature 
where the experiments were carried out (i.e., RT), the fluctuations became more 
frequent with increasing bismuth concentration in the alloy particle. It is of 
interest here that the difference in Gibbs free energy between a 12 nm-sized solid 
particle and the corresponding liquid particle of a tin-rich Bi-Sn alloy at RT decreases 
continuously from a high value of 1620 J mol⁻¹ for pure tin to almost 
zero for an alloy with a near-eutectic composition [36]. The reason why the Gibbs 
free energy of a 12 nm-sized solid particle with a two-phase microstructure be- 
comes almost equal to that of the corresponding liquid particle, even at RT, can 
be found in the large contribution of the Bi-Sn interfacial energy to the total 
Gibbs free energy of the solid particle [36]. As a result, Teu of a 12 nm-sized alloy 
particle can be lowered to a temperature slightly above RT [36]. In fact, it has 
been confirmed that Teu is below RT when the size of a particle is further reduced 
to approximately 10 nm or below [37]. All these results suggest the view that a 
solid NP, no matter whether it consists of a single phase or of multiple phases, 
undergoes structural fluctuations, when the Gibbs free energy difference between 
the solid particle and the corresponding liquid particle is reduced to a value close 
to zero. 

According to an estimate based upon Thermo-Calc [38], in a 12 nm-sized parti- 
cle with a near-eutectic composition in the Bi-Sn system, the Gibbs free energy 
difference between a liquid particle and a solid particle (with a two-phase microstructure) 
changes with temperature at a rate of approximately 100 J mol⁻¹ per 
5 K. This implies that if the particle is warmed up, for example, by 5 K under 
electron-beam irradiation, then the relative stability of the particle in the solid 
state in comparison with the liquid state deteriorates by 100 J mol⁻¹ when 
measured in terms of the free energy. This value of 100 J mol⁻¹ is quite small 
compared to the free energy difference in a 12 nm-sized particle of pure tin, i.e., 
1620 J mol⁻¹ as mentioned before. It is postulated that even if a temperature rise 
of 5 K is induced by electron irradiation, it would not affect the phase stability in 
a 12 nm-sized pure tin particle and therefore it would not bring about any structural 
fluctuations in the solid particle. On the other hand, the value of 100 J mol⁻¹ 
is not negligibly small compared to the free energy differences between a 
solid and a liquid particle with a near-eutectic composition [36]. It is therefore 
postulated that if a temperature rise of, for example, 5 K is induced by electron 
irradiation, it would produce an appreciable alteration in the relative phase stabil- 
ity and would result in the occurrence of structural fluctuations in the solid parti- 
cle. This postulate is in agreement with the observations shown in Fig. 9.6. These 
results provide evidence for the view that a solid NP undergoes structural fluctua- 
tions when the Gibbs free energy difference between the solid NP and the corre- 
sponding liquid NP becomes almost zero. This view is consistent with the sug- 
gestion by Mitome et al., who observed the onset of structural fluctuations in an 
approximately 3 nm-sized gold particle only when the substrate temperature was 
raised to about 773 K [33]. According to the experimental data of Buffat and Borel 
[6], the melting temperature of such a gold particle is around 773 K. This fact 
therefore indicates that the Gibbs free energy difference between the solid and 
the liquid particle of that dimension does take a value close to zero when the sub- 
strate temperature is increased to about 773 K, where the structural fluctuations 
set in. On the other hand, in the experiment shown in Fig. 9.6, a flake of graph- 
ite, a good electric conductor, was used as a substrate. In fact, in no case was the 
detachment of a particle from the substrate encountered during observation, a 
typical event which is expected for the charging effect; the contact area between 
the particle and the substrate remained almost the same, as shown in Fig. 9.6. It 
seems safe, then, to consider that the structural fluctuations shown in Fig. 9.6 
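were not caused by a charging effect. Moreover, because the contact area did not 
decrease, whereas the quasi-melting model requires decoupling of the particle from 
the substrate [32], the structural fluctuations shown in Fig. 9.6 are not compatible 
with the quasi-melting model either. 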

In conclusion, it is reasonable to assume that the structural fluctuations in an 
NP take place when the phase stability of a solid particle under observation be- 
comes almost equal to that of the particle in the liquid state. 
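
The comparison between the beam-induced free energy shift and the solid-liquid free energy gap can be put into numbers with a short sketch. The rate of 100 J mol⁻¹ per 5 K and the gap of 1620 J mol⁻¹ for pure tin are taken from the text; applying the same rate to pure tin follows the text's own comparison but is otherwise an assumption, and the near-eutectic gap of 50 J mol⁻¹ is a purely hypothetical placeholder for "almost zero".

```python
def kelvin_to_close_gap(gap_j_per_mol, slope_j_per_mol_per_k):
    """Temperature rise needed to remove a solid-liquid free energy gap,
    assuming the gap shrinks linearly with temperature at the given rate."""
    if slope_j_per_mol_per_k <= 0:
        raise ValueError("slope must be positive")
    return gap_j_per_mol / slope_j_per_mol_per_k


# Rate quoted in the text: about 100 J/mol per 5 K for a 12 nm near-eutectic particle.
SLOPE = 100.0 / 5.0  # J/mol per K

GAP_PURE_TIN = 1620.0     # J/mol, from the text
GAP_NEAR_EUTECTIC = 50.0  # J/mol, hypothetical stand-in for "almost zero"

if __name__ == "__main__":
    print(kelvin_to_close_gap(GAP_PURE_TIN, SLOPE))       # far above a 5 K rise
    print(kelvin_to_close_gap(GAP_NEAR_EUTECTIC, SLOPE))  # within a 5 K rise
```

Under these assumptions a 5 K rise covers only a small fraction of the gap for pure tin, whereas it is comparable to the whole gap near the eutectic composition, which is the distinction the argument relies on.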


9.3.4 
Thermodynamic Discussion 


9.3.4.1 Gibbs Free Energy in Nanometer-Sized Alloy Systems 

In order to specify the phase equilibrium of NPs, not only the Gibbs free energy 
in the bulk (i.e., the volume free energy), but also the surface free energy has to 
be taken into account due to the extremely large surface-to-volume ratio. There- 
fore the total Gibbs free energy of a nanometer-sized alloy system can be de- 
scribed by Eq. (1) [39]. 


$$G^{\mathrm{total}} = G^{\mathrm{bulk}} + G^{\mathrm{surface}} \qquad (1)$$


Calculations of $G^{\mathrm{total}}$ for a simple eutectic system (which consists of a liquid 
phase and two pure solid phases) can be done in the following way. 

The Gibbs free energy of a bulk A-B binary alloy, $G^{\mathrm{bulk}}$ in Eq. (1), which corresponds 
to $G^{\mathrm{total}}$ in bulk materials, is expressed by Eqs. (2a) and (2b), where $N_A$ 
and $N_B$ are the mole fractions of components A and B. 

Table 9.2 Temperature dependence of the surface tension of pure liquids [41]. 

Element   Surface tension of pure liquid [N m⁻¹]   ∂γ^L/∂T [N m⁻¹ K⁻¹]
Bi        0.378 - 0.00007 (T - 1336.15)            -0.00007
Sn        0.560 - 0.00009 (T - 505)                -0.00009


$$G^{\mathrm{bulk},L} = N_A G_A^{0,L} + N_B G_B^{0,L} + RT(N_A \ln N_A + N_B \ln N_B) + G^{\mathrm{ex},L} \quad \text{(for liquid)} \qquad (2a)$$

$$G^{\mathrm{bulk},S} = N_A G_A^{0,S} + N_B G_B^{0,S} \quad \text{(for solid)} \qquad (2b)$$

$G_A^{0,L}$ and $G_B^{0,L}$ (or $G_A^{0,S}$ and $G_B^{0,S}$) are the Gibbs free energies of pure A and B relative 
to their reference phases (i.e., solid phases) at 298.15 K. $G^{\mathrm{ex},L}$ is the excess 
Gibbs free energy of mixing, which represents the interaction between the two 
components. $G^{\mathrm{ex},L}$ can be written as Eq. (3), with the interaction parameters $L_i$ 
($i = 0, 1, 2, \ldots$) [38]. 


$$G^{\mathrm{ex},L} = N_A N_B \{ L_0 + L_1 (N_A - N_B) + L_2 (N_A - N_B)^2 + \cdots \} \qquad (3)$$
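
As an illustration, the bulk liquid free energy of Eq. (2a) with a polynomial excess term of the form N_A N_B Σ L_i (N_A - N_B)^i can be evaluated as in the following minimal sketch. The function names and all numerical parameters in the usage lines are hypothetical; real interaction parameters would have to be taken from a thermodynamic database such as the one cited in the text.

```python
import math

R = 8.314  # gas constant, J mol^-1 K^-1


def excess_gibbs(n_a, coeffs):
    """Excess Gibbs free energy N_A*N_B*sum_i L_i*(N_A - N_B)**i, in J/mol.

    coeffs is the sequence (L_0, L_1, L_2, ...) of interaction parameters.
    """
    n_b = 1.0 - n_a
    diff = n_a - n_b
    return n_a * n_b * sum(l_i * diff ** i for i, l_i in enumerate(coeffs))


def ideal_mixing(n_a, temperature):
    """Ideal mixing term RT(N_A ln N_A + N_B ln N_B); zero for a pure component."""
    total = 0.0
    for x in (n_a, 1.0 - n_a):
        if x > 0.0:
            total += x * math.log(x)
    return R * temperature * total


def bulk_liquid_gibbs(n_a, g_a_liquid, g_b_liquid, temperature, coeffs):
    """Bulk liquid Gibbs free energy: reference terms + ideal mixing + excess."""
    n_b = 1.0 - n_a
    return (n_a * g_a_liquid + n_b * g_b_liquid
            + ideal_mixing(n_a, temperature)
            + excess_gibbs(n_a, coeffs))


if __name__ == "__main__":
    # Hypothetical parameters, for illustration only.
    print(bulk_liquid_gibbs(0.5, 1000.0, 2000.0, 350.0, [500.0, -100.0]))
```

Truncating the series after L_0 gives a regular-solution term that is symmetric about the equiatomic composition; the odd-order terms introduce the asymmetry.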





It is known that the surface energy depends upon composition [40]. Convention- 
ally, a linear relationship is assumed for a liquid phase, for simplicity. Then, the 
second term of Eq. (1) can be described by Eqs. (4) and (5), where $\gamma_X^S$ and $\gamma_X^L$ 
(X = A or B) are the surface energies of pure solid and liquid particles, r is the radius 
of a particle, and $V_A^S$ and $V_B^S$ are the molar volumes of pure solid A and B. 
As an example, the temperature dependence of $\gamma_X^L$ for Bi and Sn is listed in Table 
9.2 [41]. 


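$$G^{\mathrm{surface},S} = \frac{2}{r}\left(N_A \gamma_A^S V_A^S + N_B \gamma_B^S V_B^S\right) \qquad (4)$$

$$G^{\mathrm{surface},L} = \frac{2V^L}{r}\left(N_A \gamma_A^L + N_B \gamma_B^L\right) \qquad (5)$$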


The value of $\gamma_X^S$ of a pure solid at the melting temperature is known to be approximately 
25% greater than the surface tension of the pure liquid [39]. The surface energy 
$\gamma_X^S$ of a pure solid X as a function of temperature T can therefore be expressed 
as Eq. (6), where $\gamma_{X,\mathrm{mp}}^L$ is the surface energy of the pure liquid X at its 
melting temperature $T_{X,\mathrm{mp}}$. 


$$\gamma_X^S = 1.25\,\gamma_{X,\mathrm{mp}}^L + (T - T_{X,\mathrm{mp}}) \frac{\partial \gamma_X^L}{\partial T} \qquad (6)$$

The temperature dependence of $\gamma_X^S$ in Eq. (6) is assumed to be the same as that 
of $\gamma_X^L$. Finally, it is also assumed that a surface with a free energy $\gamma$ (in J m⁻²) exerts 
a surface tension of $\gamma$ (in N m⁻¹) in both solid and liquid. 
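
The surface terms of Eqs. (4)-(6) can be sketched as follows. The tin surface tension data are those of Table 9.2; the molar volume of solid tin (16.29 cm³ mol⁻¹) and the reading of the liquid prefactor as 2V^L/r, with V^L the molar volume of the liquid, are assumptions introduced for this illustration.

```python
def gamma_liquid(temperature, gamma_mp, t_mp, dgamma_dt):
    """Surface tension of a pure liquid, linear in temperature (form of Table 9.2)."""
    return gamma_mp + dgamma_dt * (temperature - t_mp)


def gamma_solid(temperature, gamma_mp, t_mp, dgamma_dt):
    """Surface energy of a pure solid: 25% above the liquid value at the melting
    point, with the same temperature coefficient as the liquid (Eq. 6)."""
    return 1.25 * gamma_mp + dgamma_dt * (temperature - t_mp)


def surface_gibbs_solid(radius, n_a, gamma_a, v_a, gamma_b, v_b):
    """Surface contribution for the solid in J/mol: (2/r)(N_A g_A V_A + N_B g_B V_B)."""
    n_b = 1.0 - n_a
    return (2.0 / radius) * (n_a * gamma_a * v_a + n_b * gamma_b * v_b)


def surface_gibbs_liquid(radius, n_a, gamma_a, gamma_b, v_liquid):
    """Surface contribution for the liquid in J/mol: (2 V_L / r)(N_A g_A + N_B g_B)."""
    n_b = 1.0 - n_a
    return (2.0 * v_liquid / radius) * (n_a * gamma_a + n_b * gamma_b)


# Tin data from Table 9.2 (N/m, K, N/m/K) and an assumed molar volume (m^3/mol).
SN_GAMMA_MP, SN_T_MP, SN_DGAMMA_DT = 0.560, 505.0, -0.00009
SN_MOLAR_VOLUME = 16.29e-6  # assumption, not given in the text

if __name__ == "__main__":
    g_s = gamma_solid(350.0, SN_GAMMA_MP, SN_T_MP, SN_DGAMMA_DT)
    # Pure tin particle of 10 nm diameter (radius 5 nm); the B terms vanish for N_A = 1.
    print(surface_gibbs_solid(5e-9, 1.0, g_s, SN_MOLAR_VOLUME, 0.0, 0.0))
```

Because both surface terms scale as 1/r, halving the particle radius doubles the surface contribution; the larger surface energy of the solid is what raises its free energy relative to the liquid in small particles.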


9.3.4.2 Result of Calculations 

With the use of Eqs. (1-6), it is possible to construct a Gibbs free energy- 
temperature-composition (G-T-X) diagram for an alloy NP. An example of such 
a G-T-X diagram is depicted in Fig. 9.7, which is constructed based upon the 
result of calculations for alloys in the Bi-Sn system. The solid and dotted lines 
indicate the Gibbs free energies of the bulk material and a 10 nm-sized particle, 
respectively. The relatively large increase in the Gibbs free energy in a solid NP 
compared to that in a liquid NP leads to the melting temperature depression ΔT 
of 51 K from 544 K to 493 K in pure bismuth and of 112 K from 505 K to 393 K in 
pure tin particles, respectively. 


Fig. 9.7 Gibbs free energy-temperature-composition (G-T-X) diagram 
constructed from the result of calculations. The solid and dotted lines 
indicate the Gibbs free energy of bulk material and 10 nm-sized 
particles, respectively. 

Figure 9.8 shows the Gibbs free energy-composition (G-X) diagrams of a 10 
nm-sized Bi-Sn particle at several different temperatures; clearly, according to 
the calculation, Tm of 10 nm-sized particles of pure bismuth and of pure tin are 
493 K (Fig. 9.8b) and 393 K (Fig. 9.8c), respectively. The melting temperature 
depressions ΔT obtained here, i.e., ΔT = 51 K for bismuth, and ΔT = 112 K for 
tin, are somewhat smaller than those confirmed by experiments [7]. A possible 
reason for this discrepancy may be that such effects as the presence of a hetero- 
interface and the intrinsic lattice distortion in a solid NP are not included in the 
present calculation. An improved calculation, taking these effects into account, 
would predict larger values for the melting temperature suppression, ΔT. 

As mentioned above, the effect of the interface between the different phases is 
not included in the calculation. In the following, the further enhancement of the 
suppression of the eutectic temperature, ΔTeu, in alloy NPs by the influence of the 
interfacial energy associated with a hetero-interface in a solid particle is dis- 
cussed. The relative contribution of interfacial energy to the total Gibbs free en- 
ergy of a system becomes increasingly large with decreasing size of the system. 
In the case where the composition of an alloy particle falls in the two-phase re- 
gion and the particle becomes decreasingly small, the relative contribution of 
interfacial energy therefore becomes so high that the total Gibbs free energy of a 
two-phase mixture cannot be expressed as a straight line connecting points A0 
and B0 in Fig. 9.9, as can be done for bulk materials, but the total Gibbs free 
energy of the mixture should be expressed as a convex curve, such as those 
shown by dotted lines in Fig. 9.9 [42]. The upward deviation of these convex 
curves (denoted as ΔGinterface in Fig. 9.9) from the straight line A0B0 increases 
with decreasing size of the system. 

In general, the interfacial energies between two solid phases range from 0.2 J m⁻² 
(in a coherent interface) to 1.0 J m⁻² (in an incoherent interface) [43]. In the 
literature it is reported that the interfacial energy depends on the size of a system, 
but that the dependence is rather weak over the size range discussed here [44-46]. 
In the present calculation the size dependence was therefore ignored, for 
simplicity. Based upon these premises, the contribution of the interfacial energy 
to the total Gibbs free energy has been evaluated as a function of the size of par- 
ticles in the Bi-Sn system, and examples of the results are listed in Table 9.3. The 
values in this table have been calculated assuming that the interfacial energy is 
0.2 J m⁻². It is seen from Table 9.3 that the contribution of the interfacial energy, 
ΔGinterface, becomes higher than the Gibbs free energy difference between the 
single-phase liquid and the two-phase mixture in bulk at RT when the particles 
become smaller than about 10 nm (this Gibbs free energy difference corresponds 
to the gap between curve L and the straight line A0B0 in Fig. 9.9). In other words, 
Teu becomes lower than RT in 10 nm-sized alloy particles. This means that the 
total Gibbs free energy of an isolated Bi-Sn alloy particle 10 nm in diameter and 
of eutectic composition, even at RT, becomes lower when the particle is in the liq- 
uid single-phase state than in the two-phase (i.e., Bi and Sn) state, when a hetero- 
interface is considered. This is consistent with the experimental observations [37]. 
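
The size dependence of the interfacial contribution can be checked against the tabulated values with a minimal sketch. The diameters, interface areas, and ΔGinterface values are those listed in Table 9.3; taking the maximum interface as a flat circular section through the centre of a spherical particle is an assumption made here, as are the function names.

```python
import math

# (diameter [nm], max. interface area [nm^2], dG_interface [J/mol]) from Table 9.3
TABLE_9_3 = [(5, 20, 4357), (10, 79, 2224), (12, 113, 1815), (20, 314, 1088)]


def max_interface_area(diameter_nm):
    """Area of a flat interface through the centre of a sphere (assumed geometry)."""
    return math.pi * (diameter_nm / 2.0) ** 2


def scaling_products(rows):
    """dG_interface * d for each row; roughly constant if dG scales as 1/d."""
    return [d * dg for d, _, dg in rows]


def relative_spread(values):
    """(max - min) / mean of a list of numbers."""
    mean = sum(values) / len(values)
    return (max(values) - min(values)) / mean


if __name__ == "__main__":
    for d, area, _ in TABLE_9_3:
        print(d, area, round(max_interface_area(d)))
    print(relative_spread(scaling_products(TABLE_9_3)))
```

The interface area grows as d² while the number of atoms grows as d³, so the contribution per mole is expected to vary as 1/d; the near-constant product of ΔGinterface and diameter in the tabulated data reflects this.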


Fig. 9.9 Schematic diagram showing the increase in Gibbs free energy 
due to the contribution of the interfacial energy in a nanometer-sized 
alloy particle with a solid/solid two-phase microstructure. 


Table 9.3 Size dependence of the contribution of the interfacial energy 
to the total Gibbs free energy of an alloy particle with a two-phase 
microstructure. 


No. of    Alloy particle    Max. area of       ΔGinterface     Gibbs free energy difference between
atoms     diameter [nm]     interface [nm²]    [J mol⁻¹] (a)   the liquid phase and the two-phase
                                                               mixture in bulk at RT [J mol⁻¹]

2170      5                 20                 4357
17000     10                79                 2224
30000     12                113                1815
139000    20                314                1088

a Contribution of interfacial energy to total Gibbs free energy. 


9.4 
Solid/Liquid Two-Phase Microstructure 


9.4.1 
Solid-Liquid Phase Transition 


Recently, it became possible to observe directly an alloy NP in the solid/liquid 
two-phase state and to analyze the phase relation in a direct manner. 


Fig. 9.10 A sequence from a video recording of the alloying process of tin atoms into two 
differently sized (8 nm and 5 nm) bismuth particles: (a) and (a’) as-produced bismuth 
particle on the graphite substrate; (b)-(d), (b’)-(d’) the same particle during tin 
deposition. In the 8 nm-sized bismuth particle during tin deposition, (b) a crystalline/liquid 
interface was first formed in the interior which then moved right and upward at the 
expense of the crystalline phase (cf. (b) and (c)) until the whole particle became liquid 
(d). However, for the 5 nm-sized bismuth particle during tin deposition, first a 
bismuth-rich solid solution was formed (cf. (b’) and (c’)), which with continued deposition 
abruptly changed directly into the liquid phase (d’). The arrow in (b) indicates the 
location of a crystal/liquid interface, and the time is indicated as in Fig. 9.3. 


Figure 9.10 shows typical sequences of the alloying process of tin atoms in two 
bismuth NPs of different sizes at around 350 K [42, 47]. The three numbers inserted 
in each micrograph indicate the time in units of minutes, seconds, and 
sixtieths of a second. Figure 9.10(a) shows an as-produced pure bismuth particle. 
The particle was approximately 8 nm in diameter, and the 0.327 nm-spaced 
fringes seen in the particle correspond to the (01$\bar{1}$2) lattice fringes of pure 
bismuth. The 0.335 nm-spaced fringes correspond to the (0001) lattice fringes of 
the graphite substrate. Figure 9.10(b) shows the same particle after tin deposition, 
which changed the structure of the particle from a single crystal to a mixture of a 
crystalline phase and a liquid phase. The liquid phase corresponds to the portion 
in uniform contrast in the lower left part of the particle (Fig. 9.10b). A faceted sur- 
face of the crystalline bismuth (shown in Fig. 9.10a) was replaced by a round, 
curved surface of the liquid phase (the arrow in Fig. 9.10(b) indicates the location 
of a crystal/liquid interface). With continued deposition of tin atoms, the crystal/ 
liquid interface moved right and upward at the expense of the crystalline phase 
(Fig. 9.10c), and eventually the whole particle became liquid (Fig. 9.10d). An 
EDS spectrum taken from the central portion of the particle depicted in Fig. 
9.10(d) showed that the composition of the particle was Bi-50 at.% Sn (spectrum 
not shown here). It took about 4.5 min to form the liquid alloy particle (Fig. 
9.10d) by alloying of tin atoms into the pure bismuth particle (Fig. 9.10a). This 
period can be considered long enough for the alloy particle to reach its equilibrium 
state. The change in the microstructure mentioned above (i.e., crystalline 
→ (crystalline + liquid) → liquid) is qualitatively consistent with that predicted 
from the phase diagram for bulk materials between the eutectic temperature and 
the melting temperature of bismuth [35]. 

Figure 9.10(a’) shows another as-produced pure bismuth particle, which is ap- 
proximately 5 nm in diameter, i.e., smaller than the particle in Fig. 9.10(a). The 
0.327 nm-spaced fringes seen in the particle can be assigned consistently as the 
(01$\bar{1}$2) lattice fringes of bismuth, and the 0.393 and 0.37 nm-spaced fringes as 
the (0003) and (01$\bar{1}$1) lattice fringes, respectively. Figure 9.10(b’) shows the same 
particle after the onset of tin deposition. In spite of the deposition and concomi- 
tant alloying of tin atoms, the structure of the particle remained single-crystalline, 
as shown in Fig. 9.10(b’), suggesting the formation of a terminal bismuth solid 
solution. With continued tin deposition, the structure of the particle still re- 
mained unchanged for a while (Fig. 9.10c’), but then the particle underwent a 
C-L phase change abruptly, as shown in Fig. 9.10(d’). The C-L phase change occurred 
within 1/60 s, the time resolution of the present video recording system, and 
formation and movement of a crystal/liquid interface were not detected. This is 
in sharp contrast to the manner of phase change observed in the larger (8 nm) 
particle shown in Figs. 9.10(a)-(d). From these results, it seems reasonable to 
consider that the phase change shown in Figs. 9.10(a’)-(d’) took place not by a 
heterogeneous mechanism (i.e., by the movement of a crystal/liquid interface) 
but by a homogeneous mechanism. It took approximately 8 min to form the liq- 
uid alloy particle (Fig. 9.10d’) by alloying of tin atoms into the pure, crystalline 
bismuth particle (Fig. 9.10a’). This period of about 8 min is considered again to 
be long enough to achieve an equilibrium state. In short, when the bismuth par- 
ticle was 5 nm or below, a direct C-L phase change took place with increasing 
concentration of tin, but when the bismuth particle was 8 nm or larger, the C-L 
transition proceeded via the migration of a solid/liquid interface. 

In an attempt to compare the phase change in bismuth NPs induced by tin 
alloying (i.e., Figs. 9.10a-d) with that in tin NPs induced by bismuth alloying, a 
C-L phase change in a tin NP associated with alloying of bismuth is depicted in 
Fig. 9.11, which shows a typical sequence of the alloying process of bismuth 
atoms into a tin NP at around 350 K [47]. 

Figure 9.11(a) shows an as-produced pure tin particle, approximately 8 nm in 
diameter; the 0.29 nm-spaced fringes seen in the particle correspond to the 
(020) lattice fringes of β-Sn. Figure 9.11(b) shows the same particle after bismuth 
deposition. In spite of the deposition and alloying of bismuth atoms, the structure 
of the particle remained single-crystalline. With continued bismuth deposition, 
the structure of the particle was still unchanged for a while (Figs. 9.11c, d), 
but it then underwent an abrupt C-L phase change (Fig. 9.11e). The C-L phase 
change occurred again within 1/60 s, the time resolution of the present video recording 
system, and formation and movement of a crystal/liquid interface were not 
observed. An EDS spectrum of the central portion of the particle depicted in Fig. 
9.11(e) showed that the composition of the particle was Sn-6 at.% Bi (Fig. 9.11f). 


Fig. 9.11 An alloying process sequence of bismuth atoms into a 
nanometer-sized tin particle: (a) as-produced tin particle on a graphite 
substrate; (b)-(e) the same particle during bismuth deposition. First 
(b)-(d) a tin-rich solid solution was formed, which with continued 
bismuth deposition changed abruptly into the liquid phase (e). (f) EDS 
spectrum of the central portion of the particle depicted in (e). 


It has been shown above that the change in the microstructure associated with 
alloying on the bismuth side (see Figs. 9.10a-d) differs from that on the tin side 
(Fig. 9.11) in the 8 nm-sized particle: in the former a sequence of phase changes 
of crystalline → (crystalline + liquid) → liquid has taken place, whereas in the 
latter a direct change from crystalline to liquid has occurred without passing 
through a two-phase crystalline/liquid mixture state. It is also shown that even 
on the bismuth side, the microstructure change induced by alloying of tin atoms 
has taken place in different manners depending upon the size of particle (com- 
pare Figs. 9.10(a)-(d) with Figs. 9.10(a’)-(d’)). It is of interest to analyze these 
results from the thermodynamic viewpoint. 

Figure 9.12 shows the Gibbs free energy curves for the liquid phase, the Bi- 
based rhombohedral phase, and the Sn-based bct phase in a 10 nm-sized particle 
at 350 K over a composition range from pure tin to pure bismuth. The Gibbs free 
energy curves were calculated in such a manner that curves for bulk materials 
available in the Thermo-Calc database system [38] were modified by taking the 
contribution from the surface energy into account [47], as described 
in Section 9.3.4.1. It is evident from the figure that in a 10 nm-sized alloy 
particle at 350 K, the conjugate phase of the Bi-based rhombohedral phase is not 
the Sn-based bct phase but the liquid phase. This fact suggests that Teu is below 
350 K in this reduced-size system, which is in agreement with the present obser- 
vations shown in Figs. 9.10 and 9.11. In the case of alloy NPs, in addition to the 
contribution of the surface energy mentioned above, the contribution of the solid/liquid 
interfacial energy, $\gamma^{\mathrm{S/L}}$, to the total Gibbs free energy has also to be taken 
into account. This contribution from $\gamma^{\mathrm{S/L}}$ is not included in the curves in Fig. 
9.12. 

Fig. 9.12 Calculated Gibbs free energy curves for the liquid phase 
(liquid), the Sn-based bct phase (bct) and the Bi-based rhombohedral 
phase (rhomb) in a 10 nm-sized alloy particle at 350 K. See the text for 
the other labels and lines. 

Therefore, for example, the total Gibbs free energy of a 10 nm-sized alloy 
particle of composition X’ between compositions D’ and E’, consisting of the 
bismuth-based rhombohedral phase and the liquid phase in Fig. 9.12, cannot be 
given by point X on the common tangent DE but by point X” located somewhat 
above point X. The difference between points X and X” corresponds to the contri- 
bution of y*. When it is assumed that there is only one, rather flat, solid/liquid 
interface in a 10 nm-sized particle as observed in the above experiment, then 
point X” will be on a convex curve joining points D and E. If the shape of the par- 
ticle is postulated to be hemispherical, the curve will be a parabolic one with the 
maximum at a composition near the midpoint between compositions D’ and E’. 
Such a parabolic curve is shown in Fig. 9.12 (dotted line). $T_0^{\mathrm{rhomb/L}}$ is the point 
where the total Gibbs free energies for the bismuth solid solution and for the 
liquid phase become equal. $T_{\mathrm{eq}}^{\mathrm{rhomb/L}}$ and $T_{\gamma}^{\mathrm{rhomb/L}}$ are the points where the vertical line 
through $T_0^{\mathrm{rhomb/L}}$ intersects with the common tangent DE and the parabolic curve 
(dotted line), respectively. A preliminary specific estimate shows that in a 10 nm-sized 
alloy particle, the Gibbs free energy difference between points $T_{\mathrm{eq}}^{\mathrm{rhomb/L}}$ and 
$T_{\gamma}^{\mathrm{rhomb/L}}$ is small compared to the Gibbs free energy difference between points $T_{\mathrm{eq}}^{\mathrm{rhomb/L}}$ 
and $T_0^{\mathrm{rhomb/L}}$ if a value of 0.061 J m⁻² is used for $\gamma^{\mathrm{S/L}}$ [48]. In fact, the parabolic 
curve (dotted line) in this figure was calculated using this value. The height of 
the maximum of this curve relative to the common tangent DE is approximately 
165 J mol⁻¹. This means that in a 10 nm-sized alloy particle, the interfacial energy 
$\gamma^{\mathrm{S/L}}$ of 0.061 J m⁻² would make a maximum contribution of about 165 J 
mol⁻¹ to the total Gibbs free energy at a composition near the midpoint between 
compositions D’ and E’. 

Fig. 9.13 Dependence of Gibbs free energy difference in a 10 nm-sized 
alloy particle at 350 K on (a) temperature and (b) particle size. See Fig. 
9.12 for the meanings of the labels. 

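The Gibbs free energy difference between points $T_{\mathrm{eq}}^{\mathrm{rhomb/L}}$ and $T_0^{\mathrm{rhomb/L}}$ and that 
between points $T_{\mathrm{eq}}^{\mathrm{bct/L}}$ and $T_0^{\mathrm{bct/L}}$ over a temperature range from 350 to 390 K are 
shown in Fig. 9.13(a). It should be noted that the Gibbs free energy difference between 
points $T_{\mathrm{eq}}^{\mathrm{rhomb/L}}$ and $T_0^{\mathrm{rhomb/L}}$ is much larger than that between $T_{\mathrm{eq}}^{\mathrm{bct/L}}$ and 
$T_0^{\mathrm{bct/L}}$ and is also larger than that between points $T_{\mathrm{eq}}^{\mathrm{rhomb/L}}$ and $T_{\gamma}^{\mathrm{rhomb/L}}$, which is approximately 
165 J mol⁻¹, over the 350-390 K range. In this case, it is predicted that a 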
phase change essentially similar to that for bulk material will take place upon al- 
loying tin atoms into a bismuth particle. That is, a pure bismuth particle would 
first change to a particle with the crystal/liquid two-phase microstructure and 
then eventually to a particle of the liquid phase. This prediction is in agreement 
with the observation depicted in Figs. 9.10(a)-(d). On the other hand, as seen 
from Figs. 9.12 and 9.13(a), the Gibbs free energy difference between points 
$T_0^{\mathrm{bct/L}}$ and $T_{\mathrm{eq}}^{\mathrm{bct/L}}$ on the tin-rich side, where $T_0^{\mathrm{bct/L}}$ is the point at which the total 
Gibbs free energies for the tin solid solution and for the liquid phase become 
equal, and $T_{\mathrm{eq}}^{\mathrm{bct/L}}$ is the point where the vertical line through $T_0^{\mathrm{bct/L}}$ intersects 
with the common tangent BC, is smaller than that between points $T_{\mathrm{eq}}^{\mathrm{bct/L}}$ 
and $T_{\gamma}^{\mathrm{bct/L}}$ (not shown here); the latter is approximately 155 J mol⁻¹ if a value of 
0.055 J m⁻² is used for $\gamma^{\mathrm{S/L}}$ [48], over a 350-390 K range. In this case it is therefore 
predicted that with increasing concentration of bismuth, a particle of the terminal 
tin solid solution would change directly into a particle of the liquid phase, without 
having a stage of solid/liquid coexistence. This prediction is consistent with the 
observation that the phase change from the terminal tin solid solution to the 
liquid phase took place very quickly and in the process no movement of a 
crystal/liquid interface was confirmed, as shown in Fig. 9.11. The contribution
of the solid/liquid interfacial energy, $\gamma^{i}$, toward the total Gibbs free
energy is expected to increase with decreasing size of the system.
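
The size scaling behind this statement can be sketched numerically. The snippet
below is a rough illustration rather than the authors' calculation. It assumes
that the interface area scales as the square of the particle diameter and the
number of atoms as its cube, so that the molar interfacial contribution scales
as the inverse diameter, and it is calibrated to the value of about 165 J mol⁻¹
quoted above for a 10 nm-sized particle. The function name and the calibration
approach are hypothetical.

```python
# Rough scaling sketch. Assumption (not from the text): interface area ~ d^2 and
# atom count ~ d^3, hence the molar interfacial contribution scales as 1/d.
REFERENCE_DIAMETER_NM = 10.0    # particle size quoted in the text
REFERENCE_CONTRIBUTION = 165.0  # J/mol, value quoted in the text for 10 nm


def interfacial_contribution(diameter_nm: float) -> float:
    """Estimated molar contribution (J/mol) of the solid/liquid interface."""
    if diameter_nm <= 0:
        raise ValueError("diameter must be positive")
    return REFERENCE_CONTRIBUTION * REFERENCE_DIAMETER_NM / diameter_nm


if __name__ == "__main__":
    for d in (10.0, 8.0, 6.0, 4.0):
        print(f"d = {d:4.1f} nm -> about {interfacial_contribution(d):.0f} J/mol")
```

Under this assumption, halving the particle diameter doubles the interfacial
contribution per mole.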


The Gibbs free energy difference between points on L and TÜ@b/l and that 


between points Tee "and T% at 350 K over a size range from 4 to 10 nm are 
shown in Fig. 9.13(b). It should be noted that the Gibbs free energy difference be- 
tween points Toe. "and u E slowly decreases, but that the contribution of 
the solid/liquid interfacial energy, $\gamma^{i}$, toward the total Gibbs free energy increases
with decreasing size of the system, and these two values become almost equal at
an approximately 4 nm-sized alloy particle as shown in Fig. 9.13(b). This means
that for the system smaller than approximately 4 nm in size, an alloy with a bis- 
muth solid-solution/liquid two-phase microstructure is not necessarily thermody- 
namically favorable compared to the alloy with a single-phase microstructure (i.e., 
either a liquid phase or a terminal bismuth solid solution). Therefore, in such a 
case, it is predicted that with increasing concentration of tin, a particle of the ter- 
minal bismuth solid solution would directly change into a particle of the liquid 
phase, without having a stage of solid/liquid coexistence. This prediction is qual- 
itatively consistent with the observation that in an approximately 5 nm-sized alloy 
particle the phase change from the terminal bismuth solid solution to the liquid 
phase took place very quickly and in the process no movement of a crystal/liquid 
interface was confirmed, as shown in Figs. 9.10(a’)-(d’). From these discussions, 
it seems reasonable to consider that the different manners in which the phase 
change occurs as observed in Figs. 9.10 and 9.11 can be explained consistently 
in terms of the magnitude of the contribution of $\gamma^{i}$ relative to the Gibbs free
energy difference between the pair of points compared above.


9.4.2 
Two-Phase Microstructure 


The morphology of the two-phase microstructure in alloy NPs is a function of the 
surface energy and the interfacial energy. Details will be discussed in this section 
using examples from Au-Sn and Bi-Sn systems. 

Figure 9.14 shows a typical sequence of microstructural changes in an isolated
gold NP during alloying with tin at 430 K. This temperature lies below
$T_{\mathrm{eu}}$ of the bulk alloy (i.e., 551 K for the Au-Sn system [35]), but above
$T_{\mathrm{eu}}$ of 10 nm-sized alloy particles [11]. Figure 9.14(a) shows an as-produced
pure gold particle on the surface of graphite at 430 K. The particle is
approximately 8 nm in diameter, and the 0.235 nm-spaced fringes seen in it
correspond to the (111) lattice fringes of pure gold. The 0.335 nm-spaced fringes
correspond to the (0001) lattice fringes of the graphite substrate. Figures
9.14(b)-(h) show the same particle undergoing continuous deposition of tin atoms.
With tin deposition, a thin liquid layer appeared in the form of a spherical shell
around the crystalline gold solid solution, as shown in Figs. 9.14(c)-(g). The
formation of a thin liquid shell can be inferred from two facts: the shape of the
particle changed from a faceted polyhedron (Fig. 9.14a) to a truncated sphere
(Fig. 9.14c), and the (111) lattice fringes of the gold-tin solid solution
disappeared in the outermost peripheral region of the particle. With continued
deposition of tin atoms, the interface between the crystalline gold solid solution
and the liquid shell moved inward at the expense of the crystalline core (i.e., the
central region appearing dark) as shown in Figs. 9.14(d)-(g), and eventually the
whole particle became liquid as shown in Fig. 9.14(h). Previous observations
indicate that the alloying reaction in isolated NPs takes place quite rapidly
compared to that in bulk alloys [25], so that alloy phase formation can occur even
at RT in the Au-Sn system [24, 25]. It therefore seems reasonable to consider that
the core (crystal)-shell (liquid) two-phase microstructure seen in Figs.
9.14(c)-(g) is in an equilibrium state at this temperature of around 430 K [49].

Fig. 9.14 Sequence from a video recording of the alloying process of tin
atoms into a nanometer-sized gold particle: (a) as-produced gold
particle on a graphite substrate; (b)-(h) the same particle undergoing
continuous tin deposition. (c)-(e) First a thin liquid shell was formed
over the gold-tin solid solution; (f)-(g) the liquid layer grew inward at
the expense of crystalline solid solution until (h) the whole particle
became liquid. The time is indicated as in Fig. 9.3.

On the other hand, it is known that in alloy NPs in the Bi-Sn system, a quite
different crystal-liquid two-phase microstructure develops in an equilibrium
state, as shown in Figs. 9.10(b) and (c). That is, when tin atoms were deposited
onto a particle of bismuth solid solution, a two-phase structure appeared in which
a single, rather flat, planar hetero-interface separated a crystalline phase (the
bismuth solid solution) from a liquid phase (see Fig. 9.10b). With continued tin
deposition, the interface moved right and upward at the expense of the crystalline
phase until the whole particle became liquid (Figs. 9.10c,d). As mentioned above,
both a gold NP and a bismuth NP undergo the sequence of phase changes
crystal (solid solution) → crystal + liquid → liquid with increasing tin
concentration. This sequence itself is consistent with the sequence which an alloy
would exhibit when the solute concentration increases across the crystal + liquid
two-phase region bounded by the solidus and liquidus curves. However, the
microstructure developed in the crystal + liquid two-phase region in particles in
the Au-Sn system differed from that in the Bi-Sn system: a core (crystal)-shell
(liquid) structure (designated as case 1 hereafter) appeared in the former system,
whereas a simple two-phase structure with a planar interface separating the
crystalline and the liquid phases (designated as case 2 hereafter) appeared in the
latter system. These two microstructures are illustrated schematically in Fig. 9.15.

Fig. 9.15 Schematic illustrations of possible two-phase microstructures
in an isolated nanometer-sized alloy particle: (a) case 1; (b) case 2. See
the text for details.

It has been reported by Hagège and Dahmen that several different microstructures
within small Pb-Cd inclusions embedded in an aluminum matrix can be discussed
qualitatively in terms of the relative magnitudes of the A-B, A-M, and B-M
interfacial energies (here A, B, and M stand for Pb, Cd, and Al, respectively)
[50]. A similar analysis would be possible for isolated alloy particles if the
crystal-liquid interfacial energy, $\gamma^{i}$, the surface energy of the crystal,
$\gamma^{s}$, and the surface energy of the liquid, $\gamma^{l}$, are employed instead of
the A-B, A-M, and B-M interfacial energies, respectively, in Hagège and Dahmen’s
treatment. Based upon this premise, the experimental observations shown in Figs.
9.10 and 9.14 are discussed in terms of $\gamma^{i}$, $\gamma^{s}$, and $\gamma^{l}$. First, the
difference in Gibbs free energy (per mol), $\Delta G$, between a bulk alloy and an
alloy NP is calculated for cases 1 and 2. In the case of NPs, the surface and
interfacial energies make an appreciable contribution to the total Gibbs free
energy. We assume here that the liquid phase and the crystalline solid solution
phase occupy the same volume fraction (i.e., 50%) of a particle. Then, the
difference in molar Gibbs free energy for case 1, $\Delta G_1$, can be described by
Eq. (7), where $r$, $\gamma^{l}$, and $\gamma^{i}$ are the radius of the particle, the surface
energy of the liquid, and the interfacial energy between the crystalline solid
solution and the liquid, respectively.


$$\Delta G_1 = 4\pi r^{2}\left(\gamma^{l} + 0.63\,\gamma^{i}\right) \qquad (7)$$




The difference for case 2, $\Delta G_2$, can be described by Eq. (8), where $\gamma^{s}$ is
the surface energy of the crystalline solid solution. (In Eqs. (7) and (8), a
numerical factor of $N_0/N$ is omitted for simplicity, where $N_0$ and $N$ are the
Avogadro number and the number of atoms comprising a particle, respectively.)


$$\Delta G_2 = \pi r^{2}\left(2\gamma^{s} + 2\gamma^{l} + \gamma^{i}\right) \qquad (8)$$
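
The numerical factors in Eqs. (7) and (8) can be checked with a short geometric
argument. The sketch below assumes an idealized geometry that the text does not
spell out: a spherical particle of radius $r$, in which case 1 has a concentric
spherical crystalline core of radius $r_c$ and case 2 is divided by a flat
interface through the center of the sphere. The symbol $r_c$ is introduced here
for illustration only.

```latex
% Case 1: concentric core of radius r_c occupying half of the particle volume
\frac{4}{3}\pi r_c^{3} = \frac{1}{2}\cdot\frac{4}{3}\pi r^{3}
\;\Rightarrow\; r_c = 2^{-1/3}\, r, \qquad
4\pi r_c^{2} = 2^{-2/3}\cdot 4\pi r^{2} \approx 0.63 \cdot 4\pi r^{2}

% Case 2: two hemispherical free surfaces (area 2\pi r^2 each)
% and one equatorial crystal/liquid interface (area \pi r^2)
2\pi r^{2}\gamma^{s} + 2\pi r^{2}\gamma^{l} + \pi r^{2}\gamma^{i}
= \pi r^{2}\left(2\gamma^{s} + 2\gamma^{l} + \gamma^{i}\right)
```

Under these assumptions the factor 0.63 is the ratio of the core surface area to
the particle surface area at equal volume fractions.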


Next, it should be noted that the microstructures in cases 1 and 2 are equally
stable when $\Delta G_1 = \Delta G_2$. For this condition Eq. (9) is satisfied.


$$2\gamma^{s} - 2\gamma^{l} - 1.52\,\gamma^{i} = 0 \qquad (9)$$


If we set $\gamma^{s} = Y\gamma^{l}$ and $\gamma^{i} = X\gamma^{l}$, then we get a simple but useful
equation [Eq. (10)] which enables us to predict the relative stability using the
two parameters $X$ and $Y$.


$$Y = 0.76X + 1 \qquad (10)$$


This is shown as a straight line in Fig. 9.16. In the region below the line (i.e.,
$\Delta G_1 > \Delta G_2$) a microstructure as in case 2 is preferred, whereas in the
region above the line (i.e., $\Delta G_1 < \Delta G_2$) a microstructure as in case 1 is
more favorable. The values of $\gamma^{s}$, $\gamma^{l}$, and $\gamma^{i}$ for actual alloys can
be estimated under the assumption that the surface and interfacial energies depend
linearly on composition. Values of $\gamma^{s}$, $\gamma^{l}$, and $\gamma^{i}$ for pure tin,
bismuth, and gold are collected from the literature [41, 48, 51]. It has been
confirmed that the tin concentrations in the bismuth solid solution and in the
liquid within a particle with the two-phase structure of melt + α-solid solution
are approximately 5 and 50 at.%, respectively [47]. Based upon this result, the
values of $\gamma^{s}$ and $\gamma^{l}$ in the Bi-Sn system can be taken as 0.5 and
0.47 J m⁻², respectively. The value of $\gamma^{i}$ is estimated to be 0.054 J m⁻².
Consequently, for particles in the Bi-Sn system, $\gamma^{s}/\gamma^{l}$ (= Y) and
$\gamma^{i}/\gamma^{l}$ (= X) are given as 1.064 and 0.115, respectively, and the point with
these coordinates is indicated by a filled circle labeled (Bi-Sn) in Fig. 9.16.
Since this point is below the line, it is predicted from Fig. 9.16 that in
particles in the Bi-Sn system a microstructure as in case 2 is thermodynamically
more favorable than that in case 1. This prediction is consistent with the
experimental result shown in Figs. 9.10(b) and (c). On the other hand, in the
Au-Sn system, the values of $\gamma^{s}$ and $\gamma^{l}$ can be taken as 1.43 and
0.99 J m⁻², respectively. The value of $\gamma^{i}$ is taken as 0.132 J m⁻². Then, for
particles in the Au-Sn system, $\gamma^{s}/\gamma^{l}$ (= Y) and $\gamma^{i}/\gamma^{l}$ (= X) are
given as 1.446 and 0.134, respectively. The point with these coordinates is
indicated by a filled circle labeled (Au-Sn) in Fig. 9.16. Since this point is
above the line, it is predicted from Fig. 9.16 that in particles in the Au-Sn
system a microstructure as in case 1 is thermodynamically more favorable than that
in case 2. This prediction is again consistent with the present experimental
result shown in Fig. 9.14.

Fig. 9.16 Diagram showing which two-phase microstructure (Fig. 9.15)
is more stable as a function of $\gamma^{s}/\gamma^{l}$ (= Y) and $\gamma^{i}/\gamma^{l}$ (= X). See the
text for details.
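
The stability criterion of Eq. (10) can be applied mechanically, as in the
minimal sketch below. It uses the surface and interfacial energies quoted in the
text for the Bi-Sn and Au-Sn systems; the class and function names are
hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Energies:
    """Surface and interfacial energies in J/m^2."""
    gamma_s: float  # surface energy of the crystalline solid solution
    gamma_l: float  # surface energy of the liquid
    gamma_i: float  # crystal/liquid interfacial energy


def reduced_coordinates(e: Energies) -> tuple[float, float]:
    """Return (X, Y) = (gamma_i / gamma_l, gamma_s / gamma_l)."""
    return e.gamma_i / e.gamma_l, e.gamma_s / e.gamma_l


def favoured_case(e: Energies) -> int:
    """Return 1 (core-shell) or 2 (planar interface) according to Eq. (10)."""
    x, y = reduced_coordinates(e)
    boundary = 0.76 * x + 1.0  # the line Y = 0.76 X + 1
    return 1 if y > boundary else 2


# Values quoted in the text
BI_SN = Energies(gamma_s=0.50, gamma_l=0.47, gamma_i=0.054)
AU_SN = Energies(gamma_s=1.43, gamma_l=0.99, gamma_i=0.132)

if __name__ == "__main__":
    for name, e in (("Bi-Sn", BI_SN), ("Au-Sn", AU_SN)):
        x, y = reduced_coordinates(e)
        print(f"{name}: X = {x:.3f}, Y = {y:.3f}, case {favoured_case(e)}")
```

Ratios computed from the rounded energies may differ in the last digit from the
values of X and Y quoted in the text, but the classification is unaffected.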

From the discussion, it seems reasonable to consider that the experimental
results shown in Figs. 9.10 and 9.14 can be explained consistently in terms of the
relative contributions of $\gamma^{s}$, $\gamma^{l}$, and $\gamma^{i}$ to the total Gibbs free energy
of a particle with the two-phase structure.


9.5 
Solid Solubility in Nanometer-Sized Alloy Particles 


This section describes the effect of size on the solid solubility in alloy NPs.
Figure 9.17 shows a typical sequence of the process of alloying tin atoms into
lead NPs at around 383 K [52]. Figures 9.17(a) and (a’) show a BFI of as-produced
lead particles on an amorphous carbon film at 383 K and the corresponding
SAED, respectively. The mean diameter of the lead particles is approximately
6 nm. The corresponding Debye-Scherrer rings can be indexed consistently as
those of a crystal with the fcc structure, which is the same as that of pure bulk
lead. An intensity profile of the Debye-Scherrer rings is depicted as curve (a) in
Fig. 9.18. The profile was obtained by digitizing the data in IPs. Figures 9.17(b)
and (b’) and curve (b) in Fig. 9.18 show a BFI of particles after tin deposition,
the corresponding SAED, and an intensity profile of the rings in the SAED,
respectively. The particle size increased from 6 to 17 nm on average by tin
deposition. The size increment came partially from coalescence among particles
induced by tin deposition; it seems that the tin deposition causes migration of
the particles on the supporting film. All the Debye-Scherrer rings in Fig.
9.17(b’) can be indexed consistently as those of a crystal with the fcc structure.
However, as may be seen from a comparison of profile (b) with (a) in Fig. 9.18,
the corresponding lattice parameter decreased with respect to that of the pure
lead particles in Fig. 9.17(a), suggesting the formation of a lead solid solution
with undersized tin atoms in NPs. These observations indicate that when tin atoms
are vapor-deposited and come in contact with lead particles, they dissolve quickly
into the lead particles to form a lead-tin solid solution. It was revealed by EDS
that the particles shown in Fig. 9.17(b) contained 56 at.% Sn on average, which is
more than five times the solubility limit of tin in bulk lead-tin solid solution
at the same temperature (i.e., 10 at.%) [35].

Fig. 9.17 A series of electron micrographs showing the alloy phase formation in
nanometer-sized particles in the Pb-Sn system: (a) as-produced lead particles on
an amorphous carbon film kept at 383 K; (b) the same area as (a) after tin
deposition; (c) the same area as (b) after cooling to RT; (d) the same area as (c)
after reheating to 383 K; (a’)-(d’) the SAEDs corresponding to (a)-(d),
respectively. A morsel of dirt at the bottom right-hand corner of each BFI
serves to mark a fixed position.

Fig. 9.18 Intensity profiles of the Debye-Scherrer rings shown in Fig.
9.17 (a’)-(d’). See the text for details.

High resolution electron microscopy (HREM) observations revealed that a fraction
of the lead-tin solid solution particles possessed the multiply twinned structure,
which has been frequently observed in NPs of pure fcc metals [53, 54]. An
example of the multiply twinned particles of the lead-tin solid solution is shown
in Fig. 9.19.

Fig. 9.19 A high-resolution image of a multiply twinned nanometer-
sized particle of a lead-tin solid solution. Arrows indicate the locations
of twin boundaries.

Figures 9.17(c), (c’) and curve (c) in Fig. 9.18 show a BFI of the same area as is 
shown in Fig. 9.17(b) but after cooling to RT, together with the corresponding 
SAED and an intensity profile of rings in the SAED, respectively. The Debye- 
Scherrer rings in the SAED can be indexed consistently as those of a crystal with 
the fcc structure (i.e., lead solid solution) superimposed with those of another 
crystal with the tetragonal structure (i.e., tin solid solution). These facts indicate 
that at RT the tin concentration (i.e., 56 at.% Sn) in alloy particles approximately 
17 nm in size well exceeded the solubility limit of tin in lead and therefore excess 
tin was precipitated as a tin solid solution. In other words, a phase change from a 
single-phase lead solid solution to a two-phase structure of lead solid solution and 
tin solid solution took place upon cooling from 383 K to RT. This observation is of 
particular interest since it clearly indicates that even at RT atomic mobility is high 
enough to induce a phase change in approximately 17 nm-sized particles. The 
high mobility of atoms in NPs is consistent with the results in the previous
sections. It should be noted here that the lattice parameter of the fcc lead solid
solution in Fig. 9.17(c) is almost the same as that in Fig. 9.17(b), as seen from a
comparison of profiles (c) and (b) in Fig. 9.18. A possible qualitative explanation
for this fact is as follows. The lattice parameter of the fcc lead solid solution 
shown in Fig. 9.17(b) tends to increase due to the loss of dissolved undersized 
tin atoms (associated with the formation of the tin solid solution within individ- 
ual NPs). On the other hand, the lattice parameter tends to decrease due to the 
thermal contraction associated with cooling from 383 K to RT. These two mutu- 
ally opposite trends apparently cancel each other and bring about an almost con- 
stant lattice parameter of the fcc lead solid solution between the two states shown 
in Figs. 9.17(b) and (c). Quantitative analyses of this point will be an interesting 
subject of a future study. 




Figures 9.17(d) and (d’), and curve (d) in Fig. 9.18, show a BFI of the same area 
as is shown in Fig. 9.17(c), after heating to 383 K, the corresponding SAED, and 
an intensity profile of rings in the SAED, respectively. By comparing particles in 
Fig. 9.17(d) with the corresponding particles in Fig. 9.17(c), it can be seen that in 
some cases coalescence took place among particles upon reheating to 383 K. Ex- 
amples are particles X and Y in Fig. 9.17(d). It is considered that migration of 
particles would be induced due to possible nonisotropic mass flows on the sup- 
porting film at an elevated temperature (i.e., 383 K) and that the migration might 
result in the coalescence of particles. All the Debye-Scherrer rings in the SAED 
can be indexed consistently as those of a crystal with the fcc structure (i.e., lead 
solid solution). Furthermore, as seen from a comparison of curves (d) and (b) in 
Fig. 9.18, the lattice parameter of the fcc lead solid solution in Fig. 9.17(d) is iden- 
tical with that of the solid solution in Fig. 9.17(b) at the same temperature, 383 K. 
These facts suggest that a phase change from the two-phase state (a lead solid so- 
lution and a tin solid solution) to a single phase of lead solid solution occurred 
upon heating to 383 K. This is just the reverse of the phase change that took place 
upon cooling from 383 K (Fig. 9.17b) to RT (Fig. 9.17c). Therefore it can be said
that the above-mentioned change from a single-phase to a two-phase state takes
place reversibly when the temperature of the system is cycled between 383 K and
RT. This observation provides further evidence that in particles approximately
17 nm in size the solubility limit of tin in lead is higher than 56 at.% Sn at
383 K but lower than 56 at.% Sn at RT.

Recently it was reported that the elastic modulus decreases with decreasing
particle size. Such a decrease in the elastic modulus can lower the strain energy
in the solid solution. That is, the contribution of strain energy to the heat of
mixing (i.e., to the formation enthalpy) becomes small when the elastic modulus
becomes low. This reduction in the formation enthalpy of a solid solution would
in turn enhance the solid solubility; that is, the solubility limit would increase
when the size of the particles is reduced. This explanation is in accordance with
the observation that an enhanced solid solubility of tin in lead has been achieved
in NPs (see Figs. 9.17 and 9.18, for example). However, further study is needed to
elucidate quantitatively the mechanism behind the enhanced solid solubility.


9.6 
Summary and Future Perspectives 


In this chapter properties of alloys under dimensional restrictions have been 
described, with emphasis on the phase equilibrium in alloy NPs. In particular, 
the effects of system size on the eutectic temperature, on the crystalline-to- 
liquid phase change, and on the solid solubility have been presented and dis- 
cussed in some detail. Furthermore, the formation of a thermodynamically sta- 
ble amorphous solid, which is a feature characteristic of alloy NPs, has been 
described. 

In order to obtain a comprehensive understanding of alloy properties under
dimensional restrictions, studies on the following issues are still needed.


1. Substrate effect on phase stability of nanometer-sized alloy
   particles: The crystalline-to-liquid (C-L) transition of NPs has
   been investigated primarily on solid substrates so far.
   However, the particle/substrate interfacial tension, as well as
   the surface tension of the substrate, might also affect such
   phase transitions as the C-L transition in NPs, which has not
   been taken into account explicitly in previous studies.

2. Size dependence of the surface tension in NPs: The surface
   tension of small particles may change due to the curvature. It
   may decrease with decreasing particle size, as suggested by
   Tolman [55]. Experimental verification of this point is
   necessary to assess quantitatively the phase diagram of NPs
   less than several nanometers in diameter.

3. Evaluation of the solid/liquid interfacial energy: It is mentioned
   in this chapter that the interfacial energy for an interface
   between liquid (or solid) and solid phases can affect the
   phase equilibrium when the particle is in the two-phase
   region (liquid-solid or solid-solid). An accurate measurement
   of the interfacial energy is required to establish the
   phase equilibrium of alloy NPs.

4. Glass transition: It is suggested in this chapter that a
   thermodynamically stable amorphous phase could be formed
   in a eutectic system when the particles are smaller than
   approximately 10 nm in diameter. This unique amorphous
   phase gives us an opportunity to obtain an in-depth
   understanding of the liquid-to-glass (L-G) transition, which is
   one of the unsolved issues in modern solid-state physics.
10
Statistical Thermodynamics and Model 
Calculations 


Tetsuo Mohri 


10.1 
Introduction 


Electronic structure calculations determine various ground-state properties of an
alloy. At finite temperature, however, an alloy is brought into excited states,
and one needs to consider the entropy and free energy to determine the most stable
state. This is the main task of statistical thermodynamics in alloy theory.
Among the various approaches to calculating the entropy, the Bragg-Williams
approximation (Bragg and Williams 1934) has been the most widely employed in the
materials science community, mostly because of its physical transparency and
mathematical simplicity. In fact, most commercial software for phase equilibria
calculations is based on the Bragg-Williams approximation.

Yet it has been realized that the Bragg-Williams approximation does not provide
accurate results in many thermodynamic calculations, including those of phase
diagrams. One of its most serious deficiencies is the neglect of atomic
correlations. In general, the spatial extent of the atomic correlation exceeds the
atomic interaction range. It is well known that the atomic interaction in a
typical alloy is not confined to the nearest-neighbor pair but extends over
several more distant pairs. Also, many-body interactions cannot be ignored in a
less symmetric crystal structure. Hence, it is essential to include a wide range
of atomic correlations in the entropy formula for an accurate calculation of phase
equilibria. The Bragg-Williams approximation, being a single-site approximation,
is by no means able to take this into account. The modern theory of alloys
therefore demands a more reliable statistical thermodynamic theory that
incorporates atomic correlations.
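
To make the single-site character concrete, the sketch below evaluates the ideal
(point-approximation) configurational entropy of mixing per lattice site, which is
the entropy expression that underlies the Bragg-Williams approximation for a
disordered binary alloy. It depends only on the concentration and therefore
carries no information about pair or higher-order correlations. The function name
is hypothetical and the snippet is meant as an illustration only.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K


def point_entropy_per_site(x_a: float) -> float:
    """Ideal mixing entropy per site, S = -k_B [x ln x + (1 - x) ln(1 - x)]."""
    if not 0.0 <= x_a <= 1.0:
        raise ValueError("concentration must lie in [0, 1]")
    total = 0.0
    for x in (x_a, 1.0 - x_a):
        if x > 0.0:  # x ln x tends to 0 as x tends to 0
            total += x * math.log(x)
    return -K_B * total


if __name__ == "__main__":
    for x in (0.1, 0.25, 0.5):
        print(f"x_A = {x:.2f}: S = {point_entropy_per_site(x):.3e} J/K per site")
```

The entropy is symmetric about the equiatomic composition, where it reaches its
maximum value of k_B ln 2 per site.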

The cluster variation method (Kikuchi 1951; hereafter abbreviated to CVM) has
been regarded as one of the most reliable theoretical tools for dealing with
atomic correlations, and the main focus of the present chapter is placed on the
introduction and applications of the CVM. Two advantageous features are
particularly emphasized in Section 10.2. The first is that the internal energy
calculations are combined coherently with entropy calculations by the CVM through
common configurational variables termed correlation functions. The reader will see
how the atomic correlation on a discrete lattice is dealt with by the CVM and how
the energy and entropy calculations are combined to construct a free energy in an
efficient and accurate manner. The second advantage is that the CVM can be
extended naturally to the time domain to study kinetics. The kinetics version of
the CVM is termed the path probability method (Kikuchi 1966), and a general
framework of the theory is introduced. Thereby, the statistical thermodynamics and
kinetics on a discrete lattice are discussed consistently in Section 10.2.

Many traditional textbooks on statistical thermodynamics do not refer much
to the microstructure. However, in view of the fact that most of the macroscopic
properties of an alloy, including its mechanical and electrical properties, are
controlled by microstructure, a discussion of statistical thermodynamics without
microstructure is not very useful in alloy physics. The recent advancement of
phase field theory (L.Q. Chen 2002) in describing the microstructure and its
time evolution is quite remarkable. The versatility of the phase field
method (PFM) has been amply demonstrated in various calculations including
crystal growth, twin boundaries, evolution/devolution of magnetic domains, etc.
Hence, in Section 10.3, a general framework of the PFM is introduced, which
will be applied and extended to the study of the order-disorder transition of
Fe-based alloys in Section 10.4. The basis of the PFM can be traced back to the
celebrated Cahn-Hilliard equation and the time-dependent Ginzburg-Landau equation.
After deriving these equations, two kinds of interface properties, the width and
the velocity, are calculated. It is shown that, within the PFM, the interface is
not a special entity to be described separately but merely a spatial localization
of field variables.

In order to apply the theoretical frameworks introduced in Sections 10.2
and 10.3 to real alloy systems, two kinds of calculations recently performed by
the author’s group on Fe-based binary alloy systems are introduced in Section
10.4. The first is the calculation of phase diagrams. By combining electronic
structure calculations with the CVM, the reader will see that the phase diagram is
obtained with surprisingly high accuracy from first principles. In fact, such a
fruitful outcome demonstrates the rigor of the framework of statistical
thermodynamics introduced in this section.

The second is an ongoing subject. As will be seen, the behaviors of electrons
and atoms on a discrete lattice are the subjects of Sections 10.2 and 10.3,
respectively, while that of microstructure is described efficiently in a continuum
medium. This is due to the differences in length scale and time constant involved
in each category. How to unify these two kinds of behavior into a single
mathematical framework is quite a difficult subject, which is often addressed in
terms of multiscale calculation. Although the coarse-graining operation in
statistical physics is known to be the key to settling the issue, there is no
fixed recipe for applying this rigorous principle in order to elucidate the varied
phenomena in alloys. This is one of the most challenging subjects in statistical
thermodynamics, and a natural outcome expected from meeting this challenge is a
first-principles calculation of microstructural evolution. At the end of this
section, one potential approach based on the CVM and PFM attempted by the
author’s group is discussed.

The body of the present chapter is based on the author’s own work, but the 
studies referred to do not cover all the important contributions in this field. Vari- 
ous excellent books and monographs (Khachaturyan 1983; Ducastelle 1991), and 
review articles (de Fontaine 1979, 1994) exist on the subject matter discussed in 
this chapter. 


10.2 
Statistical Thermodynamics on a Discrete Lattice 


10.2.1 
Description of Atomic Configuration 


A microscopic approach to alloy phase equilibria on a discrete lattice has been 
termed configurational thermodynamics (de Fontaine 1979) and a variable which 
defines the atomic configuration on a lattice is called a configurational variable. It 
is essential to define configurational variables properly and efficiently for a sys- 
tematic study of configurational thermodynamics. For simplicity, the discussion 
in this article is limited to a binary system in which two atomic species A and B 
are distributed with a specified composition. 

Among configurational variables, the most primitive one is the point probability
$x_i(p)$, which specifies the probability of finding a species $i$ (either A or B)
on a given lattice point $p$. It should be noted that this is nothing but the
concentration of species $i$ if all the lattice points are indistinguishable. If,
on the other hand, lattice points are distinguishable under a symmetry operation,
the entire lattice is divided into sublattices. In this case, $x_i(p)$ is the
concentration of species $i$ on the sublattice to which $p$ belongs.

The probability, $y_{ij}(p,q)$, of finding species $i$ and $j$ in a pair of lattice
points $p$ and $q$ can be defined in a similar way and is termed the pair
probability. A special case is the one in which the lattice point $q$ is the
nearest neighbor to $p$, and $y_{ij}(p,p')$ is termed the nearest-neighbor pair
probability, where $p'$ is a nearest-neighbor point of $p$. The distant pair
probabilities are defined in a similar way, and it should be noted that a pair
probability is nothing but the concentration of pair configurations in a given
lattice. When the lattice points are not all equivalent, the pair probability and
pair concentration are defined on (between) a relevant sublattice(s). The
probability for a multibody configuration such as a triangle, tetrahedron, square,
etc. can be defined in a similar way and these are termed cluster probabilities or
cluster concentrations.

One should remember that all cluster probabilities are interrelated by conditions.
The first is the normalization condition, which is written for point and
pair probabilities, respectively, as Eqs. (1) and (2).

$$\sum_{i} x_i = 1 \qquad (1)$$

$$\sum_{i}\sum_{j} y_{ij} = 1 \qquad (2)$$

The second, the geometrical condition, is given as Eq. (3).


$$x_i = \sum_{j} y_{ij} \qquad (3)$$

The geometrical condition is a constraint imposed by the geometry of a given
lattice and is extended to larger clusters. For an fcc lattice, for instance,
$x_i$ and $y_{ij}$ can be related to the regular triangle probability, $z_{ijk}$, and
the nearest-neighbor tetrahedron probability, $w_{ijkl}$, as in Eq. (4).


$$x_i = \sum_{j} y_{ij} = \sum_{j}\sum_{k} z_{ijk} = \sum_{j}\sum_{k}\sum_{l} w_{ijkl} \qquad (4)$$

Because of these two kinds of conditions, the cluster probabilities do not form a 
set of independent variables in the free energy functional. 
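
The normalization and geometrical conditions can be verified on a small example.
The sketch below is an illustration under simplifying assumptions that are not
part of the text: a one-dimensional ring of sites with periodic boundary
conditions stands in for the lattice, all sites are treated as equivalent, and the
configuration is generated at random. The function names are hypothetical.

```python
import random
from collections import Counter
from itertools import product

SPECIES = ("A", "B")


def point_probabilities(config):
    """x_i: fraction of sites occupied by species i."""
    counts = Counter(config)
    n = len(config)
    return {i: counts[i] / n for i in SPECIES}


def pair_probabilities(config):
    """y_ij: fraction of nearest-neighbour pairs (p, p+1) holding species (i, j)."""
    n = len(config)
    counts = Counter((config[p], config[(p + 1) % n]) for p in range(n))
    return {(i, j): counts[(i, j)] / n for i, j in product(SPECIES, repeat=2)}


if __name__ == "__main__":
    rng = random.Random(0)
    config = [rng.choice(SPECIES) for _ in range(1000)]
    x = point_probabilities(config)
    y = pair_probabilities(config)
    print("sum_i x_i   =", sum(x.values()))   # normalization, Eq. (1)
    print("sum_ij y_ij =", sum(y.values()))   # normalization, Eq. (2)
    for i in SPECIES:                          # geometrical condition, Eq. (3)
        marginal = sum(y[(i, j)] for j in SPECIES)
        print(f"x_{i} = {x[i]:.3f}, sum_j y_{i}j = {marginal:.3f}")
```

On the ring every site is the first member of exactly one nearest-neighbor pair,
which is why summing the pair probabilities over the second index recovers the
point probabilities.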

An alternative way to describe atomic configuration is to employ correlation
functions (Sanchez and de Fontaine 1978; Sanchez et al. 1984; Mohri et al.
1985a; Kikuchi and Mohri 1997; Mohri 2002), which are derived in the following
manner. First, in order to specify an atomic species on a lattice point $p$, we
define a spin operator $\sigma(p)$ by Eq. (5).


$$\sigma(p) = \begin{cases} +1 & \text{for A} \\ -1 & \text{for B} \end{cases} \quad \text{at a lattice point } p \qquad (5)$$

Then, by using $\sigma(p)$, an occupation operator $\Gamma_i(p)$ is defined as in Eq. (6),
where A and B atoms are specified by $i = +1$ and $-1$, respectively.

$$ \Gamma_i(p) = \frac{1}{2}\,\bigl(1 + i \cdot \sigma(p)\bigr) \tag{6} $$


The occupation operator in Eq. (6) suggests that if species i specified in the sub- 
script exists at a lattice point p the operator returns the value 1, while 0 results for 
the other species at p. For instance, if atom A is specified in the subscript i, Eq. 
(6) is explicitly written as Eq. (7). 


$$ \Gamma_1(p) = \frac{1}{2}\,\bigl(1 + 1 \cdot \sigma(p)\bigr) \tag{7} $$

When A is actually located at p, the spin operator $\sigma(p)$ is 1 according to the
definition in Eq. (5), and $\Gamma_1(p)$ becomes 1. If, on the other hand, B is located at p,
$\sigma(p)$ is $-1$, which yields $\Gamma_1(p) = 0$.

We assume a lattice in which A and B atoms are distributed in numbers
$N_1$ ($N_A$) and $N_{\bar{1}}$ ($N_B$), respectively. Please note that $-1$ and $\bar{1}$ are
used equivalently in the rest of this chapter. Then, the total number of lattice points, N,
is given by the sum of $N_1$ and $N_{\bar{1}}$ [Eq. (8)] and the concentration of A atoms is
given by Eq. (9).

$$ N = N_1 + N_{\bar{1}} \tag{8} $$

$$ x_1 = \frac{N_1}{N} \tag{9} $$


By applying the occupation operator to all the lattice points, the number of A
atoms, $N_1$, is written as Eq. (10).

$$ N_1 = \sum_{p=1}^{N} \Gamma_1(p) \tag{10} $$

By substituting Eq. (10) into Eq. (9), one obtains Eq. (11). 


$$ x_1 = \frac{1}{N} \sum_{p=1}^{N} \Gamma_1(p) \tag{11} $$


A further substitution of Eq. (7) into Eq. (11) yields Eq. (12). 


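$$ x_1 = \frac{1}{N} \sum_{p=1}^{N} \frac{1}{2}\,\bigl(1 + \sigma(p)\bigr) = \frac{1}{2}\left(1 + \frac{\sum_{p=1}^{N} \sigma(p)}{N}\right) \tag{12} $$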

The second term in the parentheses in Eq. (12) is rewritten as Eq. (13) and is 
called the point correlation function. 


$$ \xi_1 = \frac{\sum_{p=1}^{N} \sigma(p)}{N} \tag{13} $$





Then, the concentration of A atoms given by Eq. (12) is rewritten as Eq. (14).

$$ x_1 = \frac{1}{2}\,(1 + \xi_1) \tag{14} $$

Likewise, the concentration of B atoms is given as Eq. (15).

$$ x_{\bar{1}} = \frac{1}{2}\,(1 - \xi_1) \tag{15} $$


In a more concise form, the concentration of species i is written as Eq. (16). 


$$ x_i = \frac{1}{2}\,(1 + i \cdot \xi_1) \tag{16} $$
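
The chain from the spin operator to the concentration, Eqs. (5)-(16), can be checked numerically. The following is a minimal sketch, assuming a small hypothetical eight-site arrangement; the function names `occupation` and `point_correlation` are illustrative and do not come from the text.

```python
import numpy as np

def occupation(i, sigma):
    """Occupation operator of Eq. (6): 1 where species i sits, 0 elsewhere."""
    return 0.5 * (1 + i * sigma)

def point_correlation(sigma):
    """Point correlation function of Eq. (13): lattice average of sigma(p)."""
    return sigma.mean()

# Hypothetical arrangement: +1 marks an A atom, -1 marks a B atom.
sigma = np.array([1, 1, -1, 1, -1, -1, 1, 1])

xi1 = point_correlation(sigma)
x_A = occupation(+1, sigma).mean()   # Eq. (11)
x_B = occupation(-1, sigma).mean()

print(xi1, x_A, x_B)
```

For any arrangement, the directly counted concentrations should coincide with those obtained from Eq. (16) using the point correlation function alone.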


Next, we consider the pair probability $y_{ij}$, with a particular focus on $y_{11}$,
the concentration of A-A pairs, which is given as Eq. (17), where $N_{11}$ is the
number of A-A pairs and N(2) is the total number of nearest-neighbor pairs in the
lattice.

$$ y_{11} = \frac{N_{11}}{N(2)} \tag{17} $$


One notices that, by employing the occupation operator, $N_{11}$ is given by Eq. (18),
where $p'$ is a nearest-neighbor lattice point of p.

$$ N_{11} = \sum_{p} \sum_{p'} \Gamma_1(p) \cdot \Gamma_1(p') \tag{18} $$


The substitution of Eq. (18) into Eq. (17) via Eq. (7) leads to Eq. (19). 


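$$ y_{11} = \frac{1}{N(2)} \sum_{p} \sum_{p'} \frac{1}{2}\bigl(1 + \sigma(p)\bigr) \cdot \frac{1}{2}\bigl(1 + \sigma(p')\bigr) = \frac{1}{2^2}\left(1 + \frac{\sum_{p} \sigma(p)}{N} + \frac{\sum_{p'} \sigma(p')}{N} + \frac{\sum_{p} \sum_{p'} \sigma(p)\,\sigma(p')}{N(2)}\right) \tag{19} $$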


It should be noted that the denominators of the second and third terms in the 
parentheses are N, not N(2), and, therefore, these terms are the point correlation 
functions. The fourth term is termed the pair correlation function and is formally 
defined as in Eq. (20). 


$$ \xi_2 = \frac{\sum_{p} \sum_{p'} \sigma(p)\,\sigma(p')}{N(2)} \tag{20} $$

Fig. 10.1 Schematic illustration of (a) phase separation, (b) random
solid solution, and (c) an ordered phase, on a square lattice (Mohri
2002).


Then, $y_{11}$ is finally given as in Eq. (21).

$$ y_{11} = \frac{1}{2^2}\,(1 + 2\xi_1 + \xi_2) \tag{21} $$


It is straightforward to derive the general expression for the pair probability in the 
form of Eq. (22). 


$$ y_{ij} = \frac{1}{2^2}\,\bigl\{1 + (i + j) \cdot \xi_1 + i \cdot j \cdot \xi_2\bigr\} \tag{22} $$


Before further generalization, it is helpful to consider the examples in Fig. 10.1, 
in which phase separation (a), random solid solution (b), and ordered arrange- 
ment (c) are indicated. The concentration of A(B) atoms is 50 at.% for all three 
cases; hence Eq. (14) suggests that the point correlation function is 0, while the 
pair correlation functions for (a), (b), and (c) are, respectively, +1, 0, and —1, 
which are readily derived from Eq. (20). Note that the atomic arrangements on 
an interface and a surface in Fig. 10.1(a) are negligible for a large system. 
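
The three limiting values of the pair correlation function can be reproduced on a small periodic square lattice. This is a sketch under stated assumptions: the lattice size `L = 40`, the periodic boundary conditions, the random seed, and the helper name `pair_correlation` are all choices made for illustration. With periodic boundaries the phase-separated arrangement contains two interfaces, so its value deviates from +1 by a term of order 1/L, in line with the remark that interfaces are negligible for a large system.

```python
import numpy as np

def pair_correlation(sigma):
    """Nearest-neighbor pair correlation, Eq. (20), on a periodic square lattice.

    Each bond is counted once: one bond to the right and one bond downward
    per site, giving N(2) = 2 * N bonds in total.
    """
    right = sigma * np.roll(sigma, -1, axis=1)
    down = sigma * np.roll(sigma, -1, axis=0)
    return (right.sum() + down.sum()) / (2 * sigma.size)

L = 40                                    # hypothetical lattice edge (even)
rows, cols = np.indices((L, L))

separated = np.where(cols < L // 2, 1, -1)           # A on the left, B on the right
ordered = np.where((rows + cols) % 2 == 0, 1, -1)    # checkerboard
rng = np.random.default_rng(0)
random_sol = rng.permutation(np.repeat([1, -1], L * L // 2)).reshape(L, L)

xi2_sep = pair_correlation(separated)
xi2_ord = pair_correlation(ordered)
xi2_rand = pair_correlation(random_sol)

print(xi2_sep, xi2_rand, xi2_ord)
```

All three arrangements have a point correlation function of zero (50 at.%), so the pair correlation function is what distinguishes them.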

The correlation function can be defined for a larger cluster in a similar
way. In fact, it can be easily shown that the cluster concentration of an n-point
cluster is written as Eq. (23), where $i_j$ specifies the atomic species at a lattice point j.

$$ x_{i_1 i_2 \cdots i_j \cdots i_n} = \frac{1}{2^n}\bigl\{1 + (i_1 + i_2 + \cdots + i_j + \cdots + i_n) \cdot \xi_1 + (i_1 i_2 + i_1 i_3 + \cdots + i_{n-1} i_n) \cdot \xi_2 + (i_1 i_2 i_3 + \cdots + i_{n-2} i_{n-1} i_n) \cdot \xi_3 + \cdots + (i_1 i_2 \cdots i_{n-1} i_n) \cdot \xi_n\bigr\} \tag{23} $$





It is a straightforward extension of Eqs. (13) and (20) to define the n-point
correlation function $\xi_n$ as in Eq. (24), where N(n) is the total number of n-point
clusters in the given lattice.

$$ \xi_n = \frac{\sum_{p_1} \sum_{p_2} \cdots \sum_{p_n} \sigma(p_1)\,\sigma(p_2) \cdots \sigma(p_n)}{N(n)} \tag{24} $$





Then, Eq. (23) is conveniently generalized as Eq. (25), where the atomic
configuration $\{i_1 i_2 \cdots i_j \cdots i_n\}$ is abbreviated to $\{J\}$, and $l$ indicates the
cluster formed by n lattice points, while $l'$ is a subcluster contained in a cluster $l$.
$V_{\{J\}}(l, l')$ is called the V-matrix (Sanchez and de Fontaine 1978; Mohri et al. 1985a)
and involves the sum of products of $i_1, i_2, \ldots, i_j, \ldots, i_n$, as one can see in Eq. (23).

$$ x_l(\{J\}) = \frac{1}{2^n}\Bigl\{1 + \sum_{l'} V_{\{J\}}(l, l') \cdot \xi_{l'}\Bigr\} \tag{25} $$


As an example, we consider atomic configurations on a nearest-neighbor
tetrahedron cluster in the fcc lattice. The total number of atomic arrangements is
$2^4 = 16$ and we describe the tetrahedron cluster probability for each configuration
by $w_{ijkl}$. According to the geometrical condition indicated by Eq. (4), the cluster
probabilities of subclusters contained in the tetrahedron cluster can be written
in terms of $w_{ijkl}$. Some examples are given in Eqs. (26-28).

$$ x_1 = w_{1111} + w_{111\bar{1}} + w_{11\bar{1}1} + w_{11\bar{1}\bar{1}} + w_{1\bar{1}11} + w_{1\bar{1}1\bar{1}} + w_{1\bar{1}\bar{1}1} + w_{1\bar{1}\bar{1}\bar{1}} \tag{26} $$

$$ y_{11} = w_{1111} + w_{111\bar{1}} + w_{11\bar{1}1} + w_{11\bar{1}\bar{1}} \tag{27} $$

$$ z_{111} = w_{1111} + w_{111\bar{1}} \tag{28} $$


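When all the lattice points are equivalent (indistinguishable), Eqs. (29-31) hold.

$$ w_{111\bar{1}} = w_{11\bar{1}1} = w_{1\bar{1}11} = w_{\bar{1}111} \tag{29} $$

$$ w_{11\bar{1}\bar{1}} = w_{1\bar{1}1\bar{1}} = w_{1\bar{1}\bar{1}1} = w_{\bar{1}11\bar{1}} = w_{\bar{1}1\bar{1}1} = w_{\bar{1}\bar{1}11} \tag{30} $$

$$ w_{1\bar{1}\bar{1}\bar{1}} = w_{\bar{1}1\bar{1}\bar{1}} = w_{\bar{1}\bar{1}1\bar{1}} = w_{\bar{1}\bar{1}\bar{1}1} \tag{31} $$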


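Then, only five cluster probabilities, $w_{1111}$, $w_{111\bar{1}}$, $w_{11\bar{1}\bar{1}}$,
$w_{1\bar{1}\bar{1}\bar{1}}$, and $w_{\bar{1}\bar{1}\bar{1}\bar{1}}$, are distinguishable. These
five variables, however, should satisfy the normalization condition stated in Eq. (32).

$$ w_{1111} + 4 w_{111\bar{1}} + 6 w_{11\bar{1}\bar{1}} + 4 w_{1\bar{1}\bar{1}\bar{1}} + w_{\bar{1}\bar{1}\bar{1}\bar{1}} = 1 \tag{32} $$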


Hence, one can conclude that only four (5 - 1 = 4) are independent variables.
By employing Eq. (23), on the other hand, the tetrahedron cluster probability,
$w_{ijkl}$, can be written as Eq. (33), which clearly indicates that all the relevant cluster
probabilities can be written with only four correlation functions $\xi_1$, $\xi_2$, $\xi_3$, and $\xi_4$.

$$ w_{ijkl} = \frac{1}{2^4}\bigl\{1 + (i + j + k + l) \cdot \xi_1 + (ij + ik + il + jk + jl + kl) \cdot \xi_2 + (ijk + ijl + ikl + jkl) \cdot \xi_3 + ijkl \cdot \xi_4\bigr\} \tag{33} $$

Fig. 10.2 Division of original lattice into two sublattices upon
ordering.

This suggests that the correlation functions form a set of independent variables.
Sanchez et al. (1984) further suggested that the correlation functions form an
orthonormal basis in the thermodynamic configuration space, which is the basis of
the cluster expansion technique described in the next section.
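
The statement that four correlation functions determine all sixteen tetrahedron probabilities can be illustrated directly from Eq. (23). The sketch below assumes equivalent lattice points and, purely for illustration, an uncorrelated arrangement in which the m-point correlation function equals the m-th power of the point correlation function; the value 0.2 and the function name `cluster_probability` are hypothetical.

```python
from itertools import combinations, product
from math import prod

def cluster_probability(species, xi):
    """Cluster probability from Eq. (23) for equivalent lattice points.

    species : tuple of +1 (A) or -1 (B), one entry per lattice point
    xi      : list with xi[m] the m-point correlation function, xi[0] = 1
    """
    n = len(species)
    total = 0.0
    for m in range(n + 1):
        # Sum of all products of m distinct species indices (1 for m = 0).
        coeff = sum(prod(c) for c in combinations(species, m))
        total += coeff * xi[m]
    return total / 2**n

# Assumed correlation functions: random arrangement with xi_1 = 0.2.
xi1 = 0.2
xi = [xi1**m for m in range(5)]

w = {s: cluster_probability(s, xi) for s in product((1, -1), repeat=4)}

total = sum(w.values())                                 # normalization, Eq. (32)
x_A = sum(p for s, p in w.items() if s[0] == 1)         # geometrical condition, Eq. (26)
y_AA = sum(p for s, p in w.items() if s[:2] == (1, 1))  # Eq. (27)

print(total, x_A, y_AA)
```

The normalization and the geometrical conditions are satisfied automatically, which is the practical advantage of using correlation functions as the independent variables.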

The discussions so far have been limited to a disordered phase in which all the 
lattice points are indistinguishable. For an ordered phase, on the other hand, a 
symmetry operation divides the entire lattice into sublattices, as is illustrated 
schematically in Fig. 10.2 for a two-dimensional square lattice. For such an 
ordered phase, normalization conditions and geometrical conditions are de- 
scribed by explicitly specifying the sublattices. The normalization condition of 
Eqs. (1) and (2) can be easily extended, respectively, as Eqs. (34) and (35) and the 
geometrical condition (3) is likewise extended as Eqs. (36) and (37). 


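$$ \sum_i x_i^{\alpha} = 1, \quad \sum_i x_i^{\beta} = 1 \tag{34} $$

$$ \sum_{i,j} y_{ij}^{\alpha\beta} = 1, \quad \sum_{i,j} y_{ij}^{\alpha\alpha} = \sum_{i,j} y_{ij}^{\beta\beta} = 1 \tag{35} $$

$$ x_i^{\alpha} = \sum_j y_{ij}^{\alpha\alpha} = \sum_j y_{ij}^{\alpha\beta} \tag{36} $$

$$ x_i^{\beta} = \sum_j y_{ji}^{\alpha\beta} = \sum_j y_{ij}^{\beta\beta} \tag{37} $$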
J J 


It should be noted that in this particular two-dimensional square lattice, $y_{ij}^{\alpha\beta}$
(i on $\alpha$ and j on $\beta$) is the nearest-neighbor pair probability in the original lattice,
while $y_{ij}^{\alpha\alpha}$ ($y_{ij}^{\beta\beta}$) (i, j on $\alpha$ or $\beta$) corresponds to the
second-nearest-neighbor pair probability.

Such distinctions of sublattices are also reflected in the correlation functions,
and point and pair cluster probabilities are expressed with the correlation
functions defined by explicitly taking the distinction of sublattices into account. One
can readily show that Eqs. (38-41) are the extensions of Eqs. (16) and (22), where
$\xi_1^{\alpha}$ and $\xi_1^{\beta}$ are point correlation functions on the $\alpha$ and $\beta$
sublattices, respectively, and $\xi_{2,2}^{\alpha\alpha}$ and $\xi_{2,1}^{\alpha\beta}$ are pair
correlation functions.











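$$ x_i^{\alpha} = \frac{1}{2}\,(1 + i \cdot \xi_1^{\alpha}) \tag{38} $$

$$ x_i^{\beta} = \frac{1}{2}\,(1 + i \cdot \xi_1^{\beta}) \tag{39} $$

$$ y_{ij}^{\alpha\alpha} = \frac{1}{2^2}\,\bigl(1 + (i + j) \cdot \xi_1^{\alpha} + i \cdot j \cdot \xi_{2,2}^{\alpha\alpha}\bigr) \tag{40} $$

$$ y_{ij}^{\alpha\beta} = \frac{1}{2^2}\,\bigl(1 + i \cdot \xi_1^{\alpha} + j \cdot \xi_1^{\beta} + i \cdot j \cdot \xi_{2,1}^{\alpha\beta}\bigr) \tag{41} $$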


It should be noted that the subscript of the pair correlation functions distin- 
guishes the first (2, 1) and second (2, 2) nearest neighbors. 


10.2.2 
Internal Energy 


One of the most primitive descriptions of the internal energy of an alloy is based
on the pair interaction model. The internal energy, E, per atom is then given by
Eq. (42), where Z is the coordination number, which depends on the crystal
structure, and $e_{ij}$ is the atomic pair interaction energy between species i and j.

$$ E = \frac{1}{2} \cdot Z \cdot \sum_{i,j} e_{ij} \cdot y_{ij} \tag{42} $$


The functional form of $e_{ij}$ is not unique and often the dependence on the atomic
distance is incorporated, for instance, by a Lennard-Jones type potential (Oh et al.
1994). But, for simplicity, we assume in this section that $e_{ij}$ is a constant which
depends only on the species i and j.

As was seen in Section 10.2.1, the pair probabilities $\{y_{ij}\}$ do not form a set of
independent configurational variables. Hence it is desirable to describe the
internal energy, E, in terms of correlation functions which are the independent
configurational variables. In fact, according to the cluster expansion method (CEM)
(Connolly and Williams 1983) the internal energy of an alloy, E, can be expanded
in the form of Eq. (43), where $\xi_k$ indicates a correlation function for a k-type
cluster and $v_k$ is termed the effective cluster interaction energy.

$$ E(\{\xi_k\}) = \sum_k v_k \cdot \xi_k \tag{43} $$


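It is essential to correlate effective cluster interaction energies $\{v_k\}$ with atomic
interaction energies $\{e_{ij}\}$, and this is attempted in the following way.
By substituting Eq. (22) into Eq. (42), one obtains Eq. (44).

$$ E = \frac{1}{2} \cdot Z \cdot \sum_{i,j} e_{ij} \cdot y_{ij} = \frac{1}{2} \cdot Z \cdot \frac{1}{2^2}\bigl\{(e_{AA} + 2e_{AB} + e_{BB}) + 2 \cdot (e_{AA} - e_{BB}) \cdot \xi_1 + (e_{AA} + e_{BB} - 2e_{AB}) \cdot \xi_2\bigr\} \tag{44} $$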


On the other hand, within the pair interaction model, k takes values 0, 1, and 2,
and Eq. (43) is written explicitly as Eq. (45), where the significance of $v_0$ and $\xi_0$
will be discussed later.

$$ E = \sum_{k=0}^{2} v_k \cdot \xi_k = v_0 \cdot \xi_0 + v_1 \cdot \xi_1 + v_2 \cdot \xi_2 \tag{45} $$


The comparison of Eq. (44) with Eq. (45) immediately yields Eqs. (46-48).

$$ v_0 = \frac{Z}{2^3} \cdot (e_{AA} + 2e_{AB} + e_{BB}) \tag{46} $$

$$ v_1 = \frac{Z}{2^2} \cdot (e_{AA} - e_{BB}) \tag{47} $$

$$ v_2 = \frac{Z}{2^3} \cdot (e_{AA} + e_{BB} - 2e_{AB}) \tag{48} $$
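
The equivalence of the pair-probability form, Eq. (42), and the cluster-expansion form, Eq. (45), can be verified numerically. This is a minimal sketch: the pair energies, the coordination number Z = 12, and the correlation function values are hypothetical numbers chosen only for the check, and the null-cluster correlation function is taken as 1.

```python
def effective_interactions(Z, e_AA, e_AB, e_BB):
    """Effective cluster interaction energies of Eqs. (46)-(48)."""
    v0 = Z / 8 * (e_AA + 2 * e_AB + e_BB)
    v1 = Z / 4 * (e_AA - e_BB)
    v2 = Z / 8 * (e_AA + e_BB - 2 * e_AB)
    return v0, v1, v2

def energy_from_pairs(Z, e, xi1, xi2):
    """Eq. (42), with the pair probabilities taken from Eq. (22).

    e maps a species pair (i, j), with i, j in {+1, -1}, to e_ij.
    """
    total = 0.0
    for i in (1, -1):
        for j in (1, -1):
            y_ij = 0.25 * (1 + (i + j) * xi1 + i * j * xi2)
            total += e[(i, j)] * y_ij
    return 0.5 * Z * total

# Hypothetical inputs (arbitrary energy units).
Z = 12
e_AA, e_AB, e_BB = -1.0, -1.2, -0.8
xi1, xi2 = 0.2, -0.3

e = {(1, 1): e_AA, (1, -1): e_AB, (-1, 1): e_AB, (-1, -1): e_BB}
v0, v1, v2 = effective_interactions(Z, e_AA, e_AB, e_BB)

E_pairs = energy_from_pairs(Z, e, xi1, xi2)
E_cem = v0 + v1 * xi1 + v2 * xi2   # Eq. (45) with xi_0 = 1

print(E_pairs, E_cem)
```

With these inputs v2 is positive, which corresponds to a preference for unlike (A-B) pairs.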


One can see that $v_1$ is directly proportional to the difference between the cohesive
energies of constituent elements A and B, while $v_2$ is the interaction parameter
(also called the effective interaction energy) often employed in thermochemistry.
It is noticed that in a completely random solid solution, the correlations of
atomic arrangement vanish, and the k-point cluster correlation function $\xi_k$ is
decomposed into the kth power product of the point correlation function [Eq. (49)].

$$ \xi_k = \xi_1^{\,k} \tag{49} $$


By noting that $\xi_1$ vanishes at 50 at.%, as is deduced from Eq. (16), the internal
energy of a random solid solution is given as Eq. (50).

$$ E = v_0 \tag{50} $$

This suggests that the physical meaning of the effective interaction energy for a
null cluster, $v_0$, is equivalent to the internal energy of a random solid solution at
50 at.%.

Although the physical meaning of $v_0$ is a general consequence of the CEM, it
should be noted that the relationships in Eqs. (46-48) are applicable only for the
nearest-neighbor pair interaction model. In fact, for the description of the
internal energy within the tetrahedron atomic interaction energy, $e_{ijkl}$, E can be written
as Eq. (51), where $\omega'$ is the number of tetrahedron clusters per lattice point,
and the comparison with Eq. (43) with k = 0, 1, 2, 3, and 4 yields different
expressions.

$$ E = \omega' \cdot \sum_{i,j,k,l} e_{ijkl} \cdot w_{ijkl} \tag{51} $$

It is left to the reader to derive the relationships between $e_{ijkl}$ and $\{v_k\}$.

The actual operation of the CEM will be discussed in Section 10.4. It is worth 
pointing out that a stable ordered phase can be predicted once the effective inter- 
action energies are assigned. Two schemes should be addressed. One is based on 
linear programming (Richards and Cahn 1971; Sanchez and de Fontaine 1981) 
and the other is called the method of inequality (Kanamori and Kakehashi 1977). 
Both yield identical ground-state structures within the pair interaction model,
which is our main concern. The stable ordered structures predicted at 1:1
stoichiometric composition are shown in Fig. 10.3 for an fcc-based system as a
function of $v_{2,2}/v_{2,1}$, which is utilized in the discussions in Section 10.4. The
ground-state analysis for bcc-based systems is reported by Allen and Cahn (1972).


10.2.3 
Entropy and Cluster Variation Method 


The discussions so far have been limited to 0 K, at which interaction energies
play a dominant role in determining an equilibrium atomic configuration, while
at finite temperatures one needs to consider the entropy contributions. In this
section, we focus only on the configurational entropy.

The configurational entropy for a binary system can be formally given as in Eq.
(52), where $k_B$ is the Boltzmann constant and W is the number of possible ways
of distributing A and B atoms on a given lattice.

$$ S = k_B \cdot \ln W \tag{52} $$


In the most primitive approach, W is given as in Eq. (53) and the substitution of 
Eq. (53) into Eq. (52) yields Eq. (54). 


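$$ W = {}_{N}C_{N_A} \cdot {}_{N - N_A}C_{N_B} = \frac{N!}{N_A! \cdot N_B!} = \frac{N!}{\prod_i (N x_i)!} \tag{53} $$

$$ S = k_B \cdot \ln W = k_B \cdot \ln \frac{N!}{\prod_i (N x_i)!} \tag{54} $$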


By employing Stirling's formula, $\ln x! \approx x \ln x - x$, Eq. (54) is rewritten as Eq.
(55).

$$ S = -N k_B \left(\frac{N_A}{N} \ln \frac{N_A}{N} + \frac{N_B}{N} \ln \frac{N_B}{N}\right) \tag{55} $$

Finally, division by N yields the entropy per lattice point, given by Eq. (56).

$$ \frac{S}{N} = -k_B\,(x_A \cdot \ln x_A + x_B \cdot \ln x_B) = -k_B \sum_i x_i \ln x_i \tag{56} $$


This celebrated entropy formula has been known as the Bragg-Williams approxi- 
mation (hereafter abbreviated to BW) (Bragg and Williams 1934). 
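
The quality of Stirling's formula in the step from Eq. (54) to Eq. (56) can be checked by comparing the exact logarithm of W with the Bragg-Williams expression. The sketch below works in units of the Boltzmann constant; the lattice size and composition are hypothetical, and `lgamma` from the Python standard library is used to evaluate the logarithm of a factorial.

```python
from math import lgamma, log

def exact_entropy_per_site(N, N_A):
    """ln(W) / N with W = N! / (N_A! N_B!), Eqs. (53)-(54), in units of k_B."""
    N_B = N - N_A
    ln_W = lgamma(N + 1) - lgamma(N_A + 1) - lgamma(N_B + 1)
    return ln_W / N

def bw_entropy_per_site(x_A):
    """Bragg-Williams entropy per lattice point, Eq. (56), in units of k_B."""
    x_B = 1.0 - x_A
    return -(x_A * log(x_A) + x_B * log(x_B))

# Hypothetical lattice: one million sites, 25 at.% A.
N, N_A = 10**6, 250_000

s_exact = exact_entropy_per_site(N, N_A)
s_bw = bw_entropy_per_site(N_A / N)

print(s_exact, s_bw, s_bw - s_exact)
```

The difference between the two values shrinks as N grows, which is why the per-site formula is used for macroscopic systems.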

However, we soon realize that Eq. (53) assumes that each atomic species be- 
haves independently like a rigid sphere. In reality, atoms interact with each other, 
and therefore their configurations are correlated with each other. Therefore, we 
focus our attention on the arrangement of atomic pairs instead of individual 
atoms. W in Eq. (52) for this case can be formulated easily as Eq. (57), where N2 
is equivalent to N(2) defined in Eq. (17). 


$$ W_{pair} = {}_{N_2}C_{N_{AA}} \cdot {}_{(N_2 - N_{AA})}C_{N_{AB}} \cdot {}_{(N_2 - N_{AA} - N_{AB})}C_{N_{BA}} \cdot {}_{(N_2 - N_{AA} - N_{AB} - N_{BA})}C_{N_{BB}} = \frac{N_2!}{N_{AA}! \cdot N_{AB}! \cdot N_{BA}! \cdot N_{BB}!} \tag{57} $$

Note that ${}_{m}C_{n}$ represents $m!/\{n! \cdot (m - n)!\}$. This equation, however, does not
guarantee that the total number of each species is maintained as $N_i$. In other
words, the geometrical condition introduced in Section 10.2.1 does not necessarily
hold. When one attempts to derive W by including atomic arrangements in
a bigger cluster to take the wider correlations into account, the problem becomes
very complicated. The most efficient and systematic way to circumvent such an
inconvenience is to employ the cluster variation method (CVM) devised by Kikuchi
(1951). In the following, we reproduce the essential points of the CVM (Kikuchi
and Mohri 1997; Mohri 2002) by referring to the smallest cluster, i.e., a pair
cluster.

We consider an ensemble which consists of the two-atom systems in Fig. 10.4
(Mohri 2002), and the number of systems included in the ensemble is specified
by L. Then, $W_{pair}$ under a given pair probability $y_{ij}$ is given by Eq. (58);
$\prod_{i,j}(L \cdot y_{ij})!$ in this equation appears frequently below, and is abbreviated to
$\{\mathrm{Pair}\}_L$.

$$ W_{pair} = \frac{L!}{\prod_{i,j}(L \cdot y_{ij})!} \equiv \frac{L!}{\{\mathrm{Pair}\}_L} \tag{58} $$


Next, we consider W in the left- and right-hand columns of the ensemble
independently. One can readily derive Eqs. (59) and (60), where a similar abbreviated
notation is defined as $\{\mathrm{Pt}\}_L$ (Pt stands for point) for $\prod_i (L x_i)!$.

$$ W_{Left\,Pt} = \frac{L!}{\prod_i (L x_i)!} \equiv \frac{L!}{\{\mathrm{Left\ Pt}\}_L} \tag{59} $$

$$ W_{Right\,Pt} = \frac{L!}{\prod_j (L x_j)!} \equiv \frac{L!}{\{\mathrm{Right\ Pt}\}_L} \tag{60} $$
J 








Fig. 10.4 Ensemble of two-atom systems (Mohri 2002).

Furthermore, in Eqs. (59) and (60), the left- and right-hand columns are
distinguished by $\{\mathrm{Left\ Pt}\}_L$ and $\{\mathrm{Right\ Pt}\}_L$, respectively. The product of
$W_{Left\,Pt}$ and $W_{Right\,Pt}$ is a special case of $W_{pair}$ in which a pair of atoms in a
system behave independently. Hence, it is necessary to introduce a correction factor to
obtain a general expression, Eq. (61), where $G_{pair}$ is the correlation correction factor.

$$ W_{pair} = W_{Left\,Pt} \cdot W_{Right\,Pt} \cdot G_{pair} \tag{61} $$

From Eqs. (58-61), the correlation correction factor is obtained as in Eq. (62).

$$ G_{pair} = \frac{\{\mathrm{Left\ Pt}\}_L \cdot \{\mathrm{Right\ Pt}\}_L}{L! \cdot \{\mathrm{Pair}\}_L} \tag{62} $$


Note that {Left Pt}, and {Right Pt}, are not distinguishable in the disordered 
phase. 

The discussion so far has been limited to a two-atom system. We will now ex- 
tend it to an N-atom system and consider an ensemble as shown in Fig. 10.5 
(Mohri 2002). When atoms in each system behave independently, W can be given 
as in Eq. (63). 


$$ W = (W_{Pt})^N = \left(\frac{L!}{\{\mathrm{Pt}\}_L}\right)^{N} \tag{63} $$





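Noting that Eq. (64) holds for large values of N and L, Eq. (63) is substituted into Eq.
(52) to obtain Eq. (65). This is the entropy for L systems.

$$ \left(\frac{L!}{\{\mathrm{Pt}\}_L}\right)^{N} = \left(\frac{N!}{\{\mathrm{Pt}\}_N}\right)^{L} \tag{64} $$

$$ S_L = k_B \cdot \ln \left(\frac{N!}{\prod_i (N x_i)!}\right)^{L} \tag{65} $$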


By dividing Eq. (65) by L, one can obtain the entropy per system from Eq. (66).

$$ S_{BW} = \frac{S_L}{L} = \frac{1}{L} \cdot k_B \cdot \ln W = k_B \cdot \ln \frac{N!}{\prod_i (N x_i)!} \tag{66} $$

Fig. 10.5 Ensemble of N-atom systems (Mohri 2002).


This is nothing but the entropy expression given in Eq. (54), which leads to the BW
approximation in Eq. (56).

Now we extend the above procedures developed for a two-atom system to N- 
atom systems in Fig. 10.5. 

The total number of atomic pairs in the system is given by $N \cdot \omega$, where $\omega$ is
one-half of the coordination number Z. When atoms in each pair are correlated
and cannot behave independently, one needs to make the correction given in Eq.
(62) for each pair. W is therefore given by Eq. (67).

$$ W = (W_{Pt})^{N} \cdot (G_{pair})^{N \omega} \tag{67} $$


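By substituting Eq. (67) into Eq. (52) via Eqs. (62) and (63), one can obtain Eq. (68).

$$ S = k_B \cdot \ln \left\{\left(\frac{L!}{\{\mathrm{Pt}\}_L}\right)^{N} \cdot \left(\frac{\{\mathrm{Pt}\}_L \cdot \{\mathrm{Pt}\}_L}{L! \cdot \{\mathrm{Pair}\}_L}\right)^{N \omega}\right\} = k_B \cdot \ln \frac{(\{\mathrm{Pt}\}_L)^{(2\omega - 1) N}}{(L!)^{(\omega - 1) N} \cdot (\{\mathrm{Pair}\}_L)^{\omega N}} \tag{68} $$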


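By noting that Eq. (64) holds also for $\{\mathrm{Pair}\}$, the entropy per system is derived as
in Eq. (69).

$$ S_{pair} = \frac{1}{L} \cdot k_B \cdot \ln W = k_B \cdot \ln \frac{(\{\mathrm{Pt}\}_N)^{2\omega - 1}}{(N!)^{\omega - 1} \cdot (\{\mathrm{Pair}\}_N)^{\omega}} = k_B \cdot \ln \frac{\bigl(\prod_i (N x_i)!\bigr)^{2\omega - 1}}{(N!)^{\omega - 1} \cdot \bigl(\prod_{i,j} (N y_{ij})!\bigr)^{\omega}} \tag{69} $$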


This is a general expression of the entropy formula within the pair
approximation. By employing Stirling's approximation, one can readily show that the
entropy per atom is given by Eq. (70).

$$ S = k_B \cdot \Bigl\{(2\omega - 1) \sum_i (x_i \ln x_i - x_i) - \omega \sum_{i,j} (y_{ij} \ln y_{ij} - y_{ij}) + (\omega - 1)\Bigr\} \tag{70} $$

As compared with Eq. (56) of the Bragg-Williams formula, one realizes that the
pair probability, $y_{ij}$, is incorporated in Eq. (70) in addition to the point probability,
$x_i$.

The largest cluster contained in the entropy formula is termed the basic cluster,
which provides a measure of the level of the approximation. Within the BW
approximation, a lattice point is the basic cluster, while for the pair approximation a
nearest-neighbor pair is the basic cluster. Based on the basic cluster, the BW
approximation has often been called a point approximation.
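
A useful consistency check on Eq. (70) is that it reduces to the Bragg-Williams entropy of Eq. (56) when the pair probabilities factorize into products of point probabilities, and that any additional pair correlation lowers the entropy. The sketch below works in units of the Boltzmann constant; the composition, the value of omega (6, i.e., half of an fcc coordination number of 12), and the correlated pair probabilities are hypothetical.

```python
from math import log

def pair_entropy(x, y, omega):
    """Entropy per atom in the pair approximation, Eq. (70), in units of k_B.

    x maps species i to x_i; y maps (i, j) to y_ij; omega = Z / 2.
    """
    s_pt = sum(p * log(p) - p for p in x.values() if p > 0)
    s_pair = sum(p * log(p) - p for p in y.values() if p > 0)
    return (2 * omega - 1) * s_pt - omega * s_pair + (omega - 1)

omega = 6
x = {1: 0.7, -1: 0.3}

# Uncorrelated pairs: y_ij = x_i * x_j.
y_random = {(i, j): x[i] * x[j] for i in x for j in x}

# Correlated pairs from Eq. (22) with xi_1 = 0.4 and xi_2 = 0 (xi_2 < xi_1**2).
y_sro = {(1, 1): 0.45, (1, -1): 0.25, (-1, 1): 0.25, (-1, -1): 0.05}

s_bw = -sum(p * log(p) for p in x.values())   # Eq. (56)
s_random = pair_entropy(x, y_random, omega)
s_sro = pair_entropy(x, y_sro, omega)

print(s_bw, s_random, s_sro)
```

Both sets of pair probabilities satisfy the geometrical condition of Eq. (3) for the same point probabilities, so the comparison isolates the effect of the pair correlation.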

The discussions above can be extended to a further larger basic cluster. In fact, 
the most conventional approximation employed in the fcc-based system is the tet- 
rahedron approximation (Kikuchi 1974), in which the nearest-neighbor tetrahe- 
dron cluster is taken as the basic cluster. One can show that the entropy formula 
in this case can be written as Eq. (71). 

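$$ S_{tetra} = k_B \cdot \ln \frac{\bigl(\prod_{i,j} (N y_{ij})!\bigr)^{6} \cdot N!}{\bigl(\prod_i (N x_i)!\bigr)^{5} \cdot \bigl(\prod_{i,j,k,l} (N w_{ijkl})!\bigr)^{2}} \tag{71} $$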





It can be shown that the cluster concentration of a subcluster which is not shared by
two basic clusters does not appear in the entropy formula. Indeed, one notices
that the nearest-neighbor triangle cluster probability $z_{ijk}$ is not contained in Eq.
(71) while point and pair cluster probabilities are both involved in the entropy
formula. This is because the triangle is not shared by two tetrahedron clusters in the
fcc lattice.

For an fcc lattice, a further larger basic cluster is often employed. This is the 
cluster which is made up of the combination of a tetrahedron and an octahedron 
cluster as shown in Fig. 10.6 (Mohri 2002). 

The resultant entropy formula is called the tetrahedron-octahedron (TO)
approximation (Sanchez and de Fontaine 1978; Mohri et al. 1985a) and is given by
Eq. (72), where $v_{ijklmn}$ is the cluster probability of the octahedron cluster.

$$ S_{TO} = k_B \cdot \ln \frac{\bigl(\prod_{i,j,k} (N z_{ijk})!\bigr)^{8} \cdot \bigl(\prod_i (N x_i)!\bigr)}{\bigl(\prod_{i,j,k,l,m,n} (N v_{ijklmn})!\bigr) \cdot \bigl(\prod_{i,j,k,l} (N w_{ijkl})!\bigr)^{2} \cdot \bigl(\prod_{i,j} (N y_{ij})!\bigr)^{6}} \tag{72} $$

Fig. 10.6 Tetrahedron-octahedron cluster (Mohri 2002).


10.2.4
Free Energy


By combining internal energy and entropy, we derive the free energy expression. 
The internal energy of a system with N atoms is obtained by multiplying Eq. (42) 
by N, giving Eq. (73). 


$$ E = \frac{1}{2} \cdot N \cdot Z \cdot \sum_{i,j} e_{ij} \cdot y_{ij} \tag{73} $$


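The combination with the entropy formula within the tetrahedron approximation
in Eq. (71) yields the Helmholtz free energy, Eq. (74).

$$ F = E - T \cdot S_{tetra} = \frac{1}{2} \cdot N \cdot Z \cdot \sum_{i,j} e_{ij} \cdot y_{ij} - k_B T \cdot \ln \frac{\bigl(\prod_{i,j} (N y_{ij})!\bigr)^{6} \cdot N!}{\bigl(\prod_i (N x_i)!\bigr)^{5} \cdot \bigl(\prod_{i,j,k,l} (N w_{ijkl})!\bigr)^{2}} \tag{74} $$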
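This is easily extended to an ordered phase; the free energy of the $L1_0$ ordered
phase, for instance, is given by Eq. (75).

$$ F^{L1_0} = N \sum_{i,j} e_{ij} \cdot \bigl(y_{ij}^{\alpha\alpha} + 4\,y_{ij}^{\alpha\beta} + y_{ij}^{\beta\beta}\bigr) - k_B T \cdot \ln \frac{\bigl\{\prod_{i,j} (N y_{ij}^{\alpha\alpha})!\bigr\} \cdot \bigl\{\prod_{i,j} (N y_{ij}^{\alpha\beta})!\bigr\}^{4} \cdot \bigl\{\prod_{i,j} (N y_{ij}^{\beta\beta})!\bigr\} \cdot N!}{\bigl\{\prod_i (N x_i^{\alpha})!\bigr\}^{5/2} \cdot \bigl\{\prod_i (N x_i^{\beta})!\bigr\}^{5/2} \cdot \bigl\{\prod_{i,j,k,l} (N w_{ijkl}^{\alpha\alpha\beta\beta})!\bigr\}^{2}} \tag{75} $$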





The minimum of Eq. (74) [and Eq. (75)] suggests an equilibrium state. Two 
major minimization procedures are available for given temperature and atomic 
pair interaction energies, $e_{AA}$, $e_{AB}$, and $e_{BB}$. One is the natural iteration method
(NIM) (Kikuchi 1974), which minimizes F in Eq. (74) [and Eq. (75)] under a con- 
straint originating from a normalization condition which is added through the 
Lagrange multiplier. The other is the Newton—Raphson procedure, for which Eq. 
(74) [and Eq. (75)] is rewritten in terms of independent variables. The
substitution of Eqs. (16), (22), and (33) into Eq. (74) formally provides the free energy F
as Eq. (76), and the minimization of F is expressed as Eq. (77).

$$ F = F(T, \{e_{ij}\}; \xi_1, \xi_2, \xi_3, \xi_4) = E(\{e_{ij}\}; \xi_1, \xi_2) - T \cdot S(\xi_1, \xi_2, \xi_3, \xi_4) \tag{76} $$

$$ \left.\frac{\partial F}{\partial \xi_m}\right|_{T, \{e_{ij}\}, \{\xi_{n \neq m}\}} = 0 \tag{77} $$
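
The Newton-Raphson route can be illustrated on a case small enough to solve by hand. The sketch below is not the tetrahedron calculation of Eq. (74); as a simplifying assumption it uses the pair approximation, combining the energy of Eq. (45) with the entropy of Eq. (70), for a disordered phase at 50 at.% where the point correlation function vanishes. The only variational parameter is then the pair correlation function, and the stationarity condition has the closed-form solution $\xi_2 = -\tanh\{v_2/(\omega k_B T)\}$, which serves as a check. The parameter values and the function name are hypothetical.

```python
from math import atanh, tanh

def minimize_pair_free_energy(v2, T, omega, k_B=1.0, tol=1e-12, max_iter=50):
    """Newton-Raphson search for the pair correlation function xi_2 at 50 at.%.

    Free energy per atom: f = v0 + v2 * xi2 - T * S, with S from Eq. (70)
    and pair probabilities from Eq. (22) at xi_1 = 0.
    """
    xi2 = 0.0                                          # start from the random state
    for _ in range(max_iter):
        grad = v2 + k_B * T * omega * atanh(xi2)       # df / d(xi2)
        hess = k_B * T * omega / (1.0 - xi2 * xi2)     # d2f / d(xi2)^2
        step = grad / hess
        xi2 -= step
        # Keep the iterate strictly inside the physical range (-1, 1).
        xi2 = max(min(xi2, 1.0 - 1e-12), -1.0 + 1e-12)
        if abs(step) < tol:
            break
    return xi2

# Hypothetical inputs: ordering tendency (v2 > 0), fcc-like omega = Z / 2 = 6.
v2, T, omega = 1.0, 2.0, 6
xi2_eq = minimize_pair_free_energy(v2, T, omega)
xi2_closed = -tanh(v2 / (omega * T))

print(xi2_eq, xi2_closed)
```

A positive $v_2$ drives the pair correlation function negative, i.e., toward a preference for unlike nearest neighbors, and the effect weakens as the temperature increases.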

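When the second nearest-neighbor pair interaction $e_{ij}^{(2)}$ is considered in the
internal energy, E can be given as in Eq. (78), where $Z_k$, $e_{ij}^{(k)}$, $y_{ij}^{(k)}$, and
$\xi_{2,k}$ are the coordination number, atomic pair interaction energy, cluster probability,
and pair correlation function for a kth nearest-neighbor pair, respectively.

$$ E = \frac{1}{2} \cdot N \cdot \sum_{k=1}^{2} Z_k \sum_{i,j} e_{ij}^{(k)} \cdot y_{ij}^{(k)} = E\bigl(\{e_{ij}^{(1)}\}, \{e_{ij}^{(2)}\}; \xi_1, \xi_{2,1}, \xi_{2,2}\bigr) \tag{78} $$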

It is noticed, however, that one cannot add entropy formulae for point [Eq. (56)], 
pair [Eq. (70)], and tetrahedron approximation [Eq. (71)] to Eq. (78) to construct a 
free energy. This is because the atomic correlation generally exceeds the range of 
atomic interactions, and therefore the largest cluster considered in the internal 
energy term should be involved in the basic cluster of the entropy term. Hence, 
for the internal energy term given by Eq. (78), the minimum meaningful entropy 
formula is the TO approximation given by Eq. (72). A combination of Eqs. (72) 
and (78) provides the free energy as Eq. (79). 


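$$ F = \frac{1}{2} \cdot N \cdot \sum_{k=1}^{2} Z_k \sum_{i,j} e_{ij}^{(k)} \cdot y_{ij}^{(k)} - k_B T \cdot \ln \frac{\bigl(\prod_{i,j,k} (N z_{ijk})!\bigr)^{8} \cdot \bigl(\prod_i (N x_i)!\bigr)}{\bigl(\prod_{i,j,k,l,m,n} (N v_{ijklmn})!\bigr) \cdot \bigl(\prod_{i,j,k,l} (N w_{ijkl})!\bigr)^{2} \cdot \bigl(\prod_{i,j} (N y_{ij})!\bigr)^{6}} \tag{79} $$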


This equation can be rewritten formally in terms of correlation functions as Eq.
(80), where the subscript m of the correlation function, $\xi_{m,s}$, indicates a pair
(m = 2), three-body (m = 3), four-body (m = 4), five-body (m = 5) and
octahedron (m = 6) cluster, and the subscript s distinguishes the different types
(shapes) of the cluster.

$$ F\bigl(T, \{e_{ij}^{(1)}\}, \{e_{ij}^{(2)}\}; \xi_1, \xi_{2,1}, \xi_{2,2}, \xi_{3,1}, \xi_{3,2}, \xi_{4,1}, \xi_{4,2}, \xi_{4,3}, \xi_5, \xi_6\bigr) = E\bigl(\{e_{ij}^{(1)}\}, \{e_{ij}^{(2)}\}; \xi_1, \xi_{2,1}, \xi_{2,2}\bigr) - T \cdot S(\xi_1, \xi_{2,1}, \xi_{2,2}, \xi_{3,1}, \xi_{3,2}, \xi_{4,1}, \xi_{4,2}, \xi_{4,3}, \xi_5, \xi_6) \tag{80} $$




Table 10.1 Subclusters contained in the tetrahedron-octahedron cluster:
see Fig. 10.6 (Mohri 2002).

Correlation function    Lattice points
$\xi_1$                 a
$\xi_{2,1}$             ab
$\xi_{3,1}$             acd
$\xi_{4,1}$             abcd
$\xi_{2,2}$             ag
$\xi_{3,2}$             ace
$\xi_{4,2}$             acde
$\xi_{4,3}$             cdef
$\xi_{5,1}$             acdeg
$\xi_6$                 acdefg





The type of each correlation function appearing in Eq. (80) can be identified in 
Fig. 10.6 and Table 10.1 (Mohri 2002). 

In summarizing the discussions above, it is necessary that the relationship of 
Eq. (81) should be obeyed for the free energy formula given by Eq. (82). 


Emax < Smax (81) 


$$ F = E(\xi_1, \xi_{2,1}, \xi_{2,2}, \ldots, \xi_{E_{max}}) - T \cdot S(\xi_1, \xi_{2,1}, \xi_{2,2}, \ldots, \xi_{S_{max}}) \tag{82} $$


Recalling that the second nearest-neighbor pair interaction is necessary to distin- 
guish the stability of various ordered phases as shown in Fig. 10.3, it is realized 
that the TO approximation is necessary for the discussion of the phase equilibria 
involving these phases at a finite temperature. 

The essential task to be achieved by CVM is to determine the exponent of the 
cluster probability such as {Pt}, {Pair},... {subcluster} in the entropy formula. 
We note that there are alternative ways, and Barker’s formula (Barker 1953) has 
been recognized as one of the most powerful formulae with which to determine 
the exponents in a systematic way. 

It is worth pointing out that besides the rigorous CVM-based free energy ex- 
pressions discussed above, a number of free energy models have been proposed. 
Among them, the CALPHAD type of model has been recognized as one of the 
most powerful in dealing with practical multicomponent alloy systems. The es- 
sential points are summarized in the Appendix. 


10.2.5 
Relative Stability and Intrinsic Stability 


In the discussion of the stability of a system, one needs to distinguish the relative 
stability and the intrinsic stability of the system. The relative stability can be mea- 
sured by comparing the magnitude of the energy of a phase of interest against 
those of other competing phases, while the intrinsic stability is defined as the sta- 
bility of a system against fluctuations. We start the discussion with the relative 
stability. 

Figure 10.7 is a schematic diagram of the heats of formation of various ordered
phases at four stoichiometric compositions, $A_3B$, $A_2B$, AB, and $AB_3$. The energy
reference state, namely the zero energy level, is the segregation limit, which is
defined as the concentration average of the total energies of pure A and pure B.
At each stoichiometric composition, heats of formation of several ordered phases
are plotted and the phase at the lowest energy is the most stable phase at that
stoichiometry. For instance, $P_1$ is the most stable phase among six competing
phases at the stoichiometric composition $A_3B$. The stability of other phases is
discussed based on the hierarchy of energies. For instance, the phase just above $P_1$
in the figure is less stable than $P_1$ but more stable than all the other phases at the
same stoichiometry. Likewise, at each stoichiometry, the phase with the lowest
energy is the most stable phase; it is marked by a solid circle and denoted as
$P_i$, where i = 1, 2, 3, and 4 correspond to $A_3B$, $A_2B$, AB, and $AB_3$ stoichiometries,
respectively.

Fig. 10.7 Schematic diagram of the heats of formation of ordered phases
(vertical axis: formation energy). $P_1$ and $P_3$ are the stable ordered phases.

Since the formation energies of these phases, $P_1$-$P_4$, the lowest at each
stoichiometry, are negative with reference to the segregation limit, these phases are
stable against phase separation into A and B. However, the minimum energy
state is examined more carefully by connecting each point. One realizes that $P_2$
($P_4$) is located above the line connecting $P_1$ ($P_3$) and $P_3$ (B), indicating that $P_2$
($P_4$) is unstable against separation into $P_1$ ($P_3$) and $P_3$ (B). Thus, a necessary
condition for the most stable state realized in the system is that the energy apices
lie on the lower convex envelope of the formation energies. The difference in the
formation energies is a measure of the relative stability. If this is small, the
hierarchy may be reversed by some external or internal causes.
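
The construction of connecting the lowest-energy points and discarding those that lie above a connecting line is a lower convex envelope search, which can be sketched in a few lines. The formation energies below are hypothetical numbers chosen so that the outcome mirrors the situation described for Fig. 10.7; they are not taken from the figure, and the function name `stable_phases` is illustrative.

```python
def stable_phases(points):
    """Return the points lying on the lower convex envelope.

    points : list of (concentration of B, formation energy) pairs, one
             lowest-energy phase per composition, with the pure elements
             included at zero formation energy (segregation limit).
    """
    hull = []
    for p in sorted(points):
        # Remove the last vertex while it lies on or above the straight
        # line from the vertex before it to the new point p.
        while len(hull) >= 2:
            (x1, e1), (x2, e2) = hull[-2], hull[-1]
            cross = (x2 - x1) * (p[1] - e1) - (e2 - e1) * (p[0] - x1)
            if cross <= 0:
                hull.pop()
            else:
                break
        hull.append(p)
    return hull

# Hypothetical lowest formation energies at each stoichiometry.
phases = {
    "A": (0.0, 0.0),
    "P1": (0.25, -0.30),     # A3B
    "P2": (1 / 3, -0.32),    # A2B
    "P3": (0.5, -0.50),      # AB
    "P4": (0.75, -0.20),     # AB3
    "B": (1.0, 0.0),
}

envelope = stable_phases(list(phases.values()))
stable = [name for name, point in phases.items() if point in envelope]

print(stable)
```

With these numbers the two intermediate phases that are removed have negative formation energies, so they are stable against separation into the pure elements but not against separation into the neighboring ordered phases.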

In order to discuss intrinsic stability, we start with a thermodynamic potential
given as Eq. (83), where $\{\varphi_i\}$ forms an independent set of variables.

$$ f = f(\{\varphi_i\}) \tag{83} $$

Among various thermodynamic potentials, we suppose f is an internal energy,
and entropy S, volume V, and the number of species (equivalently the number
of lattice points for a pure metal) N are assigned to the first three variables, $\varphi_1$,
$\varphi_2$, and $\varphi_3$, respectively. Note that these are all extensive variables. Then, the
explicit form of the first-order derivative provides Eqs. (84-86), where T, p, and $\mu$
are the temperature, pressure, and chemical potential, which are the conjugate
variables of the entropy, volume, and number of species, respectively.

$$ \frac{\partial f}{\partial \varphi_1} = \frac{\partial f}{\partial S} = T \tag{84} $$

$$ \frac{\partial f}{\partial \varphi_2} = \frac{\partial f}{\partial V} = -p \tag{85} $$

$$ \frac{\partial f}{\partial \varphi_3} = \frac{\partial f}{\partial N} = \mu \tag{86} $$

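When the second-order derivative, $\partial^2 f / \partial \varphi_i^2$, is taken, the result provides a
measure of the stability of the system. In fact, a system generally undergoes incessant
fluctuations of the variables $\{\varphi_i\}$, and a positive second-order derivative assures
a restoring force toward the equilibrium state. It may be easier to grasp the
physical significance by examining S, V, and $N_i$ explicitly. Then, one obtains
Eqs. (87-89) as the stability criteria.

$$ \frac{\partial^2 f}{\partial \varphi_1^2} = \frac{\partial^2 f}{\partial S^2} = \frac{\partial T}{\partial S} \approx \frac{\Delta T}{\Delta S} > 0 \tag{87} $$

$$ \frac{\partial^2 f}{\partial \varphi_2^2} = \frac{\partial^2 f}{\partial V^2} = -\frac{\partial p}{\partial V} \approx -\frac{\Delta p}{\Delta V} > 0 \tag{88} $$

$$ \frac{\partial^2 f}{\partial \varphi_3^2} = \frac{\partial^2 f}{\partial N_i^2} = \frac{\partial \mu_i}{\partial N_i} \left(\propto \frac{\partial^2 f}{\partial c_i^2}\right) > 0 \tag{89} $$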


Eq. (87) claims that if the heat is absorbed (AS > 0) in the system, the tempera- 
ture should increase (AT > 0); this is the thermal stability criterion. Likewise, if 
the volume of the system is decreased (AV < 0), the pressure should increase 
(Ap > 0); this is the mechanical stability criterion given by Eq. (88). These coin- 
cide with our general intuition. Finally, the third equation (89) indicates the 
chemical stability criterion which, if negative, is the condition of spinodal decom- 
position. 
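
The chemical stability criterion can be made concrete with a simple model free energy. The sketch below does not use the CVM free energy of this chapter; as a stated assumption it takes a regular-solution (point approximation) form, $f(c) = \Omega\,c(1 - c) + k_B T\,\{c \ln c + (1 - c) \ln (1 - c)\}$, with a hypothetical interaction parameter $\Omega > 0$ that favors phase separation. For this model the second derivative is $-2\Omega + k_B T / \{c(1 - c)\}$, so the spinodal temperature is $2\Omega\,c(1 - c)/k_B$.

```python
from math import log

def free_energy(c, T, Omega, k_B=1.0):
    """Assumed regular-solution free energy per lattice point."""
    return Omega * c * (1 - c) + k_B * T * (c * log(c) + (1 - c) * log(1 - c))

def second_derivative(c, T, Omega, k_B=1.0, h=1e-4):
    """Central finite-difference estimate of the second derivative in c."""
    f_plus = free_energy(c + h, T, Omega, k_B)
    f_mid = free_energy(c, T, Omega, k_B)
    f_minus = free_energy(c - h, T, Omega, k_B)
    return (f_plus - 2 * f_mid + f_minus) / h**2

def is_chemically_stable(c, T, Omega):
    """Chemical stability criterion: positive curvature of f(c)."""
    return second_derivative(c, T, Omega) > 0

Omega = 1.0                            # hypothetical interaction parameter
T_spinodal = 2 * Omega * 0.5 * 0.5     # analytic spinodal temperature at c = 0.5

print(is_chemically_stable(0.5, 0.6, Omega))   # above the spinodal temperature
print(is_chemically_stable(0.5, 0.4, Omega))   # below the spinodal temperature
```

Below the spinodal temperature the curvature is negative, so a concentration fluctuation lowers the free energy and grows spontaneously.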

The arguments can be extended to other variables. As was discussed in the
previous section, the configurational free energy is a multivariable function of
correlation functions and within the tetrahedron approximation of the CVM, the free
energy of a disordered phase given by Eq. (74) is rewritten symbolically in Eq. (90)
by converting atomic interactions and cluster probabilities to effective interactions
and correlation functions, respectively.

$$ f^{dis}(\{\xi_i\}) = f(T, v_2; \xi_1, \xi_2, \xi_3, \xi_4) \tag{90} $$


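Likewise, the free energy of the $L1_0$ ordered phase given by Eq. (75) is expressed
as Eq. (91).

$$ f^{L1_0}(\{\xi_i\}) = f\bigl(T, v_2; \xi_1^{\alpha}, \xi_1^{\beta}, \xi_2^{\alpha\alpha}, \xi_2^{\alpha\beta}, \xi_2^{\beta\beta}, \xi_3^{\alpha\alpha\beta}, \xi_3^{\alpha\beta\beta}, \xi_4^{\alpha\alpha\beta\beta}\bigr) \tag{91} $$

Note that the variables before the semicolon are given a priori, while those after it
are unknown, i.e., they are the variational parameters.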

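When Eqs. (90) and (91) are minimized with respect to each correlation
function [Eq. (92)], one can determine the phase equilibria by a common tangent
construction.

$$ \left(\frac{\partial f^{dis}}{\partial \xi_i}\right)_{T, \{\xi_{j \neq i}\}} = 0 \quad \text{and} \quad \left(\frac{\partial f^{L1_0}}{\partial \xi_i}\right)_{T, \{\xi_{j \neq i}\}} = 0 \tag{92} $$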


The L1$_0$-disorder phase diagram calculated in this way is shown in Fig.
10.8 (Mohri 1992, 1994a) in the vicinity of 50 at.%. The temperature axis is normalized
with respect to the nearest-neighbor effective pair interaction energy $v_2$.
The transition temperature at 50 at.% is determined as 1.893 in these normalized
units. Since the pair interaction energy is independent of concentration, the phase
boundary is symmetrical around 50 at.%.

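For simplicity, the following discussion is limited to the L1$_0$ ordered phase at a
1:1 stoichiometric composition. Then, due to the symmetry of the L1$_0$ ordered
phase, some of the correlation functions in Eq. (91) are mutually related, as
$\xi_1^{\beta} = -\xi_1^{\alpha}$, $\xi_2^{\beta\beta} = \xi_2^{\alpha\alpha}$, and $\xi_3^{\alpha\beta\beta} = -\xi_3^{\alpha\alpha\beta}$; hence the resultant free energy is rewritten
as Eq. (93).


\[ f^{L1_0}(\{\xi_i\}) = f(T, v_2;\ \xi_1^{\alpha}, \xi_2^{\alpha\alpha}, \xi_2^{\alpha\beta}, \xi_3^{\alpha\alpha\beta}, \xi_4^{\alpha\alpha\beta\beta}) \tag{93} \]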


It should be noted that $\xi_1^{\alpha}$ serves as a long-range order (LRO) parameter while all
the others are short-range order (SRO) parameters. By following a convention,
the LRO parameter is expressed by $\eta$ in the following discussion. When the free
energy is minimized for a given temperature and LRO parameter, $\eta$, with respect
to the SRO parameters only [Eq. (94)], one can obtain the free energy, $f$, as a
function of the LRO parameter, $\eta$, at each temperature. This is constrained
minimization.

Fig. 10.8 L1$_0$ phase diagram of the L1$_0$-disorder system in the vicinity of
50 at.% as calculated by the tetrahedron approximation of the CVM. The
temperature axis is normalized by the nearest-neighbor pair interaction
energy. The broken line indicates the locus of the spinodal ordering
temperature (Mohri 1992, 1994a). [Axes: normalized temperature versus
concentration, 0.4 to 0.6; region label: Disorder.]


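\[ \left( \frac{\partial f^{L1_0}}{\partial \xi_i} \right)_{T, \eta, \{\xi_j \neq \xi_i\}} = 0 \tag{94} \]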





In Fig. 10.9 (Mohri 2000a,b) $f(\eta)$ is shown at six representative temperatures at
1:1 stoichiometry. An $\eta$ value of unity corresponds to a perfectly ordered phase,
which is realized only in the ground state, while a null value indicates a completely
random solid solution, and a finite value suggests some degree of order.
At the highest temperature, $T = 2.30$ (Fig. 10.9a), which is in the disordered
phase region (see Fig. 10.8), the lowest free energy is found at $\eta = 0$, and at the
lowest temperature, $T = 1.60$ (see Fig. 10.9f), which is in the L1$_0$ ordered phase
region, the minimum in free energy appears very close to unity. The shifting of
energy extrema between these two temperatures is demonstrated well in the
figures.

Fig. 10.9 Free energy versus LRO parameter at six temperatures. The
number on each graph is the temperature normalized with respect to
the nearest-neighbor pair interaction energy (Mohri 2000a,b).

Two characteristic temperatures should be addressed. One is the temperature
$T = 1.893$ (Fig. 10.9c), at which two free energy minima can be connected by a
common tangent, which is the indication of the transition temperature, $T_t$. The
other is $T = 1.633$ (Fig. 10.9e), below which the free energy hump between the
disordered and ordered phases disappears and the free energy curve decreases
monotonically from the disordered state to the ordered state. This temperature is
called the spinodal ordering temperature (de Fontaine 1975; Mohri et al. 1985b),
$T_{so}$, of which the trace is indicated by a broken line in Fig. 10.8. A spinodal
disordering temperature, $T_{sd} = 1.94$, corresponding to $T_{so}$ (Mohri 2000a,b) appears in
the disordered phase region although it is not shown in the phase diagram in Fig.
10.8.

Let us suppose that a disordered phase is quenched from $T = 2.3$ down to 1.80.
The final equilibrium state is the ordered phase characterized by $\eta$ close to 0.90
(see Fig. 10.9d); however, due to the free energy hump, the system remains in a
metastable state at $\eta = 0$. In order for the system to reach the final equilibrium
state, fluctuations are necessary to overcome the free energy barrier. This is
an indication of the first-order transition. If, in contrast, the system is quenched
down to 1.60 (Fig. 10.9f), it transforms to the final ordered phase ($\eta = 0.969$)
spontaneously.

It should be clearly noted that the order of the transition is determined at $T_t$. If
$T_t$ coincides with $T_{so}$, the transition is of the second order. The larger the
separation between these two temperatures, the more noticeable is the first-order nature
of the transition. Note that for a second-order transition, not only $T_{so}$ but also $T_{sd}$
merges into $T_t$.

In this connection, metastability of a single system is quite clear. The dis- 
ordered state at T = 1.80 and the ordered state at T = 1.94 are metastable states, 
which eventually transit to an ordered state and disordered state, respectively, 
with the aid of configurational fluctuations brought about by thermal energy. In 
fact, fluctuations are detected by diffuse scattering experiments and are predicted 
theoretically by Fourier transformation of the second-order derivative of the free 
energy with respect to the correlation functions (de Fontaine 1975; Mohri et al. 
1985b). By using the Hermitian properties of the second-order derivative matrix
of the free energy, one can diagonalize the matrix, which provides eigenvalues. 
The temperature at which one of the eigenvalues goes through zero is defined 
as the spinodal ordering temperature. For this, k-space formulation is essential 
and interested readers should consult the original articles (de Fontaine 1975; 
Mohri et al. 1985b). 
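
The relation among the three characteristic temperatures can be made concrete with a toy model. The sketch below assumes a sixth-order Landau polynomial, $f(\eta) = a(T - T_{so})\eta^2 - b\eta^4 + c\eta^6$, which is not the CVM free energy of Fig. 10.9; the coefficients `a`, `b`, `c` and all function names are hypothetical. It reproduces only the qualitative ordering $T_{so} < T_t < T_{sd}$ of a first-order transition.

```python
import math

# Toy Landau model (assumption): f = a*(T - T_so)*eta^2 - b*eta^4 + c*eta^6


def landau_f(eta, t, t_so=1.0, a=1.0, b=1.0, c=1.0):
    """Free energy relative to the disordered state (eta = 0)."""
    return a * (t - t_so) * eta**2 - b * eta**4 + c * eta**6


def characteristic_temperatures(t_so=1.0, a=1.0, b=1.0, c=1.0):
    """Closed-form (T_so, T_t, T_sd) of the toy model."""
    t_t = t_so + b**2 / (4.0 * a * c)   # both minima have equal free energy
    t_sd = t_so + b**2 / (3.0 * a * c)  # ordered minimum disappears
    return t_so, t_t, t_sd


def ordered_minimum(t, t_so=1.0, a=1.0, b=1.0, c=1.0):
    """Nonzero eta at the ordered minimum, or None above T_sd."""
    disc = b**2 - 3.0 * a * c * (t - t_so)
    if disc < 0.0:
        return None
    return math.sqrt((b + math.sqrt(disc)) / (3.0 * c))


def barrier_position(t, t_so=1.0, a=1.0, b=1.0, c=1.0):
    """eta at the free energy hump, or None when no hump exists."""
    disc = b**2 - 3.0 * a * c * (t - t_so)
    if t <= t_so or disc < 0.0:
        return None
    return math.sqrt((b - math.sqrt(disc)) / (3.0 * c))


t_so, t_t, t_sd = characteristic_temperatures()
print(t_so, t_t, t_sd)
print(barrier_position(0.9), barrier_position(1.1))
```

Below $T_{so}$ the hump is absent and ordering proceeds without a barrier, between $T_{so}$ and $T_{sd}$ a hump separates the two states, and at $T_t$ the two minima are degenerate, in line with the discussion of Fig. 10.9.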


10.2.6 
Atomistic Kinetics by the Path Probability Method 


When a system is in a nonequilibrium state, it transits toward an equilibrium 
state. How the system changes its state to the most stable one is a matter of the 
kinetics. Kikuchi devised the path probability method (PPM) (Kikuchi 1966) as a
natural extension of the cluster variation method to the time domain, and various 
calculations, including transition kinetics across a phase boundary, relaxation ki- 
netics within an ordered domain, steady-state kinetics under cyclic temperature 
variations, and fluctuation kinetics in the disordered states, were performed by 
the present author (Mohri 1990, 1992, 1993, 1994a,b, 1996, 1997, 1999, 2000a,b; 
Kikuchi et al. 1992; Mohri and Ikegami 1993a,b; Mohri et al. 1995, 1996a,b, 1997; 
Mohri and Miyagishima 1998). 

As has been mentioned, the PPM is an extension of the CVM to the time do- 
main, and there are correspondences between the kinetic variables of the PPM 
and the thermodynamic functions of the CVM. Corresponding to the cluster 
probabilities of the CVM are path variables in the PPM, which correlate the 
cluster probabilities at times $t$ and $t + \Delta t$. Equations (95-98) are examples of
the point and pair path variables in a disordered state, where $X_{i,j}(t, t + \Delta t)$
($Y_{ij,kl}(t, t + \Delta t)$) are point (pair) path variables that describe the configurational
transition on a lattice point (nearest-neighbor lattice points) from $i$ ($ij$) at time $t$
to $j$ ($kl$) at $t + \Delta t$.








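\[ x_1(t) = X_{1,1}(t, t+\Delta t) + X_{1,\bar{1}}(t, t+\Delta t) \tag{95} \]
\[ x_1(t+\Delta t) = X_{1,1}(t, t+\Delta t) + X_{\bar{1},1}(t, t+\Delta t) \tag{96} \]
\[ y_{11}(t) = Y_{11,11}(t, t+\Delta t) + Y_{11,1\bar{1}}(t, t+\Delta t) + Y_{11,\bar{1}1}(t, t+\Delta t) \tag{97} \]
\[ y_{11}(t+\Delta t) = Y_{11,11}(t, t+\Delta t) + Y_{1\bar{1},11}(t, t+\Delta t) + Y_{\bar{1}1,11}(t, t+\Delta t) \tag{98} \]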


For an ordered phase, the distinction of the sublattice is necessary. 

A counterpart of the grand potential (free energy) of the CVM is the path
probability function (PPF), $P$, in the PPM, which is given as the product of three
terms, $P_1$, $P_2$, and $P_3$. Each term for the disordered phase is given in Eqs. (99-101),
where $N$ is the number of atoms, $\theta$ is the spin flipping probability per unit
time, which corresponds to the diffusivity in an alloy system, and $\Delta E$ is the
change of the internal energy during $\Delta t$, given by Eq. (102).








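\[ P_1 = (\theta \Delta t)^{N X_{1,\bar{1}}} (\theta \Delta t)^{N X_{\bar{1},1}} (1 - \theta \Delta t)^{N X_{1,1}} (1 - \theta \Delta t)^{N X_{\bar{1},\bar{1}}} \tag{99} \]

\[ P_2 = \exp\left( -\frac{\Delta E}{2 k_B T} \right) \tag{100} \]

\[ P_3 = \frac{\left[ \prod_{ij,kl} (N Y_{ij,kl})! \right]^6 N!}{\left[ \prod_{i,j} (N X_{i,j})! \right]^5 \prod_{ijkl,mnop} (N W_{ijkl,mnop})!} \tag{101} \]

\[ \Delta E = \frac{1}{2} N Z \sum_{i,j} e_{ij} \{ y_{ij}(t+\Delta t) - y_{ij}(t) \} \tag{102} \]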
LJ 
$P_1$ describes a noncorrelated spin flipping process over the entire lattice, which
is the Markov process. $P_2$ is the conventional thermal activation process and
describes the probability of gaining thermal assistance from a heat reservoir. Finally,
$P_3$ most characterizes the PPM, and one may see the similarity with the entropy
formula of the CVM as given in Eqs. (69), (71), and (72). In fact, $P_3$ provides the
freedom of the microscopic path, while the CVM entropy describes the configurational
freedom. Like the CVM, the PPM forms a hierarchic structure and Eqs.
(99-101) in the tetrahedron approximation may be compared with the entropy
formula of the CVM given by Eq. (71). It is worth pointing out that the free energy
and/or its derivative are not explicitly considered in the path probability
function, in marked contrast with other kinetic theories.

The PPM is based on the variational principle and the most probable path of
the time evolution is determined so that the PPF, $P$, is maximized at each time
step through Eq. (103), where $P$ is the path probability function given by
$P = P_1 \cdot P_2 \cdot P_3$, and $\Xi_{ijk\ldots,lmn\ldots}$ denotes a path variable which describes the
transition from $ijk\ldots$ at time $t$ to $lmn\ldots$ at time $t + \Delta t$.


\[ \frac{\partial P}{\partial \Xi_{ijk\ldots,lmn\ldots}} = 0 \tag{103} \]

Figure 10.10 is a schematic illustration of the physical principles involved in
the PPM. We suppose there is a transition from some initial state to a final state
via various intermediate states. The energies of the initial and final states are
indicated as $E_{in}$ and $E_f$, respectively, while those of the intermediate states are
designated $E_1$, $E_2$, and $E_3$. From the initial to final states, there exist various paths
which are drawn as tubes in the figure. It should be noted that the energy
differences are the same, no matter which paths the system takes. Hence, the $P_2$ term
does not depend on the path. The $P_3$ term indicates the microscopic freedom of
the transition, which is implied by the width of the path or tube. It is natural for
the system to take a series of widest paths if the energy difference is the same.
The variational principle claimed in the PPM is therefore interpreted as the
selection of the quickest passage among the transition paths.

Once the PPF is maximized, the most probable path is determined by optimizing
the path variables $\{\Xi_{ijk\ldots,lmn\ldots}\}$. With these path variables, the cluster
probability at time $t + \Delta t$ can be uniquely determined. Such a relationship is
exemplified in the point and pair probabilities in the disordered phase according
to Eqs. (104) and (105), which are readily obtained from Eqs. (95-98).

Fig. 10.10 Schematic illustration of the transition paths from the
initial state $E_{in}$ to the final state $E_f$.





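\[ x_1(t+\Delta t) = x_1(t) + X_{\bar{1},1}(t, t+\Delta t) - X_{1,\bar{1}}(t, t+\Delta t) \tag{104} \]

\[ y_{11}(t+\Delta t) = y_{11}(t) + Y_{1\bar{1},11}(t, t+\Delta t) + Y_{\bar{1}1,11}(t, t+\Delta t) - Y_{11,1\bar{1}}(t, t+\Delta t) - Y_{11,\bar{1}1}(t, t+\Delta t) \tag{105} \]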


Once the initial equilibrium state is assigned by the CVM, the nonequilibrium 
time evolution processes of cluster probabilities are determined by the PPM. 
Moreover, it has been proved that the final steady state obtained in the PPM in 
the limit of t — oo agrees with the equilibrium state calculated independently by 
the CVM. 
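
The bookkeeping between path variables and cluster probabilities can be sketched in a few lines. The numerical values of the point path variables below are hypothetical and are not the result of a PPF maximization; the states are labeled `1` and `-1` (the latter standing for $\bar{1}$), and the function name is an assumption.

```python
def point_probabilities(path):
    """Marginals of point path variables X_{i,j}(t, t+dt).

    path[(i, j)] is the probability that a site is in state i at time t
    and in state j at time t+dt. Returns (x(t), x(t+dt)) as dictionaries.
    """
    states = sorted({s for pair in path for s in pair})
    before = {i: sum(path.get((i, j), 0.0) for j in states) for i in states}
    after = {j: sum(path.get((i, j), 0.0) for i in states) for j in states}
    return before, after


# Hypothetical path variables for a two-state spin system.
X = {(1, 1): 0.57, (1, -1): 0.03, (-1, 1): 0.05, (-1, -1): 0.35}
x_t, x_next = point_probabilities(X)

# Update of the point probability by gain and loss terms, cf. Eq. (104).
update = x_t[1] + X[(-1, 1)] - X[(1, -1)]
print(x_t[1], x_next[1], update)
```

The probability at $t + \Delta t$ obtained from the marginal sum agrees with the gain-minus-loss update, which is the content of Eq. (104).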

Finally, it should be noted that the formulation of the PPF depends entirely 
on the type of kinetics assumed in the study. The vacancy-mediated or exchange 
kinetics (Kawasaki dynamics; Kawasaki 1966) requires a large number of path 
variables, which makes numerical operations quite intractable. The spin flipping 
kinetics (Glauber dynamics; Glauber 1953), which is the type used in the above 
study, generally does not conserve the species with time; however, conservation 
is assured at 1:1 stoichiometric composition without imposition of any additional 
constraints. In this regard, the spin system approximates to an alloy system at 1:1 
stoichiometry. 

The PPM is a powerful tool for studying nonequilibrium transition kinetics. In
particular, not only near-equilibrium but also far-from-equilibrium transitions can
be studied, since the PPM does not explicitly deal with the free energy. However,
the number of variables is very large and the applicability of the PPM is mostly
limited to homogeneous systems.


10.3 
Statistical Thermodynamics on Continuous Media 


10.3.1 
Ginzburg-Landau Free Energy 


In the previous section, the free energy was described by taking into account the 
discrete nature of the crystal lattice. In this section, we ignore the atomistic pic- 
ture and focus on a continuous medium. Hence, the heterogeneity in the atom- 
istic scale is leveled out, and in this section an inhomogeneity is discussed on a 
larger scale in terms of field variables, $\{\eta_i\}$, including concentration and order
parameters. As was originally derived by Cahn et al. (Cahn and Hilliard 1958; Cahn
1961), the free energy is assumed to be written as Eq. (106), where we focus only
on a single field variable, $\eta$.


\[ f = f(\eta, \nabla\eta, \nabla^2\eta, \ldots) \tag{106} \]
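Then we attempt a Taylor expansion around the homogeneous system, $f_0$,
characterized by Eq. (107), which yields Eq. (108), in which we ignore derivatives of
order higher than two and terms beyond quadratic.


\[ f_0(\eta) = f(\eta, 0, 0, 0, \ldots) \tag{107} \]

\[ f = f(\eta, \nabla\eta, \nabla^2\eta, \ldots) \approx f_0(\eta) + \sum_i \left( \frac{\partial f}{\partial(\partial\eta/\partial x_i)} \right)_0 \frac{\partial\eta}{\partial x_i} + \sum_{i,j} \left( \frac{\partial f}{\partial(\partial^2\eta/\partial x_i \partial x_j)} \right)_0 \frac{\partial^2\eta}{\partial x_i \partial x_j} + \frac{1}{2} \sum_{i,j} \left( \frac{\partial^2 f}{\partial(\partial\eta/\partial x_i)\, \partial(\partial\eta/\partial x_j)} \right)_0 \frac{\partial\eta}{\partial x_i} \frac{\partial\eta}{\partial x_j} \tag{108} \]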
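For simplicity, we confine ourselves to an isotropic system, the symmetry
requirement of which eliminates the second term and cross-terms contained in the third
and fourth terms on the right-hand side. Equation (108) can then be simplified to
Eqs. (109-110), where $\kappa_1$ and $\kappa_2$ are given as in Eqs. (111) and (112), respectively.


\[ f = f(\eta, \nabla\eta, \nabla^2\eta, \ldots) \approx f_0(\eta) + \sum_i \left( \frac{\partial f}{\partial(\partial^2\eta/\partial x_i^2)} \right)_0 \frac{\partial^2\eta}{\partial x_i^2} + \frac{1}{2} \sum_i \left( \frac{\partial^2 f}{\partial(\partial\eta/\partial x_i)^2} \right)_0 \left( \frac{\partial\eta}{\partial x_i} \right)^2 \tag{109} \]

\[ = f_0(\eta) + \kappa_1 \nabla^2\eta + \kappa_2 (\nabla\eta)^2 \tag{110} \]

\[ \kappa_1 = \left( \frac{\partial f}{\partial \nabla^2\eta} \right)_0 \tag{111} \]

\[ \kappa_2 = \frac{1}{2} \left( \frac{\partial^2 f}{\partial |\nabla\eta|^2} \right)_0 \tag{112} \]

Hence, the free energy of the entire system is written as Eq. (113).

\[ F = \int_V f(\eta, \nabla\eta, \nabla^2\eta, \ldots)\, dV \approx \int_V \{ f_0(\eta) + \kappa_1 \nabla^2\eta + \kappa_2 (\nabla\eta)^2 \}\, dV \tag{113} \]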


According to Gauss's divergence theorem, Eq. (114) holds for a vector quantity
$\mathbf{F}$, where $\mathbf{n}$ is a unit normal vector on a surface element $dS$.


\[ \int_V \mathrm{div}\, \mathbf{F}\, dV = \int_S \mathbf{F} \cdot \mathbf{n}\, dS \tag{114} \]
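By noting that Eq. (115) applies, with the conventional assumption of vanishing
flux at a surface [Eq. (116)], one realizes that the surface integral in Eq. (115)
vanishes, which yields Eq. (117).

\[ \int_V \mathrm{div}(\kappa_1 \nabla\eta)\, dV = \int_V \left( \frac{d\kappa_1}{d\eta} (\nabla\eta)^2 + \kappa_1 \nabla^2\eta \right) dV = \int_S (\kappa_1 \nabla\eta) \cdot \mathbf{n}\, dS \tag{115} \]

\[ \nabla\eta |_S = 0 \tag{116} \]

\[ \int_V \kappa_1 \nabla^2\eta\, dV = -\int_V \frac{d\kappa_1}{d\eta} (\nabla\eta)^2\, dV \tag{117} \]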


By substituting this into Eq. (113), the final form of the free energy of the entire
system is derived as Eq. (118), where $\kappa$ is the gradient energy coefficient defined
as Eq. (119).

\[ F = \int_V \{ f_0(\eta) + \kappa (\nabla\eta)^2 \}\, dV \tag{118} \]

\[ \kappa = -\frac{d\kappa_1}{d\eta} + \kappa_2 \tag{119} \]


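For a free energy that is a function of several field variables, $f(\eta_1, \eta_2, \ldots) = f(\{\eta_i\})$,
one can show that the above free energy functional, Eq. (118), is further
generalized as Eq. (120).


\[ F = \int_V \left\{ f_0(\{\eta_i\}) + \sum_i \kappa_i (\nabla\eta_i)^2 \right\} dV \tag{120} \]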


This particular expression of the free energy functional is known as the 
Ginzburg-Landau type free energy. 


10.3.2 
Diffusion Equation and Time-Dependent Ginzburg-Landau Equation 


Except for a highly nonequilibrium process for which a free energy itself is not
suitably defined, a relaxation process can be characterized as the continuous
decrease of the free energy with time [Eq. (121)].


\[ \frac{dF}{dt} \le 0 \tag{121} \]

By applying Eq. (121) to Eq. (118), one obtains Eq. (122).


\[ \frac{dF}{dt} = \int_V \frac{\delta F}{\delta \eta} \frac{\partial \eta}{\partial t}\, dV \le 0 \tag{122} \]
At this stage, one should distinguish between conservative and nonconservative
field variables. Concentration, $c$, is a typical example of the former, and the
LRO and SRO parameters are typical examples of the latter. In the following
discussion, we explicitly employ $c$ for the conservative variables, while $\eta$ ($\{\eta_i\}$)
is limited to nonconservative variables.

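It is soon realized that a sufficient condition for Eq. (122) to hold is given by
Eq. (123), where $L$ is a positive constant, since the integrand then becomes
$-L (\delta F / \delta \eta)^2 \le 0$.


\[ \frac{\partial \eta}{\partial t} = -L \frac{\delta F}{\delta \eta} \tag{123} \]
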

This is generalized to a free energy given as a function of several nonconservative
variables as Eq. (124).


\[ \frac{\partial \eta_i}{\partial t} = -\sum_j L_{ij} \frac{\delta F}{\delta \eta_j} \tag{124} \]


Corresponding to Eq. (122), when the concentration $c$ is considered as a conservative
variable, the time derivative of the free energy is explicitly given by Eq. (125).


\[ \frac{dF}{dt} = \int_V \frac{\delta F}{\delta c} \frac{\partial c}{\partial t}\, dV \le 0 \tag{125} \]

Additionally, the conservative quantity should obey the continuity equation (126),
where $\mathbf{J}$ is the flux.

\[ \frac{\partial c}{\partial t} = -\nabla \cdot \mathbf{J} \tag{126} \]


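By substituting Eq. (126) into Eq. (125), one obtains Eq. (127), for which Gauss's
divergence theorem is applied with the boundary condition of the vanishing flux
at the surface, as we did in Eqs. (114-116).

\[ \frac{dF}{dt} = -\int_V \frac{\delta F}{\delta c} \nabla \cdot \mathbf{J}\, dV = -\int_S \frac{\delta F}{\delta c} \mathbf{J} \cdot \mathbf{n}\, dS + \int_V \left( \nabla \frac{\delta F}{\delta c} \right) \cdot \mathbf{J}\, dV = \int_V \left( \nabla \frac{\delta F}{\delta c} \right) \cdot \mathbf{J}\, dV \le 0 \tag{127} \]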


Again, one realizes that a sufficient condition for Eq. (127) to hold is that the
integrand is negative everywhere, which is satisfied by Eq. (128), where $M$ is a
positive quantity.


\[ \mathbf{J} = -M \nabla \frac{\delta F}{\delta c} \tag{128} \]
Substitution of Eq. (128) into Eq. (126) yields Eq. (129).


\[ \frac{\partial c}{\partial t} = \nabla \cdot \left( M \nabla \frac{\delta F}{\delta c} \right) \tag{129} \]


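According to the Euler-Lagrange equation, it is shown that Eq. (130) holds,
when $R$ is given by Eq. (131).


\[ \delta R = \int \left( \frac{\partial I}{\partial y} - \frac{d}{dx} \left( \frac{\partial I}{\partial y'} \right) \right) \delta y\, dx \tag{130} \]

\[ R = \int I(x, y, y')\, dx \tag{131} \]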


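By comparing Eqs. (118) and (131), one can easily identify the correspondences,
$y \leftrightarrow c\ (= \eta)$ and $y' \leftrightarrow \nabla c\ (= \nabla\eta)$, and derive Eq. (132), where $I$ is given by Eq.
(133) and $\kappa$ is taken to be constant.

\[ \delta F = \int_V \left( \frac{\partial I}{\partial c} - \nabla \cdot \frac{\partial I}{\partial \nabla c} \right) \delta c\, dV = \int_V \left( \frac{\partial f_0}{\partial c} - 2\kappa \nabla^2 c \right) \delta c\, dV \tag{132} \]

\[ I(\mathbf{r}, c, \nabla c) = f_0(c) + \kappa (\nabla c)^2 \tag{133} \]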





By substituting Eq. (132) into Eq. (129), the kinetic equation for composition, 
Eq. (134), is obtained. This is the celebrated Cahn-Hilliard equation (Cahn and 
Hilliard 1958; Cahn 1961). 


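\[ \frac{\partial c}{\partial t} = \nabla \cdot \left[ M \nabla \left( \frac{\partial f_0}{\partial c} - 2\kappa \nabla^2 c \right) \right] \tag{134} \]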
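
A minimal one-dimensional sketch of Eq. (134) may help to see why the concentration is conserved. It assumes a constant mobility, periodic boundaries, an explicit Euler time step, and a hypothetical double-well $f_0(c) = c^2 (1 - c)^2$; none of these choices is taken from the original work, and the parameter values are arbitrary.

```python
import math


def laplacian(u, dx):
    """Second difference on a periodic 1D grid (u[-1] wraps around)."""
    n = len(u)
    return [(u[(i + 1) % n] - 2.0 * u[i] + u[i - 1]) / dx**2 for i in range(n)]


def df0_dc(c):
    """Derivative of the assumed double well f0(c) = c^2 (1 - c)^2."""
    return 2.0 * c * (1.0 - c) * (1.0 - 2.0 * c)


def cahn_hilliard_step(c, dt, dx, mobility, kappa):
    """One explicit Euler step of dc/dt = M lap(df0/dc - 2 kappa lap c)."""
    mu = [df0_dc(ci) - 2.0 * kappa * li for ci, li in zip(c, laplacian(c, dx))]
    return [ci + dt * mobility * li for ci, li in zip(c, laplacian(mu, dx))]


n, dx, dt, mobility, kappa = 64, 1.0, 0.01, 1.0, 0.5
c = [0.5 + 0.01 * math.sin(2.0 * math.pi * i / n) for i in range(n)]
mean_before = sum(c) / n
for _ in range(500):
    c = cahn_hilliard_step(c, dt, dx, mobility, kappa)
mean_after = sum(c) / n
print(mean_before, mean_after)
```

Because the right-hand side is the divergence of a flux, the mean concentration is unchanged by the time stepping up to rounding error, whatever the local evolution is.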


By noting that Eqs. (132) and (133) also hold for nonconservative variables
on exchanging $c$ with $\eta$, substitution of Eq. (132) into Eq. (124) yields Eq. (135),
which is known as the time-dependent Ginzburg-Landau (TDGL) equation
(Fan and Chen 1997) or the Allen-Cahn equation (Allen and Cahn 1979).


\[ \frac{\partial \eta_i}{\partial t} = -\sum_j L_{ij} \left( \frac{\partial f_0}{\partial \eta_j} - 2\kappa_j \nabla^2 \eta_j \right) \tag{135} \]


We note that these two equations, (134) and (135), couple through the free en- 
ergy, fo, of the homogeneous system, and these simultaneous equations have 
been often termed phase field equations to describe versatile temporal micro- 
structural evolution processes. 
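
As a hedged illustration of Eqs. (118), (121), and (135), the sketch below integrates the TDGL equation for a single nonconserved field on a periodic one-dimensional grid and records the discrete free energy at every step. The double-well $f_0(\eta) = (1 - \eta^2)^2$, the explicit Euler scheme, and all parameter values are assumptions made for this example only.

```python
import math


def f0(eta):
    """Hypothetical double-well free energy with minima at eta = +1 and -1."""
    return (1.0 - eta**2) ** 2


def df0_deta(eta):
    """Derivative of the assumed f0."""
    return -4.0 * eta * (1.0 - eta**2)


def free_energy(eta, dx, kappa):
    """Discrete form of Eq. (118) on a periodic 1D grid."""
    n = len(eta)
    return sum(
        (f0(eta[i]) + kappa * ((eta[(i + 1) % n] - eta[i]) / dx) ** 2) * dx
        for i in range(n)
    )


def tdgl_step(eta, dt, dx, mobility_l, kappa):
    """One explicit Euler step of Eq. (135) for a single field."""
    n = len(eta)
    new_eta = []
    for i in range(n):
        lap = (eta[(i + 1) % n] - 2.0 * eta[i] + eta[i - 1]) / dx**2
        rate = -mobility_l * (df0_deta(eta[i]) - 2.0 * kappa * lap)
        new_eta.append(eta[i] + dt * rate)
    return new_eta


n, dx, dt, kappa, mobility_l = 64, 1.0, 0.05, 1.0, 1.0
eta = [0.1 * math.sin(2.0 * math.pi * i / n) for i in range(n)]
energies = [free_energy(eta, dx, kappa)]
for _ in range(200):
    eta = tdgl_step(eta, dt, dx, mobility_l, kappa)
    energies.append(free_energy(eta, dx, kappa))
print(energies[0], energies[-1])
```

For a sufficiently small time step the recorded free energy does not increase from one step to the next, which is the discrete counterpart of Eq. (121). Unlike the conserved case, the spatial average of $\eta$ is free to change.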
10.3.3
Width of an Interface 


An inhomogeneous system may be characterized by the existence of an interface; 
an antiphase boundary (APB) is considered in this chapter as an example. It is 
shown below that the width of the APB is derived as one of the static properties 
of an interface. It should be noted that, with a slight modification, the energetics 
involved in the following arguments can be generalized to other kinds of inter- 
face. 

The APB is formed upon collision of two ordered domains (antiphase domains,
abbreviated to APDs) with opposite phases of atomic arrangement, characterized
by opposite signs of the LRO parameter, $\pm\eta$. An APB is regarded as
the region in which the phase is reversed from $+\eta$ to $-\eta$ through $\eta = 0$, which
may be realized at the center of the APB. A typical variation of the LRO parameter
along a direction with coordinate $g$ is indicated in Fig. 10.11 (Ohno 2004),
where the center of the APB is located at $g = 0$. It should be noted that with the
variation of the LRO parameter, the free energy $f$ changes: a minimum value
is attained at the equilibrium LRO, $\pm\eta_e$, while the maximum value is reached at $\eta = 0$,
which may correspond to a disordered state.

We consider a straight APB normal to the direction $g$ which extends in the $x$
direction. Let $\Delta f_0(\eta)$ be the free energy with reference to the minimum free energy
attained at $\eta = \pm\eta_e$. Then, the excess free energy, $\sigma_{APB}$, due to the existence
of the APB, which separates two ordered domains, is written as Eq. (136).


\[ \sigma_{APB} = \int_{-\infty}^{+\infty} \left\{ \Delta f_0(\eta) + \kappa \left( \frac{d\eta}{dg} \right)^2 \right\} dg \tag{136} \]


The equilibrium shape of the boundary is determined so that $\sigma_{APB}$ is minimized,
and the Euler-Lagrange equation applied to Eq. (136) yields Eq. (137),
which leads to Eq. (138).


\[ \frac{\partial \Delta f_0(\eta)}{\partial \eta} - 2\kappa \frac{d^2\eta}{dg^2} = 0 \tag{137} \]

\[ \Delta f_0(\eta) = \kappa \left( \frac{d\eta}{dg} \right)^2 \tag{138} \]

Fig. 10.11 Profile of the LRO parameter normal to
the antiphase boundary (Ohno 2004).
By substituting Eq. (138) into Eq. (136), one obtains Eq. (139).


\[ \sigma_{APB} = \int_{-\infty}^{+\infty} 2\kappa \left( \frac{d\eta}{dg} \right)^2 dg \tag{139} \]


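Since the equilibrium values of the LRO parameter, $+\eta_e$ and $-\eta_e$, are attained at
infinite distance ($\pm\infty$) from the interface, the integrand and the range of
integration can be converted as in Eq. (140).


\[ \sigma_{APB} = \int_{-\infty}^{+\infty} 2\kappa \left( \frac{d\eta}{dg} \right)^2 dg = 2 \int_{-\eta_e}^{+\eta_e} \kappa \left( \frac{d\eta}{dg} \right) d\eta \tag{140} \]

The substitution of Eq. (138) yields Eq. (141).


\[ \sigma_{APB} = 2 \int_{-\eta_e}^{+\eta_e} \sqrt{\kappa\, \Delta f_0(\eta)}\, d\eta \tag{141} \]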


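To determine the equilibrium width of the APB, one needs a detailed expression
for $\Delta f_0(\eta)$. As an example (Ohno 2004), we consider a double well potential
given by Eq. (142).


\[ \Delta f_0(\eta) = \Delta f_{max} \left( 1 - \frac{\eta}{\eta_e} \right)^2 \left( 1 + \frac{\eta}{\eta_e} \right)^2 \tag{142} \]

This potential is symmetric around $\eta = 0$; it takes a minimum, 0, at $+\eta_e$ and $-\eta_e$,
and a maximum value, $\Delta f_{max}$, is attained at $\eta = 0$. By substituting this into Eq.
(138), one can readily obtain a profile of the LRO parameter as a function of $g$ in
the form of Eq. (143).


\[ \eta(g) = \eta_e \tanh\left( \frac{1}{\eta_e} \sqrt{\frac{\Delta f_{max}}{\kappa}}\, g \right) \tag{143} \]

This is linearized in the vicinity of $g = 0$ as Eq. (144).


\[ \eta(g) \approx \sqrt{\frac{\Delta f_{max}}{\kappa}}\, g \tag{144} \]

Hence, the width of the interface, taken as the distance over which the linearized
profile runs from $-\eta_e$ to $+\eta_e$, is obtained as Eq. (145).


\[ l_{APB} = 2\eta_e \sqrt{\frac{\kappa}{\Delta f_{max}}} \tag{145} \]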

It is understood that the width of the APB is determined by the gradient energy
coefficient $\kappa$ and the free energy difference $\Delta f_{max}$ between disordered and ordered
phases. The system tends to confine the APB in a smaller region for a larger
$\Delta f_{max}$, while the bigger $\kappa$ is, the more widely the APB region is extended, so
that the gradient energy per unit length becomes smaller.
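
The results of this section can be checked numerically with a short sketch. It evaluates the double-well potential of Eq. (142), the hyperbolic tangent profile of Eq. (143), the width of Eq. (145), and the excess energy of Eq. (141) by a midpoint rule. The parameter values and function names are hypothetical and carry no physical meaning.

```python
import math


def delta_f0(eta, df_max, eta_e):
    """Double-well potential, Eq. (142)."""
    return df_max * (1.0 - eta / eta_e) ** 2 * (1.0 + eta / eta_e) ** 2


def apb_profile(g, df_max, eta_e, kappa):
    """LRO profile across the boundary, Eq. (143)."""
    return eta_e * math.tanh(math.sqrt(df_max / kappa) * g / eta_e)


def apb_width(df_max, eta_e, kappa):
    """Interface width from the linearized profile, Eq. (145)."""
    return 2.0 * eta_e * math.sqrt(kappa / df_max)


def apb_energy(df_max, eta_e, kappa, n=20000):
    """Excess free energy, Eq. (141), by the midpoint rule."""
    h = 2.0 * eta_e / n
    total = 0.0
    for k in range(n):
        eta = -eta_e + (k + 0.5) * h
        total += math.sqrt(kappa * delta_f0(eta, df_max, eta_e)) * h
    return 2.0 * total


# Hypothetical parameters in arbitrary units.
df_max, eta_e, kappa = 2.0, 0.9, 0.5

# Check Eq. (138) at one point with a central-difference derivative.
g, h = 0.3, 1e-5
slope = (apb_profile(g + h, df_max, eta_e, kappa)
         - apb_profile(g - h, df_max, eta_e, kappa)) / (2.0 * h)
lhs = delta_f0(apb_profile(g, df_max, eta_e, kappa), df_max, eta_e)
rhs = kappa * slope**2
print(lhs, rhs, apb_width(df_max, eta_e, kappa), apb_energy(df_max, eta_e, kappa))
```

For this particular potential the integral in Eq. (141) can also be done by hand and gives $(8/3)\, \eta_e \sqrt{\kappa\, \Delta f_{max}}$, which the quadrature reproduces.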


10.3.4 
Interface Velocity 


In Section 10.3.3, we studied the width of an interface as one of the static
properties. In this section, the interface motion driven by a curvature is investigated
(Ohno 2004) as a dynamic property. First, we consider a noncurved APB in the
steady state, for which the TDGL equation is given by Eq. (146).


\[ \frac{\partial \eta}{\partial t} = -L \left( \frac{\partial f_0}{\partial \eta} - 2\kappa \frac{\partial^2 \eta}{\partial g^2} \right) = 0 \tag{146} \]




Next, we consider a curved APB shown in Fig. 10.12. With the assumption that
the LRO parameter $\eta$ changes only in the direction of $g$ and not in the tangential
direction $t$, one obtains Eq. (147), where $\mathbf{n}$ is a unit normal vector to the APB.


\[ \nabla\eta = \frac{\partial \eta}{\partial g}\, \mathbf{n} \tag{147} \]

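Then, the Laplacian of $\eta$, $\nabla^2\eta$, is transformed as in Eq. (148), and the TDGL
equation is rewritten as Eq. (149), where $H$ is the mean curvature, which is equivalent
to $\nabla \cdot \mathbf{n}$ in the $-g$ direction.


\[ \nabla^2\eta = \nabla \cdot \nabla\eta = \frac{\partial^2 \eta}{\partial g^2} + H \frac{\partial \eta}{\partial g} \tag{148} \]

\[ \frac{\partial \eta}{\partial t} = -L \left( \frac{\partial f_0}{\partial \eta} - 2\kappa \frac{\partial^2 \eta}{\partial g^2} - 2\kappa H \frac{\partial \eta}{\partial g} \right) \tag{149} \]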

By noting Eq. (146), Eq. (149) is simplified to Eq. (150).


\[ \frac{\partial \eta}{\partial t} = 2\kappa L H \frac{\partial \eta}{\partial g} \tag{150} \]


Fig. 10.12 A curved antiphase boundary.

Fig. 10.13 Time evolution of a circular antiphase boundary (Ohno 2004).

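For a contour curve of the LRO parameter $\eta$, one obtains Eq. (151), and therefore
the velocity, $v_{APB}$, of the APB is given by Eq. (152).


\[ d\eta = \left( \frac{\partial \eta}{\partial g} \right)_t dg + \left( \frac{\partial \eta}{\partial t} \right)_g dt = 0 \tag{151} \]

\[ v_{APB} = \left( \frac{dg}{dt} \right)_{\eta} = -\left( \frac{\partial \eta}{\partial t} \right)_g \Big/ \left( \frac{\partial \eta}{\partial g} \right)_t \tag{152} \]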


Taking Eq. (152) together with Eq. (150), the final expression of the velocity of
the APB is obtained as Eq. (153).


\[ v_{APB} = -2\kappa L H \tag{153} \]


This is the typical case of an interface velocity which is driven by a curvature. 

As a simple example, one can consider a circular interface with the radius $R$.
Since the mean curvature of a circle is $2/R$, the velocity is given by Eq. (154),
integration of which yields Eq. (155), where $R_0$ is the initial radius of the circle.


\[ v_{APB} = \frac{dR}{dt} = -4\kappa L \frac{1}{R} \tag{154} \]

\[ R(t)^2 - R_0^2 = -8\kappa L t \tag{155} \]


It is indicated that the circular interface shrinks with time as shown in Fig. 10.13 
(Ohno and Mohri 2002; Ohno 2004). 
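
The shrinkage law can be verified with a short sketch that integrates Eq. (154) by the forward Euler method and compares the result with the closed form of Eq. (155). The parameter values and function names are hypothetical; only the two equations themselves are taken from the text.

```python
import math


def radius_closed_form(t, r0, kappa, mobility_l):
    """R(t) from Eq. (155); returns 0 once the circle has collapsed."""
    r_squared = r0**2 - 8.0 * kappa * mobility_l * t
    return math.sqrt(r_squared) if r_squared > 0.0 else 0.0


def radius_integrated(t_end, r0, kappa, mobility_l, steps=20000):
    """Forward Euler integration of Eq. (154), dR/dt = -4 kappa L / R."""
    dt = t_end / steps
    r = r0
    for _ in range(steps):
        r += dt * (-4.0 * kappa * mobility_l / r)
    return r


r0, kappa, mobility_l = 10.0, 1.0, 1.0
collapse_time = r0**2 / (8.0 * kappa * mobility_l)
print(radius_closed_form(5.0, r0, kappa, mobility_l))
print(radius_integrated(5.0, r0, kappa, mobility_l))
print(collapse_time)
```

The square of the radius decreases linearly with time, so the domain vanishes after a finite time proportional to the square of its initial radius.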


10.4 
Model Calculations 


One of the major trends in computational materials science is toward performing 
multiscale calculations from atomistic to macroscopic scales in a consistent man- 
ner. In Section 10.2, atomistic approaches to configurational thermodynamics 
and kinetics were demonstrated, while in Section 10.3 basic formulae of micro- 
structural statics and dynamics were derived. It is a natural development to 
combine these two studies to attempt multiscale calculations from atomistic
to microstructural scales. In this section, as a summary of the present chapter, 
we introduce the author’s recent work (Ohno and Mohri 2001a,b, 2002, 2003, 
2007; Chen et al. 2002a,b; Mohri and Chen 2002, 2004a,b,c; Mohri et al. 
2005a,b, 2006; Mohri 2005) on multiscale calculation covering phase stability, 
phase equilibria, and phase transition by exemplifying three kinds of Fe-based 
binary alloy systems. Numerous contributions to the subjects dealt with in Sec- 
tions 10.4.1 and 10.4.2 have been achieved by Zunger and his collaborators 
(Mbaye et al. 1987; Wei et al. 1987; Zunger et al. 1988; Ferreira et al. 1989, 1999; 
Bernard and Zunger 1990; Wei et al. 1990, 1992, 1993; Lu et al. 1991, 1995; Laks 
et al. 1992; Wolverton et al. 1994; Wolverton and Zunger 1997; Ozolins et al. 
1998) and Vaks and his collaborators (Vaks and Samolyuk 1999; Belashchenko
et al. 1999a,b; Pankratov and Vaks 2001; Dobretsov et al. 2004, 2005; Khromov et 
al. 2005), respectively. 


10.4.1 
Calculation of a Phase Diagram 


10.4.1.1 Ground-State Analysis 

An example of the ground-state analysis within the first and second nearest- 
neighbor pair interaction energies was introduced in Section 10.2.2. It was men- 
tioned that the most stable ordered phases at 1:1 stoichiometry were determined 
in the region specified by the magnitude and sign of the pair interaction energies. 
Such a ground-state analysis provides sufficient conditions for an ordered phase 
to be stabilized with specified pair interaction energies. However, for an actual 
system, there is no guarantee that multibody interaction energies can be ne- 
glected or distant pair interaction energies fade away. Moreover, effective interac- 
tion energies are not constant but depend on concentration, atomic distance, and 
atomic configurations. Therefore, examination of the ground-state total energies 
of possible ordered phases is indispensable. Examples are demonstrated below 
for three kinds of Fe-based alloy systems: Fe-Ni, Fe-Pd, and Fe-Pt. 

Ni, Pd, and Pt belong to the column VIII transition elements in the periodic
table. When each element is alloyed with Fe, a quite systematic behavior of phase
equilibria is observed (Massalski 1986). For the Fe-Ni system, only one ordered
phase appears around 1:3 stoichiometry (FeNi$_3$; L1$_2$), while for the Fe-Pd system
an additional ordered phase is found in the vicinity of 1:1 stoichiometric
composition (FePd; L1$_0$). Finally, for the Fe-Pt system, a third ordered phase appears
near 3:1 stoichiometry (Fe$_3$Pt; L1$_2$).

Based on the FLAPW (full-potential linear augmented plane wave) method
(Jansen and Freeman 1984) within the generalized gradient approximation
(GGA; Perdew et al. 1992), first-principles total electronic structure calculations
were performed on a series of ordered compounds, Fe$_{4-n}$X$_n$, at 1:3 ($n = 3$), 1:1
($n = 2$), and 3:1 ($n = 1$) for each Fe-X system (X = Ni, Pd, and Pt) as a function
of atomic distance $r$ (Chen et al. 2002a,b; Mohri and Chen 2002, 2004a,b,c). In
view of the fact that the most stable ground state of Fe is not fcc but bcc, the
total energy of bcc Fe has also been calculated. Then, the heat of formation for
each compound was obtained according to Eq. (156), where $r_{Fe}$ and $r_X$ are the
equilibrium atomic distances of ferromagnetic bcc-based Fe and fcc X, which are
the ground states of each species.

Fig. 10.14 Atomic arrangement of an L1$_0$ ordered phase.
The c/a ratio is generally not unity, due to tetragonal
distortion.


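\[ \Delta E^{(n)}(r) = E_{\mathrm{Fe}_{4-n}\mathrm{X}_n}(r) - \frac{4-n}{4} E_{\mathrm{Fe}}(r_{\mathrm{Fe}}) - \frac{n}{4} E_{\mathrm{X}}(r_{\mathrm{X}}) \tag{156} \]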

It is understood that the energy reference state assumed in Eq. (156) is the
segregation limit. In view of the experimental results in the phase diagram described
above, it is natural to adopt the L1$_2$ structure for the 1:3 and 3:1 compounds and the
L1$_0$ structure for the 1:1 compound for the total energy calculations. Also, in order to
examine the effects of magnetism, both spin-polarized and nonpolarized calculations
were performed.
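
The bookkeeping of Eq. (156) is sketched below. The sketch assumes that all total energies are expressed per atom, and the numerical values are invented for illustration; they are not calculated energies of any Fe-X compound.

```python
def heat_of_formation(e_compound, e_fe, e_x, n):
    """Heat of formation of Fe_(4-n)X_n relative to the segregation limit.

    All energies are assumed to be per atom. e_fe and e_x are the total
    energies of the elemental ground states at their equilibrium atomic
    distances; n = 0, 1, 2, 3, 4 counts the X atoms out of four.
    """
    if not 0 <= n <= 4:
        raise ValueError("n must lie between 0 and 4")
    return e_compound - (4 - n) / 4 * e_fe - n / 4 * e_x


# Hypothetical per-atom energies in arbitrary units.
e_fe, e_x = -1.000, -2.000
delta_e = heat_of_formation(-1.510, e_fe, e_x, n=2)
print(delta_e)
```

A negative value means that the compound is stable against segregation into the elemental ground states, while a positive value means it is not.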

The L1$_0$ ordered phase is generally characterized by a layered structure in one
of the [100] directions, which induces a tetragonal distortion as shown in Fig.
10.14. Shown in Fig. 10.15 are the heats of formation of L1$_0$ ordered compounds
for Fe-Pd and Fe-Pt, respectively, as a function of the c/a ratio. These are
obtained by the spin-polarized calculation. In both systems the energy minimum
appears not at c/a = 1, but at 0.99 for Fe-Pd and 0.97 for Fe-Pt, indicating that
the most stable state of L1$_0$ ordered phases is not a cubic but a tetragonal
structure. In fact, the experimental observation suggests that the equilibrium values
of c/a are 0.97 and 0.96, respectively, for Fe-Pd and Fe-Pt, with which these
ground-state energy calculations agree quite well. However, in view of the fact
that the c/a ratios are quite close to unity for both systems, we ignore the
tetragonality in the following calculations and assume a fully cubic structure.

Fig. 10.15 Total energy of the L1$_0$ ordered phase of (a) Fe-Pd (Chen
et al. 2002a; Mohri and Chen 2004b) and (b) Fe-Pt (Mohri and Chen
2004b) as a function of c/a.

Figure 10.16 shows the heats of formation $\Delta E^{(n)}$ of the aforementioned five
phases (Mohri 2005): Fe with fcc structure ($n = 0$), Fe$_3$X with L1$_2$ ($n = 1$), FeX
with L1$_0$ ($n = 2$), FeX$_3$ with L1$_2$ ($n = 3$), and X with fcc structure ($n = 4$)
obtained from spin-polarized calculations for the Fe-Ni, -Pd, and -Pt systems,
respectively. Since the ground state of Fe is bcc with a ferromagnetic state, the heat
of formation of bcc Fe is calculated and drawn with a broken line. The lattice
constant of bcc Fe (5.36 a.u.) is quite small as compared with other fcc-based phases
of interest. In order to facilitate comparison, we converted it to an fcc-equivalent
lattice constant, keeping the atomic volume.

Fig. 10.16 Heats of formation of fcc Fe, bcc Fe (ferro), Fe$_3$X L1$_2$, FeX L1$_0$,
FeX L1$_1$, FeX$_3$ L1$_2$, and fcc X, where X is Ni, Pd, or Pt. The energy reference
state is the segregation limit, defined as the concentration average of the
total energy of bcc Fe and fcc X. For FePt$_3$, two results are obtained. One
accounts for the ferromagnetic state and the other for the antiferromagnetic
state. (a) Fe-Ni (Mohri and Chen 2004b); (b) Fe-Pd (Chen et al. 2002a; Mohri
and Chen 2004b); (c) Fe-Pt (Mohri and Chen 2002, 2004b). [Vertical axes:
$\Delta E$ (Ry/atom).]
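
The conversion to an fcc-equivalent lattice constant follows from equating the volume per atom of the two structures: a bcc cell of edge $a$ holds two atoms and an fcc cell holds four. The sketch below assumes nothing beyond this geometric fact; the function name is hypothetical.

```python
def fcc_equivalent_lattice_constant(a_bcc):
    """fcc lattice constant with the same volume per atom as bcc.

    bcc: a_bcc^3 / 2 per atom; fcc: a_fcc^3 / 4 per atom,
    hence a_fcc = 2^(1/3) * a_bcc.
    """
    return 2.0 ** (1.0 / 3.0) * a_bcc


a_bcc = 5.36  # a.u., value quoted in the text for bcc Fe
a_fcc = fcc_equivalent_lattice_constant(a_bcc)
print(round(a_fcc, 3))
```

For the quoted value of 5.36 a.u. this gives roughly 6.75 a.u., which places bcc Fe on the same horizontal scale as the fcc-based phases.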

For the Fe-Pd system, the heats of formation for FePd (L1$_0$) and FePd$_3$ (L1$_2$)
are negative while that of Fe$_3$Pd (L1$_2$) is positive, which implies that the former
two phases may appear in the phase diagram as stable phases while the latter is
absent. This is realized exactly in the phase diagram. For the Fe-Pt
system, the heats of formation of all three ordered phases are negative, which is
again in good agreement with the phase diagram. Finally for the Fe-Ni system,
both Fe$_3$Ni (L1$_2$) and FeNi (L1$_0$) have negative values. In the phase diagram, however,
the latter phase is missing. This contradiction is discussed later.

Two additional features should be noted. The first is that the dot-dashed line in
each figure indicates the heat of formation of the L1₁ ordered phase at the 1:1
stoichiometric composition. Comparison with the L1₀ ordered phase confirms that
the heat of formation of the L1₁ phase is larger, so that the L1₀ ordered phase is
more stable. This should be emphasized particularly for the Fe-Pd and Fe-Pt systems.
These additional calculations for the L1₁ ordered phase are used to extract the
second nearest-neighbor effective pair interaction energies through the cluster
expansion method (Connolly and Williams 1983), as shown below.

The second feature is that for FePt₃ (L1₂) two kinds of magnetic states are
considered, the ferromagnetic and antiferromagnetic states, which are designated by
fm and af in the figure, respectively. One can see that the magnetic ground state is
not ferromagnetic but antiferromagnetic. This is in agreement with the experimental
result (Bacon and Crangle 1963), and one will see that this tiny difference in
magnetic energy, about 3 mRy atom⁻¹, significantly affects the resultant transition
temperature of the L1₀ ordered phase.


10.4.1.2 Effective Cluster Interaction Energy 

The total energy provides a measure of the relative stability of a phase of interest,
while atomic interaction energies are required for the calculation of phase
equilibria. According to the cluster expansion method (Connolly and Williams
1983) introduced in Eq. (43), a set of effective cluster interaction energies can be
obtained from the total energies of ordered phases by matrix inversion. In this
section we demonstrate how the effective cluster interaction energies for the
Fe-based ordered phases can be extracted from the total energies obtained in Section
10.4.1.1. It should be noted that the two terms, total energy and heat of formation,
are generally employed interchangeably. For the application of the CEM in
Eq. (43) to obtain effective cluster interaction energies, however, one needs to make
a clear distinction, and the total energy E should be replaced by the heat of
formation, $\Delta E$, in Eq. (156). Furthermore, since the heats of formation $\Delta E$ in
Section 10.4.1.1 are obtained as a function of atomic distance r, the present CEM is
rewritten as Eq. (157).

$$\Delta E(r) = \sum_{m=0}^{M} v_m(r)\,\xi_m \tag{157}$$

It should be noted that the correlation function depends on both the type of
cluster, m, and the phase, n, while the effective cluster interaction energy is
determined once the type of cluster is specified. Hence, Eq. (157) is modified to
Eq. (158).

$$\Delta E^{(n)}(r) = \sum_{m=0}^{M} v_m(r)\,\xi_m^{(n)} \tag{158}$$


For each Fe-X system, the number of fcc-based phases for which the total energies
are calculated in the present study is six, including both the L1₀ and L1₁ ordered
phases at 1:1 stoichiometry. However, we first include only the L1₀ ordered phase
at 1:1 and the L1₂ ordered phases at 1:3 and 3:1 stoichiometries in the CEM. Hence,
both $\{\Delta E^{(n)}\}$ and $\{v_m\}$ constitute 5 × 1 vectors and $\{\xi_m^{(n)}\}$ represents a
5 × 5 matrix (M = 4). The explicit form of Eq. (158) is Eq. (159), and the effective
cluster interaction energies are obtained as Eq. (160), or (161), or (162).


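$$
\begin{pmatrix}
\Delta E^{\mathrm{Fe}} \\ \Delta E^{\mathrm{Fe_3X}} \\ \Delta E^{\mathrm{FeX}} \\ \Delta E^{\mathrm{FeX_3}} \\ \Delta E^{\mathrm{X}}
\end{pmatrix}
=
\begin{pmatrix}
1 & 1 & 1 & 1 & 1 \\
1 & 1/2 & 0 & -1/2 & -1 \\
1 & 0 & -1/3 & 0 & 1 \\
1 & -1/2 & 0 & 1/2 & -1 \\
1 & -1 & 1 & -1 & 1
\end{pmatrix}
\cdot
\begin{pmatrix}
v_0 \\ v_1 \\ v_{2,1} \\ v_3 \\ v_4
\end{pmatrix}
\tag{159}
$$

$$v_m(r) = \sum_{n} \{\xi_m^{(n)}\}^{-1}\,\Delta E^{(n)}(r) \tag{160}$$

$$
\begin{pmatrix}
v_0 \\ v_1 \\ v_{2,1} \\ v_3 \\ v_4
\end{pmatrix}
=
\begin{pmatrix}
1 & 1 & 1 & 1 & 1 \\
1 & 1/2 & 0 & -1/2 & -1 \\
1 & 0 & -1/3 & 0 & 1 \\
1 & -1/2 & 0 & 1/2 & -1 \\
1 & -1 & 1 & -1 & 1
\end{pmatrix}^{-1}
\cdot
\begin{pmatrix}
\Delta E^{\mathrm{Fe}} \\ \Delta E^{\mathrm{Fe_3X}} \\ \Delta E^{\mathrm{FeX}} \\ \Delta E^{\mathrm{FeX_3}} \\ \Delta E^{\mathrm{X}}
\end{pmatrix}
\tag{161}
$$

$$
=
\begin{pmatrix}
0.063 & 0.25 & 0.375 & 0.25 & 0.063 \\
0.25 & 0.5 & 0.0 & -0.5 & -0.25 \\
0.375 & 0.0 & -0.75 & 0.0 & 0.375 \\
0.25 & -0.5 & 0.0 & 0.5 & -0.25 \\
0.063 & -0.25 & 0.375 & -0.25 & 0.063
\end{pmatrix}
\cdot
\begin{pmatrix}
\Delta E^{\mathrm{Fe}} \\ \Delta E^{\mathrm{Fe_3X}} \\ \Delta E^{\mathrm{FeX}} \\ \Delta E^{\mathrm{FeX_3}} \\ \Delta E^{\mathrm{X}}
\end{pmatrix}
\tag{162}
$$

Fig. 10.17 Effective cluster interaction energies obtained for (a) Fe-Ni
(Mohri and Chen 2004b); (b) Fe-Pd (Mohri and Chen 2004b); (c) Fe-Pt
(Chen, Iwata and Mohri 2002; Mohri and Chen 2004b), by the cluster
expansion method. Note that by including the heat of formation of the
L1₁ ordered phase in Eq. (164), $v_0$ is altered while all the other
interaction energies remain unchanged.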


Recall that the subscript of $v_{2,1}$ emphasizes the nearest-neighbor pair interaction
energy in order to distinguish it from the second nearest-neighbor pair interaction
energy $v_{2,2}$, which will be discussed later.
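The matrix inversion of Eqs. (159)-(162) can be checked numerically with a few lines of Python. The sketch below assumes NumPy is available; the correlation matrix is the one written in Eq. (159), whereas the heats of formation in `delta_e_demo` are hypothetical numbers chosen only to exercise the code, not calculated values.

```python
import numpy as np

# Rows: Fe (fcc), Fe3X (L1_2), FeX (L1_0), FeX3 (L1_2), X (fcc).
# Columns: null, point, nearest-neighbor pair, triangle, tetrahedron clusters.
XI = np.array([
    [1.0,  1.0,  1.0,        1.0,  1.0],
    [1.0,  0.5,  0.0,       -0.5, -1.0],
    [1.0,  0.0, -1.0 / 3.0,  0.0,  1.0],
    [1.0, -0.5,  0.0,        0.5, -1.0],
    [1.0, -1.0,  1.0,       -1.0,  1.0],
])

XI_INV = np.linalg.inv(XI)  # numerical counterpart of the matrix in Eq. (162)


def effective_interactions(delta_e):
    """Solve XI @ v = delta_e for v = (v0, v1, v2_1, v3, v4)."""
    return np.linalg.solve(XI, np.asarray(delta_e, dtype=float))


# Hypothetical heats of formation in Ry/atom (illustration only).
delta_e_demo = [0.0, -0.002, -0.004, -0.003, 0.0]
v_demo = effective_interactions(delta_e_demo)

print(np.round(XI_INV, 4))
print(v_demo)
```

In practice the heats of formation are functions of the atomic distance r, so the same solve would be repeated for each r to obtain $v_m(r)$.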

The extracted effective interaction energies are shown in Fig. 10.17 for the Fe-Ni,
Fe-Pd, and Fe-Pt systems (Mohri 2005). The pair interaction energy is dominant
over the entire range of lattice constants, which virtually covers the concentration
range from 0 to 100%. This is a characteristic feature of a metallic alloy system,
for which cohesive properties are controlled mostly by central atomic forces.

In order to obtain further interaction energies, more ordered phases are added
to the vector $\{\Delta E^{(n)}\}$ and, accordingly, the correlation matrix $\{\xi_m^{(n)}\}$ is
expanded. However, when another row and column are added to an n × n matrix,
$\Phi$, to create an (n + 1) × (n + 1) matrix, $\Phi'$, the element $\varphi'_{kl}$ of the
inverse matrix $\Phi'^{-1}$ is generally different from the element $\varphi_{kl}$ of the inverse
of the original $\Phi$ (1 ≤ k, l ≤ n). Hence, a simple extension of the energy vector and
correlation matrix may alter the resultant effective cluster interaction energies
$\{v_m\}$, and one cannot extract the net contribution of the added cluster. As can be
seen below, this mathematical difficulty is tentatively circumvented for the 6 × 6
case by choosing the L1₁ ordered phase as the sixth phase.

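By adding the total energy of FeX with the L1₁ structure (n = 6), Eqs. (159) and
(161) are modified to Eqs. (163) and (164-165), respectively.

$$
\begin{pmatrix}
\Delta E^{\mathrm{Fe}} \\ \Delta E^{\mathrm{Fe_3X}} \\ \Delta E^{\mathrm{FeX}} \\ \Delta E^{\mathrm{FeX_3}} \\ \Delta E^{\mathrm{X}} \\ \Delta E^{\mathrm{FeX};L1_1}
\end{pmatrix}
=
\begin{pmatrix}
1 & 1 & 1 & 1 & 1 & 1 \\
1 & 1/2 & 0 & -1/2 & -1 & 1 \\
1 & 0 & -1/3 & 0 & 1 & 1 \\
1 & -1/2 & 0 & 1/2 & -1 & 1 \\
1 & -1 & 1 & -1 & 1 & 1 \\
1 & 0 & 0 & 0 & -1 & -1
\end{pmatrix}
\cdot
\begin{pmatrix}
v_0 \\ v_1 \\ v_{2,1} \\ v_3 \\ v_4 \\ v_{2,2}
\end{pmatrix}
\tag{163}
$$

$$
\begin{pmatrix}
v_0 \\ v_1 \\ v_{2,1} \\ v_3 \\ v_4 \\ v_{2,2}
\end{pmatrix}
=
\begin{pmatrix}
1 & 1 & 1 & 1 & 1 & 1 \\
1 & 1/2 & 0 & -1/2 & -1 & 1 \\
1 & 0 & -1/3 & 0 & 1 & 1 \\
1 & -1/2 & 0 & 1/2 & -1 & 1 \\
1 & -1 & 1 & -1 & 1 & 1 \\
1 & 0 & 0 & 0 & -1 & -1
\end{pmatrix}^{-1}
\cdot
\begin{pmatrix}
\Delta E^{\mathrm{Fe}} \\ \Delta E^{\mathrm{Fe_3X}} \\ \Delta E^{\mathrm{FeX}} \\ \Delta E^{\mathrm{FeX_3}} \\ \Delta E^{\mathrm{X}} \\ \Delta E^{\mathrm{FeX};L1_1}
\end{pmatrix}
\tag{164}
$$

$$
=
\begin{pmatrix}
0.063 & 0.0 & 0.375 & 0.0 & 0.063 & 0.5 \\
0.25 & 0.5 & 0.0 & -0.5 & -0.25 & 0.0 \\
0.375 & 0.0 & -0.75 & 0.0 & 0.375 & 0.0 \\
0.25 & -0.5 & 0.0 & 0.5 & -0.25 & 0.0 \\
0.063 & -0.25 & 0.375 & -0.25 & 0.063 & 0.0 \\
0.0 & 0.25 & 0.0 & 0.25 & 0.0 & -0.5
\end{pmatrix}
\cdot
\begin{pmatrix}
\Delta E^{\mathrm{Fe}} \\ \Delta E^{\mathrm{Fe_3X}} \\ \Delta E^{\mathrm{FeX}} \\ \Delta E^{\mathrm{FeX_3}} \\ \Delta E^{\mathrm{X}} \\ \Delta E^{\mathrm{FeX};L1_1}
\end{pmatrix}
\tag{165}
$$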


By comparing the elements of the inverse matrices of Eqs. (162) and (165), one
recognizes that the second nearest-neighbor effective pair interaction energy $v_{2,2}$
is extracted without altering the other interaction energies except $v_0$, which can
be confirmed in Fig. 10.17. Hence, we can examine the stability of the L1₀ ordered
phase in terms of the sole effects of $v_0$ and $v_{2,2}$. It should be noted that in the
present sign convention, a positive (negative) effective pair interaction enhances
the stability of unlike (like) pairs. The negative $v_{2,2}$ values shown in Fig. 10.17
clearly indicate the stabilization of the L1₀ ordered phase, since the second
nearest-neighbor pairs of the L1₀ ordered phase are all like pairs. Therefore, one can
expect the transition temperature to be raised by including $v_{2,2}$, which will be
discussed in Section 10.4.1.3.
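The statement that all second nearest-neighbor pairs of the L1₀ phase are like pairs, and the corresponding pair entries of the correlation matrix, can be verified by direct counting on the fcc lattice. The following sketch is an illustration under stated assumptions: fcc sites are integer triples with an even coordinate sum (in units of a/2), A and B atoms are represented by +1 and -1, and all function names are hypothetical.

```python
from fractions import Fraction
from itertools import permutations, product


def shell_vectors(base):
    """All distinct sign/permutation images of `base`, in units of a/2."""
    vectors = set()
    for perm in set(permutations(base)):
        for signs in product((1, -1), repeat=3):
            vectors.add(tuple(s * c for s, c in zip(signs, perm)))
    return sorted(vectors)


NN1 = shell_vectors((1, 1, 0))  # 12 nearest neighbors on the fcc lattice
NN2 = shell_vectors((2, 0, 0))  # 6 second nearest neighbors


def spin_l10(i, j, k):
    """L1_0: pure A and B (001) planes alternate along z."""
    return 1 if k % 2 == 0 else -1


def spin_l11(i, j, k):
    """L1_1: pure A and B (111) planes alternate along [111]."""
    return 1 if ((i + j + k) // 2) % 2 == 0 else -1


def pair_correlation(spin, shell, size=4):
    """Average of spin(site) * spin(neighbor) over fcc sites in a small block."""
    total, count = 0, 0
    for i, j, k in product(range(size), repeat=3):
        if (i + j + k) % 2:  # fcc sites have an even coordinate sum
            continue
        for dx, dy, dz in shell:
            total += spin(i, j, k) * spin(i + dx, j + dy, k + dz)
            count += 1
    return Fraction(total, count)


for name, spin in (("L1_0", spin_l10), ("L1_1", spin_l11)):
    print(name, pair_correlation(spin, NN1), pair_correlation(spin, NN2))
```

The counting gives a second-neighbor correlation of +1 for L1₀ (all like pairs) and -1 for L1₁ (all unlike pairs), which is why a negative $v_{2,2}$ favors L1₀ over L1₁.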
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When the elements of the 5 x 5 and the 6 x 6 matrices given in Eqs. (159) and 
(163), respectively, are compared, one realizes that the first and sixth columns have
the same elements down to the fifth component. This is the key ingredient for
extracting a new cluster interaction energy while minimizing the effects on the
previously determined energies. The mathematical foundation and more general
conditions are found elsewhere (Mohri and Chen 2004c). Without a clear convergence
criterion for the CEM, however, the best strategy is to include a large set of ordered
phases. In fact, it is a recent trend that the CEM is operated on more than 100 phases
(Sluiter et al. 1995; Santai et al. 2003; Zarkevich and Johnson 2004).
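The effect of bordering the 5 × 5 correlation matrix with the L1₁ row and the second-neighbor column can be examined numerically. The sketch below assumes NumPy and uses the matrices as written in Eqs. (159) and (163); the variable names are illustrative only.

```python
import numpy as np

XI5 = np.array([
    [1.0,  1.0,  1.0,        1.0,  1.0],
    [1.0,  0.5,  0.0,       -0.5, -1.0],
    [1.0,  0.0, -1.0 / 3.0,  0.0,  1.0],
    [1.0, -0.5,  0.0,        0.5, -1.0],
    [1.0, -1.0,  1.0,       -1.0,  1.0],
])

XI6 = np.zeros((6, 6))
XI6[:5, :5] = XI5
XI6[:5, 5] = 1.0                            # sixth column equals the first one
XI6[5] = [1.0, 0.0, 0.0, 0.0, -1.0, -1.0]   # FeX with L1_1 structure

inv5 = np.linalg.inv(XI5)
inv6 = np.linalg.inv(XI6)

# Rows 1-4 (v1, v2_1, v3, v4) keep their 5 x 5 coefficients and do not
# pick up any dependence on the newly added L1_1 energy.
rows_unchanged = (np.allclose(inv6[1:5, :5], inv5[1:5])
                  and np.allclose(inv6[1:5, 5], 0.0))

print(rows_unchanged)
print(np.round(inv6[0], 4))  # coefficients of v0, the only altered row
print(np.round(inv6[5], 4))  # coefficients of v2_2
```

Because the sixth column coincides with the first (null-cluster) column in the first five rows, only the row of the inverse that yields $v_0$ is modified; a generic added phase would not have this property.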


10.4.1.3 Phase Diagram 

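Within the nearest-neighbor pair interaction model combined with the tetrahedron
approximation of the CVM, Helmholtz free energies for the disordered and L1₀
ordered phases were given in Eqs. (74) and (75), respectively. Since the effective
interaction energies were obtained up to the tetrahedron cluster in Section
10.4.1.2, the Helmholtz free energies of formation of the disordered and L1₀ ordered
phases within the tetrahedron approximation are modified to Eqs. (166) and
(167), respectively.

$$
\Delta F^{dis} = \sum_{m=0}^{4} v_m(r)\cdot\xi_m - T\cdot k_B\cdot\ln
\frac{\left\{\prod_{i,j}(N y_{ij})!\right\}^{6}\cdot(N!)}
{\left\{\prod_{i}(N x_i)!\right\}^{5}\left\{\prod_{i,j,k,l}(N w_{ijkl})!\right\}^{2}}
\tag{166}
$$

$$
\Delta F^{L1_0} = \sum_{m=0}^{4} v_m(r)\cdot\xi_m^{L1_0} - T\cdot k_B\cdot\ln
\frac{\left\{\prod_{i,j}(N y_{ij}^{\alpha\alpha})!\right\}\left\{\prod_{i,j}(N y_{ij}^{\alpha\beta})!\right\}^{4}\left\{\prod_{i,j}(N y_{ij}^{\beta\beta})!\right\}\cdot N!}
{\left\{\prod_{i}(N x_i^{\alpha})!\right\}^{5/2}\left\{\prod_{i}(N x_i^{\beta})!\right\}^{5/2}\left\{\prod_{i,j,k,l}(N w_{ijkl}^{\alpha\alpha\beta\beta})!\right\}^{2}}
\tag{167}
$$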
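A quick consistency check of the entropy term of Eq. (166) is that, after applying Stirling's approximation, it must reduce to the ideal mixing entropy when the cluster probabilities factorize. The sketch below illustrates this for a binary alloy; the helper names and the composition 0.3 are arbitrary assumptions made for the example, and entropies are given per lattice point in units of $k_B$.

```python
import math
from itertools import product


def _sum_p_ln_p(probabilities):
    return sum(p * math.log(p) for p in probabilities if p > 0.0)


def tetrahedron_entropy(x, y, w):
    """Stirling form of the log term of Eq. (166), per lattice point, in units of k_B."""
    return (6.0 * _sum_p_ln_p(y.values())
            - 5.0 * _sum_p_ln_p(x.values())
            - 2.0 * _sum_p_ln_p(w.values()))


def random_alloy(c_a):
    """Point, pair and tetrahedron probabilities of an uncorrelated binary alloy."""
    x = {"A": c_a, "B": 1.0 - c_a}
    y = {ij: x[ij[0]] * x[ij[1]] for ij in product("AB", repeat=2)}
    w = {ijkl: math.prod(x[s] for s in ijkl) for ijkl in product("AB", repeat=4)}
    return x, y, w


x, y, w = random_alloy(0.3)
s_cvm = tetrahedron_entropy(x, y, w)
s_ideal = -_sum_p_ln_p(x.values())
print(s_cvm, s_ideal)
```

The coefficients 6, 5, and 2 are the exponents appearing in Eq. (166); for uncorrelated probabilities they combine to 12 - 5 - 8 = -1 times the sum of x ln x, that is, the Bragg-Williams value.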


We note that $E_{\max}$ and $S_{\max}$ both refer to the tetrahedron cluster and the
condition given by Eq. (81) is satisfied. Since the cluster probabilities can be
transformed to a set of correlation functions, these free energy formulas
can be converted to ones written in terms of correlation functions only, as in
Eqs. (168) and (169).

$$\Delta F^{dis} = F^{dis}\left(T, \{v_i(r)\}; \xi_1, \xi_2, \xi_3, \xi_4\right) \tag{168}$$

$$\Delta F^{L1_0} = F^{L1_0}\left(T, \{v_i(r)\}; \xi_1^{\alpha}, \xi_1^{\beta}, \xi_2^{\alpha\alpha}, \xi_2^{\alpha\beta}, \xi_2^{\beta\beta}, \xi_3^{\alpha\alpha\beta}, \xi_3^{\alpha\beta\beta}, \xi_4^{\alpha\alpha\beta\beta}\right) \tag{169}$$

These are simple extensions of Eqs. (90) and (91), respectively, but it is emphasized
that the effective interaction energies are no longer constant, but depend on
the atomic distance r. Hence, the minimization conditions given by Eq. (92) are
also modified to Eqs. (170) and (171), where Eq. (170) indicates the minimization
with respect to the configurational variables, while Eq. (171) provides the optimum
volume under an ambient pressure.

$$
\left.\frac{\partial \Delta F^{dis}}{\partial \{\xi_m\}}\right|_{T,r} = 0, \qquad
\left.\frac{\partial \Delta F^{L1_0}}{\partial \{\xi_m^{L1_0}\}}\right|_{T,r} = 0
\tag{170}
$$

$$
\left.\frac{\partial \Delta F^{dis}}{\partial r}\right|_{T,\{\xi_m\}} = 0, \qquad
\left.\frac{\partial \Delta F^{L1_0}}{\partial r}\right|_{T,\{\xi_m^{L1_0}\}} = 0
\tag{171}
$$








It should be noted that Eq. (171) only optimizes the volume by keeping the 
global lattice symmetry; hence the system is still in an excited state without allow- 
ing local lattice distortions. Therefore, in a more rigorous calculation, local atomic 
displacement should be incorporated into the free energy. This is the main task of 
the continuous displacement cluster variation method (CDCVM) which has been 
developed recently (Kikuchi and Beldjenna 1992; Kikuchi 1998; Kikuchi and 
Masuda-Jindo 1999; Uzawa and Mohri 2001a,b; Uzawa and Mohri 2002). Un- 
fortunately, however, application of the CDCVM is still limited to a primitive 
case and it is far from being applicable to the present Fe-based alloy systems. 

This inconvenience is partly relieved by incorporating lattice vibration effects.
According to the Debye-Grüneisen model, which is a quasiharmonic approximation,
the free energy can be given as Eq. (172), where the right-hand side is the sum of
the electronic total energy $E_{\mathrm{el}}^{(n)}(r)$, the vibrational internal energy
$E_{\mathrm{vib}}^{(n)}(r,T)$, and the term containing the vibrational entropy $S_{\mathrm{vib}}^{(n)}(r,T)$.

$$F^{(n)}(r,T) = E_{\mathrm{el}}^{(n)}(r) + E_{\mathrm{vib}}^{(n)}(r,T) - T\cdot S_{\mathrm{vib}}^{(n)}(r,T) \tag{172}$$

Among these three terms, $E_{\mathrm{el}}^{(n)}(r)$ is equivalent to $\Delta E^{(n)}(r)$ in Eq. (156), and
the vibrational internal energy and entropy are written as Eqs. (173) and (174),
respectively, where $\Theta_D^{(n)}$ is the Debye temperature and D(x) is the Debye function;
the first term of the vibrational energy represents the zero-point energy.

$$E_{\mathrm{vib}}^{(n)}(r,T) = \frac{9}{8}\,k_B\cdot\Theta_D^{(n)} + 3k_B\cdot T\cdot D\!\left(\frac{\Theta_D^{(n)}}{T}\right) \tag{173}$$

$$S_{\mathrm{vib}}^{(n)}(r,T) = 3k_B\left\{\frac{4}{3}\,D\!\left(\frac{\Theta_D^{(n)}}{T}\right) - \ln\left[1 - \exp\!\left(-\frac{\Theta_D^{(n)}}{T}\right)\right]\right\} \tag{174}$$

The essential point of the approach proposed by Moruzzi et al. (1988) is that thermal
properties such as the bulk modulus, the Grüneisen constant, and the Debye
temperature are obtained by fitting to a four-parameter Morse potential. They
reproduced the coefficients of thermal expansion of various metals. Then, Becker
et al. (1990) extended the procedure to ordered compounds. The present author
has also applied the method extensively to phase equilibria calculations (Mohri et
al. 1991, 1993; Nakamura and Mohri 1993, 1994c; Chen et al. 2005). The reader
interested in the procedure is referred to the original articles.
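The Debye expressions for the vibrational energy and entropy can be evaluated with a short numerical sketch. This is only an illustration of the standard Debye model at a fixed Debye temperature: the Morse-potential fit and the Grüneisen volume dependence of the Debye temperature are not modeled, the function names are hypothetical, and Simpson's rule is merely one convenient way of evaluating D(x).

```python
import math

K_B = 6.3336e-6  # Boltzmann constant in Ry/K


def debye_function(x, n_steps=2000):
    """D(x) = (3 / x**3) * integral_0^x t**3 / (exp(t) - 1) dt (Simpson's rule)."""
    if x <= 0.0:
        return 1.0

    def integrand(t):
        if t == 0.0 or t > 700.0:  # limits: t**2 -> 0 and exponential decay
            return 0.0
        return t**3 / math.expm1(t)

    h = x / n_steps
    total = integrand(0.0) + integrand(x)
    for i in range(1, n_steps):
        total += (4.0 if i % 2 else 2.0) * integrand(i * h)
    return 3.0 / x**3 * total * h / 3.0


def vibrational_energy(theta_d, temperature):
    """Zero-point term plus thermal term of the Debye internal energy."""
    x = theta_d / temperature
    return 9.0 / 8.0 * K_B * theta_d + 3.0 * K_B * temperature * debye_function(x)


def vibrational_entropy(theta_d, temperature):
    """Debye vibrational entropy."""
    x = theta_d / temperature
    return 3.0 * K_B * (4.0 / 3.0 * debye_function(x) - math.log(-math.expm1(-x)))


def vibrational_free_energy(theta_d, temperature):
    """E_vib - T * S_vib."""
    return (vibrational_energy(theta_d, temperature)
            - temperature * vibrational_entropy(theta_d, temperature))


print(vibrational_free_energy(300.0, 1000.0))  # hypothetical Debye temperature of 300 K
```

A useful check of such an implementation is that the entropy equals the negative temperature derivative of the free energy at fixed Debye temperature, and that the internal energy approaches the classical limit of three times $k_B T$ at high temperature.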

For the calculation of phase diagrams, the CEM operates on $\Delta F^{(n)}(r,T)$,
which is derived through Eq. (175).

$$\Delta F^{(n)}(r,T) = F_{\mathrm{Fe}_{4-n}\mathrm{X}_n}(r,T) - \frac{4-n}{4}\cdot F_{\mathrm{Fe}}(r_{\mathrm{Fe}},T) - \frac{n}{4}\cdot F_{\mathrm{X}}(r_{\mathrm{X}},T) \tag{175}$$

It should be noted that by the introduction of lattice vibrational effects, the energy
reference state is no longer independent of temperature as assumed in Eq. (156),
which accounts for the temperature dependence of the free energy of formation
in Eq. (175). The resultant effective cluster interaction energies are then obtained
as a function not only of the lattice parameter but also of temperature [Eq. (176)].

$$v_m(r,T) = \sum_{n}\{\xi_m^{(n)}\}^{-1}\,\Delta F^{(n)}(r,T) \tag{176}$$


The calculated L1₀-disorder phase boundaries with and without lattice vibration
effects are displayed in Fig. 10.18 for the Fe-Ni, Fe-Pd, and Fe-Pt systems
(Mohri 2005). In these calculations, the second nearest-neighbor pair interaction
energy is not included. One can see that the incorporation of the lattice
vibration effects reduces the transition temperature considerably. This is
interpreted as a relative stabilization of the disordered phase. As mentioned above,
the local distortion effects are not included in the model. Under such a condition,
the extra energy is higher for the disordered phase, since atoms of different sizes
are more often encountered in the disordered phase than in the ordered phase. The
lattice softening introduced by lattice vibration effects accommodates atoms of
different sizes and, therefore, the extra energy is more effectively released in the
disordered phase.

For the Fe-Pd system (Fig. 10.18b), the transition temperature is reduced to
1030 K, which is in close agreement with the experimental value of 1023 K. For
the Fe-Pt system (Fig. 10.18c), two calculations are performed without vibration
effects. In one, an antiferromagnetic state is assumed for $\Delta E^{\mathrm{FePt_3}}$ in
Eqs. (158-162); the other is calculated with $\Delta E^{\mathrm{FePt_3}}$ in the ferromagnetic state.
By introducing the correct antiferromagnetic state, the transition temperature is
decreased significantly. The lattice vibration effects further reduce the transition
temperature to 1610 K, which is in excellent agreement with the experimental value
of 1600 K. Both lattice vibrations and magnetic structure influence the resultant
transition temperature quite markedly.

For Fe-Ni, the existence of an L1₀ ordered phase has not yet been reported in a
conventional phase diagram. The calculation shown in Fig. 10.18(a), however,
indicates that this ordered phase can be stabilized below approximately 500 K. In
fact, the electronic structure ground-state energy calculations in Fig. 10.16(a)
clearly indicated the stability of this phase. Experimental examination of phase
equilibria at low temperatures is not an easy task, but the recent experimental
confirmation (Ohnuma and Ishida 2004) of the existence of an Fe-Ni L1₀ phase
is quite encouraging.

Fig. 10.18 L1₀-disorder phase boundaries for the (a) Fe-Ni (Mohri and
Chen 2004b); (b) Fe-Pd (Mohri and Chen 2004a,b); (c) Fe-Pt systems
(Mohri and Chen 2002, 2004b). For the Fe-Pd and Fe-Pt systems,
comparisons are made among phase boundaries with and without
lattice vibration effects.

Finally, two points should be addressed. The first is that the congruent composition
of the L1₀ ordered phase in Fe-Pd is reported to deviate considerably from
50 at.%. This is not reproduced by the above calculations. The discrepancy
may be attributed to the fact that the tetragonal distortion of the L1₀ ordered phase
and the magnetic entropy at finite temperatures were neglected. This should be
clarified in a future study.

Secondly, it is interesting to calculate the transition temperature by incorporating
the second nearest-neighbor effective pair interaction energy, which is extracted
by extending the CEM as discussed in Section 10.4.1.2. According to the
criterion suggested in Eq. (81), the tetrahedron approximation of the CVM is not
sufficient for the inclusion of the second nearest-neighbor pair interaction energy.
It is indispensable to introduce the tetrahedron-octahedron approximation
described in Eq. (72) for the entropy term. Hence, the free energy of a disordered
phase is written as Eq. (177).


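$$
\Delta F^{dis} = \sum_{m=0}^{5} v_m(r)\cdot\xi_m - T\cdot k_B\cdot\ln
\frac{\left\{\prod_{i,j,k}(N z_{ijk})!\right\}^{8}\cdot\left\{\prod_{i}(N x_i)!\right\}}
{\left\{\prod_{i,j,k,l,m,n}(N v_{ijklmn})!\right\}\cdot\left\{\prod_{i,j,k,l}(N w_{ijkl})!\right\}^{2}\cdot\left\{\prod_{i,j}(N y_{ij})!\right\}^{6}}
\tag{177}
$$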


In a similar manner, the free energy of the L1₀ ordered phase can be derived,
although this is not demonstrated explicitly here.

The transition temperatures thus calculated are significantly higher than those
calculated earlier for both the Fe-Pd (Chen et al. 2002b) and Fe-Pt systems (Mohri
and Chen 2004a,b,c). The increase is attributed to the negative $v_{2,2}$, which
stabilizes the structure of the L1₀ ordered phase; however, the effect of $v_0$ should
not be overlooked. As was suggested by Eq. (50), $v_0$ is equivalent to the heat of
formation of a completely random solid solution at the equiatomic composition. Then,
the increase in $v_0$ caused by incorporation of the L1₁ ordered phase, as shown in
Fig. 10.17, results in the destabilization of the disordered phase. It is therefore
more reasonable to attribute the increase in the transition temperature to the
combined effects of $v_{2,2}$, which stabilizes the L1₀ ordered structure, and $v_0$,
which destabilizes the disordered phase.


10.4.2 
Microstructural Evolution Calculated by the Phase Field Method 


10.4.2.1 Hybrid Model 

Recently, the phase field method (PFM) (Chen 2002) has started to attract broad
attention. In the materials science community, the PFM is interpreted as the
combination of the Cahn-Hilliard diffusion equation and the time-dependent
Ginzburg-Landau (TDGL) equation, which were introduced in Section 10.3.2. The
simultaneous solution of both equations is supposed to describe the microstructural
evolution process in terms of both composition and configuration, characterized by
order parameters. The key to the PFM is the interfacial energy, which is included
explicitly in the Ginzburg-Landau type total free energy in Eq. (120), which is
subject to minimization with respect to field variables. Hence, unlike the situation
in most of the previous work using a sharp interface model, an interface is
not a special entity within the PFM to be described separately, but merely a
spatial localization of field variables.

The local free energy density $f_0$ in Eq. (120) determines the final equilibrium
state as well as the path toward it; therefore the description of $f_0$ plays a
determining role in the PFM. In the conventional PFM, the local free energy density
is given by a phenomenological Landau-type series expansion, by which the first-
or second-order character of the transformation is quite easily described.
Furthermore, one may investigate multimode phase transitions which involve more than
a single phase transition, such as phase separation with ordering, by an efficient
parameterization.

All of these fruitful features of the Landau-type free energy are connected with
the phenomenological nature of this approach. At the same time, it is indispensable
to select various parameters, including materials constants, effectively in order to
reproduce the transition behavior of interest. These selected parameters, however,
leave some ambiguities in the calculated results, as will be pointed out later.
Recently, hybridization of the CVM free energy with the PFM was attempted (Ohno and
Mohri 2001a,b, 2002, 2003; Ohno 2004). The hybrid model has two advantages.
One is that the atomistic nature of the CVM free energy provides an opportunity to
determine all the necessary quantities appearing in the PFM without ambiguities.
This is the key to the first-principles phase field method, which will be discussed
in Section 10.4.2.2. The other is the possibility of performing multiscale
calculations from the atomistic to the microstructural scale. Below, the time
evolution of the antiphase boundary associated with L1₀ ordering is described by the
hybrid model as an example. For simplicity, the composition is fixed at 1:1
stoichiometry, and our main focus is placed on configurational kinetics described
solely by TDGL equations.

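As stated in Section 10.2.5, the CVM free energy for the L1₀ ordered phase is
given symbolically as Eq. (91). At a fixed composition of 50 at.%, the number of
independent variables can be reduced owing to the symmetry of the L1₀ ordered
phase, and the free energy was written as Eq. (93).

$$F^{L1_0} = f^{L1_0}\left(\xi_1^{\alpha}, \xi_1^{\beta}, \xi_2^{\alpha\alpha}, \xi_2^{\alpha\beta}, \xi_2^{\beta\beta}, \xi_3^{\alpha\alpha\beta}, \xi_3^{\alpha\beta\beta}, \xi_4^{\alpha\alpha\beta\beta}\right) \tag{91}$$

$$F^{L1_0} = f^{L1_0}\left(\xi_1^{\alpha}, \xi_2^{\alpha\alpha}, \xi_2^{\alpha\beta}, \xi_3^{\alpha\alpha\beta}, \xi_4^{\alpha\alpha\beta\beta}\right) \tag{93}$$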


It should be noted that the temperature T and the effective interaction energies
$\{v_i\}$ are omitted from the original free energy formula. The calculated
L1₀-disorder phase diagram was shown in Fig. 10.8. The transition temperature is
1.893 and the spinodal ordering temperature is 1.633 on the temperature scale
normalized with respect to the nearest-neighbor effective pair interaction energy $v_2$.

As was demonstrated in Fig. 10.14, the L1₀ ordered phase is characterized by
the layered stacking of alternating atomic planes of A and B atoms along one of
the ⟨100⟩ directions. For the phase diagram calculations, the distinction between
the three stacking directions need not be considered. However, for the microstructure
calculation, it is essential to distinguish the stacking directions, which give rise
to the variants. The free energy given by Eq. (91) should therefore be modified
so that the four sublattices of the fcc lattice are taken into account. In fact,
the fcc lattice can be decomposed into four interpenetrating simple cubic lattices,
and the apices of a nearest-neighbor tetrahedron cluster, termed α, β, γ, and δ in
Fig. 10.19, each belong to a different simple cubic lattice. Hence, the free energy
is a function of 15 independent variables, written as Eq. (178).

$$
F^{L1_0} = f^{L1_0}\left(\xi_1^{\alpha}, \xi_1^{\beta}, \xi_1^{\gamma}, \xi_1^{\delta},
\xi_2^{\alpha\beta}, \xi_2^{\alpha\gamma}, \xi_2^{\alpha\delta}, \xi_2^{\beta\gamma}, \xi_2^{\beta\delta}, \xi_2^{\gamma\delta},
\xi_3^{\alpha\beta\gamma}, \xi_3^{\alpha\beta\delta}, \xi_3^{\alpha\gamma\delta}, \xi_3^{\beta\gamma\delta},
\xi_4^{\alpha\beta\gamma\delta}\right)
\tag{178}
$$

Fig. 10.19 The L1₀ ordered phase splits into four
interpenetrating sublattices.


Accordingly, three distinguishable LRO parameters are defined as in Eqs. (179-181).

Again, by imposing the symmetry of the L1₀ ordered phase at a fixed 1:1
stoichiometric composition, the free energy can be simplified to Eq. (182), in which
the number of independent variables is reduced from 15 in Eq. (178) to ten. Three of
these independent variables, $\eta_1$, $\eta_2$, and $\eta_3$, are the LRO parameters defined by
Eqs. (179-181). The remaining correlation functions are regarded as SRO parameters.

$$
F^{L1_0} = f^{L1_0}\left(\eta_1, \eta_2, \eta_3,
\xi_2^{\alpha\beta}, \xi_2^{\alpha\gamma}, \xi_2^{\alpha\delta},
\xi_3^{\alpha\beta\gamma}, \xi_3^{\alpha\beta\delta}, \xi_3^{\alpha\gamma\delta},
\xi_4^{\alpha\beta\gamma\delta}\right)
\tag{182}
$$


By substituting Eq. (182) into the TDGL equation (135), one can derive the time
evolution behavior of the LRO parameters through Eq. (183).

$$\frac{\partial \eta_i}{\partial t} = -L\left(\frac{\partial f^{L1_0}}{\partial \eta_i} - 2\kappa_i\nabla^2\eta_i\right) \tag{183}$$

Fig. 10.20 Microstructural evolution from (a) a disordered state to
(b) an ordered state. Magnification ×. (c) and (d) indicate the atomic
configuration of the {001} plane at a fixed point of the microstructure
(×) corresponding to (a) and (b) respectively (Ohno 2004; Mohri 2005;
Mohri et al. 2005a,b, 2006).


The numerical calculation of the TDGL equation was carried out by transforming
it into a difference equation. Each spatially discretized point in the difference
equation represents an fcc lattice whose free energy is described by Eq. (182). The
LRO and SRO parameters differ from point to point, so interfaces are formed.
Among such interfaces, we consider an antiphase boundary (APB) caused by the
heterogeneity of the LRO, which can be observed by transmission electron
microscopy (TEM) in association with an order-disorder transition.
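The transformation of a TDGL equation into a difference equation can be illustrated with a one-dimensional sketch. Note the assumptions: instead of the CVM free energy used in the hybrid model, a simple Landau double-well $f(\eta) = (\eta^2 - 1)^2/4$ is substituted, a single order parameter is considered, and the explicit Euler scheme, grid spacing, and time step are arbitrary choices made for the example (NumPy is assumed).

```python
import numpy as np


def relax_apb(eta0, dx, dt, n_steps, mobility=1.0, kappa=1.0):
    """Explicit Euler integration of d(eta)/dt = -L * (f'(eta) - 2*kappa*eta'')."""
    eta = np.array(eta0, dtype=float)
    for _ in range(n_steps):
        laplacian = (eta[2:] - 2.0 * eta[1:-1] + eta[:-2]) / dx**2
        dfdeta = eta[1:-1] ** 3 - eta[1:-1]  # derivative of (eta**2 - 1)**2 / 4
        # End points stay fixed at -1 and +1 (two antiphase domains).
        eta[1:-1] -= dt * mobility * (dfdeta - 2.0 * kappa * laplacian)
    return eta


dx, dt = 0.5, 0.02
x = (np.arange(201) - 100) * dx
eta_final = relax_apb(np.sign(x), dx, dt, n_steps=5000)  # start from a sharp APB
eta_exact = np.tanh(x / 2.0)  # stationary profile of this toy model for kappa = 1

print(np.max(np.abs(eta_final - eta_exact)))
```

The order parameter changes sign across the boundary and relaxes from a sharp step to a diffuse profile whose width is set by the gradient energy coefficient; in the hybrid model the same scheme is applied with the CVM free energy and three LRO parameters.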

The microstructural evolution of a system at 50 at.% is shown in Fig.
10.20(a,b); it is initially in the disordered phase (Fig. 10.20a) and is quenched to
a temperature T = 1.6 (see Fig. 10.8), at which the L1₀ ordered phase is stabilized
(Fig. 10.20b). The gray level is proportional to the square of the LRO parameter,
which simulates a dark-field TEM image. Hence, dark regions indicate disordered
and APB areas and bright regions indicate ordered domains. One can see that
characteristic APBs are formed in the microstructure and, importantly, one can find
a triple-point junction which is formed by the collision of different variants of
ordered domains. The triple-point junction can never be reproduced if the free
energy of Eq. (93) is employed.

The advantage of the hybrid model is that atomistic information is derived
simultaneously, as shown in Fig. 10.20(c,d); these configurations correspond to the
fixed points in the micrographs indicated by the respective crosses in
Fig. 10.20(a,b). The two edge directions correspond to ⟨100⟩ directions. The
configurations are obtained with the Williams algorithm (Williams 1976). Since the
CVM free energy provides the correlation functions at each time step, the Williams
procedure synthesizes the atomic configuration on 120 × 120 × 120 fcc cubic lattice
points. The error between the input and resultant correlation functions is less
than 1%. One recognizes that ordering proceeds at the atomic level and the
initially disordered atomic arrangements (Fig. 10.20c) transform to the ordered
arrangements (Fig. 10.20d) with some antisite atoms remaining.

Fig. 10.21 Evolution of the square of the LRO parameter in the vicinity
of an APB (Ohno and Mohri 2003; Ohno 2004).

Figure 10.21 shows the time evolution of the LRO in the vicinity of an APB whose
center is located at x' = 100 on the normalized scale. It should be noted that the
LRO reverses its sign across the APB, as was described in Fig. 10.11, while the
vertical axis in Fig. 10.21 is the square of the LRO parameter. The profiles are
calculated at three different values of the normalized time
$t' = N\cdot v_{2,1}\cdot L\cdot t$. One can see that in the earlier period (t' = 0-1) the LRO
is reduced within each domain, and in the later period (t' = 1-40) the APB is
widened. This is well known as the wetting phenomenon, which is characteristic
of a first-order transition.

In this way, the hybrid model is capable of investigating multiscale phenomena 
covering the atomistic behavior within each domain (Fig. 10.20(c,d)) and interfa- 
cial phenomena (Fig. 10.21) as well as the microstructural evolution process (Fig. 
10.20(a,b)). It is worth pointing out that the difference in the time constant asso- 
ciated with the spatial scales is also reproduced in the present model (Ohno and 
Mohri 2003; Ohno 2004). 


10.4.2.2 Toward the First-Principles Phase Field Calculation 

Despite various advantageous features, the hybrid model is basically qualitative
in the sense that the length scale and time scale are not uniquely fixed in a
self-consistent manner within the theoretical framework. In fact, the spatial scale
is normalized as in Eq. (184), where the gradient energy coefficient $\kappa_i$ remains
unassigned, although it has been amply demonstrated that the nearest-neighbor
effective pair interaction energy $v_{2,1}$ can be determined by first-principles
calculation.

$$x' = x\cdot\sqrt{(N\cdot v_{2,1})/(2\kappa_i)} \tag{184}$$
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Fig. 10.22 Local and global coordinate system in [100] direction. 


In order to assign the gradient energy coefficient within the hybrid model, one
needs an atomistic model which is coherently connected to the continuous PFM.

First, we split the entire lattice into cells in a specified direction so that each cell
contains the same number of lattice planes. Then, as shown in Fig. 10.22 (Ohno
2004), we represent the position of each atomic plane, $p_n$, by two coordinate
systems, a global and a local one, through Eq. (185), where the global coordinate,
$R_l$, indicates the position of each cell, while the local coordinate, $r_m$, gives the
position of an atomic plane within each cell.

$$p_n = R_l + r_m \tag{185}$$

The origin of the global coordinate is assigned arbitrarily, whereas for the local
coordinate it is taken to be the center of each cell, so that Eq. (186) is satisfied.

$$\sum_{m} r_m = 0 \tag{186}$$

It should be noted that Fig. 10.22 shows the (001) planes in the bcc lattice, but
the procedure is the same in all directions (planes) for all structures.

We describe the free energy of the entire lattice as the sum of the free energies of
the individual lattice planes, $f_{\mathrm{CVM}}(p_n)$, by Eq. (187).

$$F = \sum_{n} f_{\mathrm{CVM}}(p_n) \tag{187}$$

This indicates that both the internal energy and the entropy are defined at each
lattice plane, for which the locations of clusters such as the pair, triangle,
tetrahedron, etc., are described consistently by the position of a lattice plane,
$p_n$, and the lattice constant, a. Since the CVM free energy is a function of the
cluster probabilities or correlation functions, Eq. (187) is rewritten as Eq. (188),
where $\phi_s$ represents a configurational variable (cluster probability or correlation
function) and h(a) is a function of the lattice constant a. Such a description of the
free energy is known as the inhomogeneous CVM (Kikuchi and Mohri 1997).

$$F = \sum_{l,m} f_{\mathrm{CVM}}\left[\{\phi_s(R_l + r_m + h(a))\}\right] \tag{188}$$


By assuming that the spatial variation of $\phi_s$ is smooth enough for it to be
expanded around the global coordinate, $R_l$, one derives the approximate expression
in Eq. (189), where terms involving third- and higher-order derivatives are
neglected.

$$
\phi_s(R_l + r_m + h(a)) \approx \phi_s(R_l) + (r_m + h(a))\cdot\nabla\phi_s(R_l)
+ \frac{1}{2}\,(r_m + h(a))^2\cdot\nabla^2\phi_s(R_l)
\tag{189}
$$
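The role of the second-order expansion, and of placing the local origin at the center of each cell, can be illustrated numerically. In the sketch below a smooth one-dimensional profile (a tanh function, chosen arbitrarily) stands in for a configurational variable, four planes per cell are assumed, and the offset h(a) is set to zero for simplicity; none of these choices comes from the original derivation.

```python
import math


def phi(x, width=20.0):
    """Smooth test profile standing in for a configurational variable."""
    return math.tanh(x / width)


def d2phi(x, width=20.0):
    """Second derivative of the test profile."""
    t = math.tanh(x / width)
    return -2.0 * t * (1.0 - t * t) / width**2


local_offsets = [-1.5, -0.5, 0.5, 1.5]  # plane positions r_m, centered on the cell
R = 10.0                                # global coordinate of the cell

exact = sum(phi(R + r) for r in local_offsets)
zeroth = len(local_offsets) * phi(R)
first_order_weight = sum(local_offsets)  # multiplies the gradient term
second = zeroth + 0.5 * sum(r * r for r in local_offsets) * d2phi(R)

print(first_order_weight, exact - zeroth, exact - second)
```

Because the local coordinates sum to zero, the term linear in the gradient drops out of the cell sum, and the leading correction to the homogeneous value comes from the second derivative.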


The substitution of Eq. (189) into Eq. (188) yields the symbolic free energy
expression in Eq. (190).

$$F = \sum_{l,m} f_{\mathrm{CVM}}\left(r_m + h(a), \{\phi_s(R_l)\}, \{\nabla\phi_s(R_l)\}, \{\nabla^2\phi_s(R_l)\}\right) \tag{190}$$


If we expand the above free energy around the homogeneous state, in which all
the spatial derivatives of the configurational variables vanish and the free energy
is a function of $\{\phi_s\}$ only, the resultant expression is given by Eq. (191), where
the zero indicates that the derivative is evaluated at the homogeneous state.
In this expansion, terms higher than second order are neglected. Equation (191)
can be represented concisely as Eq. (192), where $C_{i,s}(r_m, a, \{J\})$ is easily
identified as a coefficient term in Eq. (191) and depends on the local coordinate
$r_m$, the lattice constant a, and the atomic configuration specified by {J} on the
cluster s. It is shown (Ohno 2004) that these coefficient terms can be determined by
the symmetry of the lattice and the orientation of the crystallographic plane of
concern. Note that, in order to avoid complication, the approximate equality sign
in Eq. (191) is replaced by an equality sign in Eq. (192).
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By transforming the sum with respect to the global coordinate into an integral, 
one obtains Eq. (193), where L is the length of each cell. 


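$$
F = \frac{1}{L}\int \sum_{m}\Big\{ f_{\mathrm{CVM}}\left[\{\phi_s(R_l)\}\right]
+ \sum_{s} C_{1,s}(r_m, a, \{J\})\cdot(\nabla\phi_s)
+ \sum_{s} C_{2,s}(r_m, a, \{J\})\cdot(\nabla^2\phi_s)
+ \sum_{s,s'} C_{3,ss'}(r_m, a, \{J\})\cdot(\nabla\phi_s)\cdot(\nabla\phi_{s'}) \Big\}\,dx
\tag{193}
$$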


One can readily prove that $\sum_m C_{1,s}(r_m, a, \{J\})$ vanishes, and partial integration,
under the conventional assumption that the gradient of $\xi_s$ vanishes at a surface,
yields Eq. (194), where $N'$ is the number of lattice planes in each cell and $\kappa_{s,s'}$
is given as Eq. (195). Here, $V_s$ is called the generalized V-matrix (Ohno 2004),
which is an extension of the V-matrix in Eq. (25) and contains the information
about the atomic arrangements in a specified cluster.


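$$ F = \frac{1}{L} \int \Big[ N' \cdot f_{\mathrm{CVM}}[\{\xi_s\}] + \sum_{s,s'} \kappa_{s,s'} (\nabla \xi_s)(\nabla \xi_{s'}) \Big] \, dx \tag{194} $$

$$ \kappa_{s,s'} = N \cdot k_B \cdot T \cdot \sum_J V_s(r_m, a, \{J\}) \tag{195} $$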





Mathematically, in the course of summation with respect to $J$, the dependence on $r_m$
is tacitly eliminated from Eq. (195), and therefore the gradient energy coefficient
is finally given as Eq. (196).


$$ \kappa_{s,s'} = \kappa_{s,s'}(T, \{\xi_s\}, a) \tag{196} $$


We see that the mathematical form of Eq. (194) is similar to a Ginzburg-Landau
type of free energy expression, with all the pertinent information at the atomistic
scale condensed in the generalized gradient coefficient, which is not a constant, as
has been assumed in conventional PFM calculations, but depends on temperature
and SRO, $\{\xi_s\}$. It should be noted that the present derivation is a generalization
of Kikuchi and Cahn's pioneering work (Kikuchi and Cahn 1962) on B2
ordered alloys. There are, however, a number of differences between the two
derivations. Since detailed discussion is beyond the scope of the present chapter, the
interested reader should consult the original articles. By stressing that the
(generalized) gradient energy coefficient is now given by all the atomistic information
derived from the CVM, we go one step forward to calculate the microstructural
evolution from first principles.

Fig. 10.23 Time evolution of an antiphase boundary for Fe-Pd at
50 at.% at 880 K (Mohri et al. 2005a,b; Mohri et al. 2006). Time t':
(a) 0.01; (b) 0.83; (c) 3.59; (d) 7.04; (e) 16.70. The bar above (a)
represents a length of 100 nm, and the arrows indicate the [100]
and [011] directions.

Figure 10.23(a-e) shows the time evolution of an APB when the 50 at.% Fe-Pd
system is quenched from a disordered phase region to 880 K, at which the $\mathrm{L1_0}$
ordered phase is stabilized, as one can see in the phase diagram in Fig. 10.18(b).
Note that an absolute length scale is assigned in the micrograph and the
crystallographic orientation is also fixed uniquely. It should be emphasized that
these calculations are based only on the two atomic numbers of Fe
(26) and Pd (46). The rather strong anisotropy of the APB may be caused by the neglect
of the second nearest-neighbor pair interaction energy. This is one of the remaining
subjects to be settled in future calculations by introducing the tetrahedron-octahedron
approximation of the CVM (Sanchez and de Fontaine 1978; Mohri et
al. 1985a,b).

For a fully consistent first-principles calculation of phase transitions, two
remaining subjects should be addressed. One is to assign an absolute time scale.
In fact, in Fig. 10.23, the time scale is normalized with respect to a relaxation
constant. The other is a more rigorous coarse-graining operation. For these subjects,
efficient incorporation of fluctuations into the theory may be unavoidable.
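
The role of a gradient energy coefficient in a Ginzburg-Landau type of free energy, and of a time scale normalized by a relaxation constant, can be illustrated by a minimal one-dimensional relaxation sketch. Everything specific below is an assumption made for illustration and is not taken from the chapter: a single order parameter `eta`, a double-well local free energy `(eta**2 - 1)**2 / 4` standing in for the CVM free energy, a constant gradient coefficient `kappa` (whereas the chapter's coefficient depends on temperature and SRO), and a relaxation constant set to 1.

```python
import math

def relax_boundary(kappa=1.0, dx=0.5, dt=0.05, half_width=20.0, steps=2000):
    """Relax a sharp boundary with d(eta)/dt = -(f'(eta) - kappa * eta'').

    Hypothetical model: f(eta) = (eta**2 - 1)**2 / 4, constant kappa,
    relaxation constant equal to 1, so time is in normalized units.
    """
    n = int(round(2 * half_width / dx)) + 1
    x = [-half_width + i * dx for i in range(n)]
    # Sharp initial boundary between the two ordered variants eta = -1 and eta = +1.
    eta = [math.copysign(1.0, xi) if xi != 0.0 else 0.0 for xi in x]
    for _ in range(steps):
        new = eta[:]  # the two end points stay fixed at -1 and +1
        for i in range(1, n - 1):
            laplacian = (eta[i + 1] - 2.0 * eta[i] + eta[i - 1]) / dx**2
            local = eta[i] ** 3 - eta[i]  # f'(eta) for the double well
            new[i] = eta[i] - dt * (local - kappa * laplacian)  # explicit Euler step
        eta = new
    return x, eta

x, eta = relax_boundary()
# For this model the continuum equilibrium profile is tanh(x / sqrt(2 * kappa)).
print(eta[x.index(1.0)], math.tanh(1.0 / math.sqrt(2.0)))
```

In this toy model the boundary width scales as the square root of `kappa`, which is why a gradient coefficient derived from atomistic information fixes the length scale, while the time axis remains in units of the inverse relaxation constant.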


10.5 
Future Scope and Outlook 


The theoretical basis of the present chapter is the cluster variation method, which
is one of the most efficient tools for investigating alloy phase equilibria. Although
the CVM is an approximate theory, it is assured that by choosing a bigger basic
cluster one can approach an exact solution. One hopes that this account has
provided readers with a rigorous framework for the thermodynamics of alloys, as
well as practical techniques for investigating alloy problems through the sound
theory of the CVM.

It has been demonstrated that by combining the CVM with electronic structure
total energy calculations, one may perform self-consistent first-principles
calculations of phase equilibria. The main procedure of the first-principles
calculations within the CVM is nearly established, and applications to a wide
variety of systems are a matter of numerical technique. In view of the recent
development of high-performance computers, the author believes that the use of
the CVM will spread further with the distribution of user-friendly program codes.
The main task one has to tackle in the future is the extension to kinetics, i.e.,
development of the path probability method. As was shown in Section 10.2.6,
the applicability of the PPM has been limited to spin kinetics and no serious
efforts have been directed to the vacancy mechanism. Upon introduction of a
vacancy, one faces the difficulty of enumerating and identifying microscopic
paths for configurational evolution/devolution processes. In order to circumvent
this problem, it is desirable to develop a program code which formulates the
path probability function for a specified crystal symmetry and level of
approximation prior to the numerical solution of the simultaneous equations.
Such an attempt has been initiated by the author's group. This should pave the
way to combining the PPM with electronic structure calculations to perform
first-principles kinetics investigations, which are expected to resolve various
diffusion problems in ordered compounds.

Recent development of the phase field method is quite remarkable and various 
microstructural evolution processes have been analyzed efficiently by visualiza- 
tion of solutions. It was demonstrated that the CVM can even be combined with 
PFM to achieve multiscale calculations. Some preliminary calculations intro- 
duced in the present chapter are precursors to the future development of fully sat- 
isfactory multiscale calculations. For this, a major stumbling block is the coarse 
graining operation. The recipe introduced in Section 10.4.2.2 is an efficient way
to determine the spatial scale and crystallographic orientation uniquely from
atomistic information, but the time scale is still left as a subject for the
future. It is believed that the PPM, when combined consistently with the PFM,
will play an essential role in determining the time scale. In view of the recent
trend toward multiscale and multiphysics calculations on tera- and peta-flop
computers, spatiotemporal coarse graining is quite a challenging topic.

One of the remarkable trends recently in the materials science community is 
the discovery and development of low-dimensional materials such as thin films, 
nanowires and nanoparticles. In fact, many discussions are provided by various 
authors in this book. A low-dimensional material is a thermodynamic system in 
which the surface predominates. Since the CVM was developed originally for a 
bulk system in which symmetry is preserved all over the crystal, how to deal 
with the broken symmetry at a surface is a central concern in the theoretical 
studies. Unfortunately, CVM-based studies on a low-dimensional material are far 
behind Monte Carlo and molecular dynamics simulations. It may be recalled,
however, that the CVM is a unique theoretical tool which is able to perform a
synthetic study involving a phase diagram, atomistic kinetics, and microstructural
dynamics. An efficient scheme for extending the CVM to a system in which numerous
kinds of symmetries, including non-Bravais lattice symmetries, coexist is urgently
needed. For this, a generalization of the continuous displacement cluster variation
method (CDCVM), which was introduced in Section 10.4.1.3, has been taken up again.

Finally, it is emphasized that a long-lived theory is soundly based. This is the 
case with the CVM. Quite often, however, a sound theory is found to be too rigor- 
ous and not easily applicable to practical problems. Fortunately, the hierarchic 
structure of CVM assures a broad range of applications. The accuracy of the solu- 
tion depends merely on the basic cluster, which can be further enlarged on tera- 
and peta-flop computers. The author believes that CVM-based alloy thermody- 
namics will be one of the important components of modern alloy physics and 
will be a stimulant to solid-state physics. 


Appendix 
CALPHAD Free Energy 


In the course of this chapter, the main focus was placed on the atomistic descrip- 
tion of alloy phase stability, phase equilibria, and phase transitions. Naturally, the 
final goal of such an approach is calculations from first principles. Although 
much effort has been made in recent studies to generalize the scheme so that ver- 
satile classes of materials can be dealt with, the applicability of the first-principles 
calculation is mostly limited to binary or ternary alloys with highly symmetric 
structures. The application of the first-principles scheme to practical multicompo- 
nent alloy systems with an arbitrary symmetry is not very realistic at the present 
stage. 

One of the most powerful methods which can be applied to study thermodynamic
properties of practical alloy systems is the CALPHAD method. CALPHAD
is the abbreviation for CALculation of PHAse Diagrams, and this method has been
developed over nearly 40 years of international cooperation. The method is based
on the Bragg-Williams description of the entropy and free energy; hence the
mathematical treatment is not difficult. The lack of physical exactness originating
from the primitive Bragg-Williams approximation is circumvented by the efficient
parameterization of the free energy, in which the precision of the experimental
input data is the critical issue. An introduction to the CALPHAD free energy
is attempted in this appendix. Among various reference books, the author refers
to Turchi's concise summary (Turchi 2003) on this topic.

Within the regular solution model, the free energy, ${}^{\Phi}G$, of a multicomponent
solution of a particular phase $\Phi$ can be given by Eq. (A1), where ${}^{\Phi}G^{0}$ is the free
energy due to the mechanical mixing of pure components written as in Eq. (A2),
in which $I$ designates a component, $M$ the total number of components, and
$c_I$ and ${}^{\Phi}G_I^{0}$ are the concentration and free energy, respectively, of a pure
component $I$.

$$ {}^{\Phi}G = {}^{\Phi}G^{0} + {}^{\Phi}G^{\mathrm{ideal}}_{\mathrm{mix}} + {}^{\Phi}G^{\mathrm{xs}}_{\mathrm{mix}} \tag{A1} $$

$$ {}^{\Phi}G^{0} = \sum_{I=1}^{M} c_I \cdot {}^{\Phi}G_I^{0} \tag{A2} $$


${}^{\Phi}G^{\mathrm{ideal}}_{\mathrm{mix}}$ originates from the configurational entropy contribution, which according
to the regular solution model is given by Eq. (A3), which is nothing but the
Bragg-Williams approximation given by Eq. (56) for a binary system.

$$ {}^{\Phi}G^{\mathrm{ideal}}_{\mathrm{mix}} = R \cdot T \cdot \sum_I c_I \cdot \ln c_I \tag{A3} $$


Finally, ${}^{\Phi}G^{\mathrm{xs}}_{\mathrm{mix}}$ is the excess free energy necessary to incorporate nonideal
contributions. The simplest form is given by Eq. (A4), where $\Omega_{I,J}$ is termed an
interaction parameter in thermochemistry.

$$ {}^{\Phi}G^{\mathrm{xs}}_{\mathrm{mix}} = \sum_I \sum_{J>I} \Omega_{I,J} \cdot c_I \cdot c_J \tag{A4} $$

In order to further improve the description of the excess energy, one may generalize
Eq. (A4) in the series expansion of Eq. (A5), where $L^{k}_{I,J}$ is a kth-order binary
interaction parameter between species $I$ and $J$ which is expressed as a polynomial
in temperature $T$.

$$ {}^{\Phi}G^{\mathrm{xs}}_{\mathrm{mix}} = \sum_I \sum_{J>I} c_I \cdot c_J \sum_k L^{k}_{I,J} \, (c_I - c_J)^k \tag{A5} $$

This particular expression is known as the Redlich-Kister expansion. It is readily
seen that Eq. (A4) is a special case of Eq. (A5) in which the expansion is
truncated at k = 0.
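
A minimal sketch of Eqs. (A1) to (A5) may help to make the structure of the regular solution free energy concrete. The function names, the list/dictionary layout, and all numerical values below are hypothetical choices for illustration; in particular, the interaction parameters are treated as plain numbers, whereas in practice each would be a polynomial in temperature.

```python
import math

R = 8.314462618  # gas constant in J/(mol K)

def ideal_mixing(c, T):
    """R*T*sum_I c_I ln c_I, cf. Eq. (A3); components with c_I = 0 contribute nothing."""
    return R * T * sum(x * math.log(x) for x in c if x > 0.0)

def redlich_kister_excess(c, L):
    """Excess term of Eq. (A5).

    c: list of concentrations c_I.
    L: dict mapping a pair (I, J) with I < J to the list [L0, L1, ...] of
       kth-order interaction parameters (hypothetical values, in J/mol).
    """
    g = 0.0
    for (i, j), coeffs in L.items():
        g += c[i] * c[j] * sum(Lk * (c[i] - c[j]) ** k for k, Lk in enumerate(coeffs))
    return g

def solution_gibbs(c, T, g_pure, L):
    """Sum of mechanical mixing (A2), ideal mixing (A3), and excess (A5) terms, cf. Eq. (A1)."""
    g_ref = sum(x * g for x, g in zip(c, g_pure))
    return g_ref + ideal_mixing(c, T) + redlich_kister_excess(c, L)

# Truncating at k = 0 recovers the regular-solution form of Eq. (A4).
print(redlich_kister_excess([0.5, 0.5], {(0, 1): [1000.0]}))
print(solution_gibbs([0.75, 0.25], 1000.0, [0.0, 0.0], {(0, 1): [1000.0, 500.0]}))
```

Keeping the k = 0 term alone makes the excess energy symmetric about the equiatomic composition; the odd-order terms are what allow an asymmetric composition dependence.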

Two main features of the CALPHAD method are addressed here. One is that the
free energy expression of pure elements, ${}^{\Phi}G_I^{0}$, in Eq. (A2) is given by the particular
form of Eq. (A6), where $H^{\mathrm{SER}}$ is the enthalpy at the standard element reference
state of the substance in its most stable state at 298.15 K, $a$, $b$, $c$, and $d_i$ are
model (materials) constants, and data have been compiled by Dinsdale (1991) in
the Scientific Group Thermodata Europe (SGTE).

$$ G_m - H^{\mathrm{SER}} = a + b \cdot T + c \cdot T \cdot \ln T + \sum_i d_i T^i \tag{A6} $$

It should be noted that this expression is adopted not only for a pure element but
also for a stoichiometric compound. This is the starting building block for more
complicated alloy systems and in order to secure accuracy the compiled data have
often been updated.
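
The polynomial form of Eq. (A6) is straightforward to evaluate, as the following sketch shows. The function name and the dictionary layout for the $d_i$ terms are assumptions made here, and the coefficients in the usage line are invented placeholders, not values from the SGTE compilation.

```python
import math

def gibbs_minus_hser(T, a, b, c, d):
    """Evaluate a + b*T + c*T*ln(T) + sum_i d_i * T**i, cf. Eq. (A6).

    T is the absolute temperature in kelvin; d maps each integer power i
    (which may be negative) to its coefficient d_i.
    """
    if T <= 0.0:
        raise ValueError("temperature must be positive (kelvin)")
    return a + b * T + c * T * math.log(T) + sum(di * T**i for i, di in d.items())

# Placeholder coefficients for illustration only.
print(gibbs_minus_hser(298.15, a=1000.0, b=-10.0, c=-2.0, d={2: 1.0e-3, -1: 500.0}))
```

In actual databases such coefficient sets are usually given piecewise over temperature intervals, so a practical routine would first select the interval that contains `T`.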

The second feature of the CALPHAD method is that the Gibbs free energy for a 
multicomponent solution phase is described on the basis of a sublattice model. 
Essential points of the model are described below by taking as an example the 
two-sublattice model for an A-B binary alloy system. 

Within the two-sublattice model, a phase consists of two sublattices. The two
components enter both sublattices, which is denoted as $(A, B)_p(A, B)_q$, where
subscripts $p$ and $q$ are the numbers of sites in each sublattice. The free energy, $G$, of
a solution phase is then given by Eq. (A7), which is in the same form as ${}^{\Phi}G$ in Eq.
(A1).

$$ G = G^{0} + G^{\mathrm{ideal}}_{\mathrm{mix}} + G^{\mathrm{xs}}_{\mathrm{mix}} \tag{A7} $$

Like ${}^{\Phi}G^{0}$, $G^{0}$ describes the contribution from the mechanical mixing; however,
due to the introduction of the sublattices, four kinds of pure components are
mixed. Two of them are pure elements A and B and the other two are the
stoichiometric compounds, $A_pB_q$ and $B_pA_q$. Then, $G^{0}$ is written as Eq. (A8), where $y$
indicates the concentration (site fraction) and the subscripts and superscripts
denote the species and sublattice, respectively: $y_A^{\mathrm{I}}$, for instance, is the concentration
of species A in sublattice I.

$$ G^{0} = y_A^{\mathrm{I}} y_A^{\mathrm{II}} G^{0}_{A:A} + y_A^{\mathrm{I}} y_B^{\mathrm{II}} G^{0}_{A:B} + y_B^{\mathrm{I}} y_A^{\mathrm{II}} G^{0}_{B:A} + y_B^{\mathrm{I}} y_B^{\mathrm{II}} G^{0}_{B:B} \tag{A8} $$


$G^{0}_{I:J}$ is the Gibbs energy of the compound $I_pJ_q$ expressed by Eq. (A6). $G^{0}_{I:I}$ is
equivalent to ${}^{\Phi}G_I^{0}$ in Eq. (A2), and $G^{0}_{I:J}$ ($G^{0}_{J:I}$) is the new contribution due to the
introduction of the sublattices. One realizes that the prefactor $c_I$ in Eq. (A2) is
replaced in Eq. (A8) by the product of the site fractions of the species in the
sublattices of concern.

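$G^{\mathrm{ideal}}_{\mathrm{mix}}$ is the mixing entropy term based on the Bragg-Williams expression
given by Eq. (A9), and this corresponds to ${}^{\Phi}G^{\mathrm{ideal}}_{\mathrm{mix}}$ in Eq. (A3).

$$ G^{\mathrm{ideal}}_{\mathrm{mix}} = p \cdot RT \left( y_A^{\mathrm{I}} \ln y_A^{\mathrm{I}} + y_B^{\mathrm{I}} \ln y_B^{\mathrm{I}} \right) + q \cdot RT \left( y_A^{\mathrm{II}} \ln y_A^{\mathrm{II}} + y_B^{\mathrm{II}} \ln y_B^{\mathrm{II}} \right) \tag{A9} $$

One should note that the mixing entropy is evaluated in each sublattice and the
net contribution is given as the sum weighted by $p$ and $q$.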

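Finally, $G^{\mathrm{xs}}_{\mathrm{mix}}$ is the excess Gibbs energy of mixing due to nonideal contributions.
This is again the extension of Eq. (A5) and is given by Eq. (A10), where
$L^{k}_{A,B:*}$ ($L^{k}_{*:A,B}$) is the kth-order interaction parameter between the components A
and B in the first (second) sublattice.

$$ G^{\mathrm{xs}}_{\mathrm{mix}} = y_A^{\mathrm{I}} y_B^{\mathrm{I}} \left[ y_A^{\mathrm{II}} \sum_{k=0} L^{k}_{A,B:A} \left( y_A^{\mathrm{I}} - y_B^{\mathrm{I}} \right)^k + y_B^{\mathrm{II}} \sum_{k=0} L^{k}_{A,B:B} \left( y_A^{\mathrm{I}} - y_B^{\mathrm{I}} \right)^k \right] + y_A^{\mathrm{II}} y_B^{\mathrm{II}} \left[ y_A^{\mathrm{I}} \sum_{k=0} L^{k}_{A:A,B} \left( y_A^{\mathrm{II}} - y_B^{\mathrm{II}} \right)^k + y_B^{\mathrm{I}} \sum_{k=0} L^{k}_{B:A,B} \left( y_A^{\mathrm{II}} - y_B^{\mathrm{II}} \right)^k \right] \tag{A10} $$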


In this notation, a colon separates components occupying different sublattices,
and a comma separates interacting components in the same sublattice.
These equations can be generalized for multicomponent phases with several
sublattices, and they reduce to a random substitutional model when there is only
one sublattice.
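
The bookkeeping of the two-sublattice model can be sketched in a few lines. The function below follows the structure of Eqs. (A7) to (A10) for a binary (A,B)_p(A,B)_q phase; the function name, the string keys used for the end members and interaction parameters, and all numbers in the usage example are hypothetical, and the end-member energies are passed in as plain numbers rather than evaluated from Eq. (A6).

```python
import math

R = 8.314462618  # gas constant in J/(mol K)

def _xlogx(y):
    """y ln y with the limit 0 for y = 0."""
    return y * math.log(y) if y > 0.0 else 0.0

def _rk(coeffs, diff):
    """sum_k L^k * diff**k for a list of kth-order parameters."""
    return sum(Lk * diff**k for k, Lk in enumerate(coeffs))

def two_sublattice_gibbs(yA1, yA2, p, q, T, G0, L=None):
    """Gibbs energy of (A,B)_p(A,B)_q built from reference, ideal, and excess terms.

    yA1, yA2: site fractions of A on sublattices I and II.
    G0: dict with keys "A:A", "A:B", "B:A", "B:B" (end-member energies).
    L: optional dict with keys "A,B:A", "A,B:B", "A:A,B", "B:A,B",
       each a list [L0, L1, ...]; missing keys count as zero.
    """
    L = L or {}
    yB1, yB2 = 1.0 - yA1, 1.0 - yA2
    g_ref = (yA1 * yA2 * G0["A:A"] + yA1 * yB2 * G0["A:B"]
             + yB1 * yA2 * G0["B:A"] + yB1 * yB2 * G0["B:B"])
    g_ideal = R * T * (p * (_xlogx(yA1) + _xlogx(yB1)) + q * (_xlogx(yA2) + _xlogx(yB2)))
    d1, d2 = yA1 - yB1, yA2 - yB2
    g_xs = (yA1 * yB1 * (yA2 * _rk(L.get("A,B:A", []), d1) + yB2 * _rk(L.get("A,B:B", []), d1))
            + yA2 * yB2 * (yA1 * _rk(L.get("A:A,B", []), d2) + yB1 * _rk(L.get("B:A,B", []), d2)))
    return g_ref + g_ideal + g_xs

# Fully ordered limit: A on sublattice I, B on sublattice II gives the A:B end member.
G0 = {"A:A": 0.0, "A:B": -5000.0, "B:A": -5000.0, "B:B": 0.0}
print(two_sublattice_gibbs(1.0, 0.0, p=1, q=1, T=1000.0, G0=G0))
```

Because the only variables are the site fractions, an equilibrium state at fixed overall composition can be sought by minimizing this function over `yA1` and `yA2` subject to the composition constraint.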

The variables participating in minimizing the free energy are basically only 
concentrations (site fractions), which simplifies the mathematical procedures. 
This is the most significant advantage of the CALPHAD method. Physically, how- 
ever, the description of the free energy through Eqs. (A7-A10) does not take the 
atomic correlations — not only inter-sublattices but also intra-sublattices — into ac- 
count. Such a deficiency is circumvented by the efficient parameterization of the 
interaction parameters used for fitting the experimental data. 

In view of the complexity of practical alloys, the CALPHAD method is quite 
useful and recently (Kaufman et al. 2001; Turchi et al. 2005, 2007; Tokunaga et 
al. 2006) the combination of the first-principles electronic structure calculations 
with the CALPHAD type of description of the free energy has been actively
developed.

11
Ab-Initio Methods and Applications


Stefan Müller, Walter Wolf, and Raimund Podloucky


11.1 
Introduction 


The idea of an ab-initio approach is to calculate properties of (solid) matter
without the need for any empirical information. For this purpose two basic
requirements have to be fulfilled: (a) the interaction of atoms is described by the
fundamental laws of physics, and (b) the models and approximations one is forced to
make are reasonably accurate. Concerning point (a), equations are needed which
comprise the quantized interaction of electrons. These can be formulated
traditionally by the Schrödinger equation involving many-body wavefunctions. As
mentioned in Section 11.2.1 such equations are generally not applicable to solids,
because of the large number of interacting particles. On the other hand, the
electron density would be much better suited, provided one finds a manageable
formulation for the many-body part of the total energy of the system as a
functional of the density. For practical reasons, while remaining exact, density functional
theory is recast in terms of single-particle wavefunctions. Thus, equations are
derived which look like simplified Schrödinger equations but, of course, the
many-body complications are not lost. They are just reformulated in such a way that
requirement (b) can be fulfilled, namely that reasonable approximations can be
introduced giving useful results in a reasonable timeframe, provided the system
is not too large. In that sense, a true ab-initio concept is established. Of course,
approximations do not work well in all cases. One should, however, not speak of
failures of density functional theory itself (as is rather common), since the
shortcomings are due to the approximations to exchange-correlation interactions
introduced into the numerical solutions.

A variety of numerical techniques and codes are available. Very briefly, some of 
the most important methods are discussed. 

Because of their striking success, applications of density functional theory
(DFT) became very popular and penetrated the field of materials science. In this
chapter we will briefly describe DFT methods and give examples of very recent
investigations in which important materials properties were derived from DFT
concepts and calculations. This includes the ab-initio determination of vibrational
properties, which provides, apart from the vibrational spectrum itself, the most
important temperature-dependent thermodynamic properties as long as the
temperature is well below the melting point. DFT applications, strictly valid only for
T = 0 K, can thus be made temperature-dependent. Furthermore, elastic properties
can be derived in a quite straightforward way from the calculation of the total
energy or stress tensor as a function of suitable distortions.

Further exploitation of the total energy might lead to the derivation of struc- 
tural and phase stabilities, for which energy differences between suitable refer- 
ence systems have to be taken. This concept can be carried further for calculating 
defect properties involving a grand-canonical treatment based on total energies of 
supercells. Furthermore, migration of defects and impurities can be evaluated 
quantitatively in terms of temperature-dependent diffusion coefficients within 
transition state theory, making use of transition state barrier energies and
vibrational spectra. More complex planar faults can also be analyzed and, as an
application, impurity effects on grain boundary cohesion and intergranular fracture as
obtained from total energy calculations are discussed.

Going beyond consideration of total energies of long-range ordered solids only 
with fixed stoichiometry (apart from studying small variations as for the defect 
and impurity case), a much more general approach is pursued by the cluster ex- 
pansion (see Section 11.3.5). There, DFT is used to provide a database of energies 
for a series of atomic configurations (so-called clusters). Once a convergent ex- 
pansion in terms of these clusters is found, a wide area of materials problems 
can be studied quantitatively, such as the formation of precipitates as a function 
of temperature and time. To this end, the DFT database is used in Monte Carlo 
numerical simulations for probing the probability of site occupations. 

As is shown in this contribution, ab-initio methods have reached a mature state 
and are capable of operating in a robust manner within the context of other ad- 
vanced simulation techniques. This has widened the range of applicability to an 
unprecedentedly large variety of materials problems. In the present chapter, the
topics mentioned will be dealt with by discussing very recent examples and pub- 
lications, which — in most cases — are based on the work of the authors’ groups. 
The selection of examples is representative of similar studies currently being 
done within the computational materials science community. 


11.2 
Theoretical Background 


11.2.1 
Density Functional Theory 


The aim is to describe the interactions between electrons and atoms in a solid
system on the most fundamental level, namely by the basic principles of quantum
physics. Restricting the considerations to time-independent cases (i.e., to
stationary ground states described by the time-independent Hamilton operator
$H$), one has to solve the time-independent Schrödinger eigenvalue equation (1),
labeling the states by the (set of) quantum numbers $i$ related to suitable boundary
conditions, including also spin degrees of freedom.


$$ H \Psi_i = E_i \Psi_i \tag{1} $$


For our purpose and for simplicity, we consider only nonrelativistic systems for 
which the number of particles is conserved. Then, the spin part of the wavefunc- 
tions can be separated from the space-dependent functions. Furthermore, no 
temperature-driven excitations are considered, which means that all the following 
considerations are strictly valid only for T = 0 K. This, however, for most of the 
applications of DFT is not a severe restriction, because of the thermal properties 
of the (fermionic) electron gas. 

The wavefunction $\Psi_i$ is a function of 4N (spin and space) variables for the
system of N electrons, providing the energy eigenvalues $E_i$. Because electrons are
fermions, the wavefunction must be antisymmetric concerning the exchange of
indistinguishable particles $\alpha$ and $\beta$, i.e., when the sets of coordinates for particle
$\alpha$ and $\beta$ are exchanged [Eq. (2)].

$$ \Psi(x_1, \ldots, x_\alpha, \ldots, x_\beta, \ldots, x_N) = -\Psi(x_1, \ldots, x_\beta, \ldots, x_\alpha, \ldots, x_N) \tag{2} $$


The variable $x_\alpha = (\mathbf{r}, \sigma)_\alpha$ comprises space and spin coordinates for electron $\alpha$. A
direct consequence of Eq. (2) is the so-called exchange interaction as manifested
by Pauli's exclusion principle. This purely quantum statistical interaction is
accompanied by the so-called correlation, which stems from the Coulomb
repulsion $1/|\mathbf{r}_\alpha - \mathbf{r}_\beta|$ between electronic point charges, as it occurs in the
Hamilton operator. Both of these quantum many-body effects are summarized under the
name exchange-correlation. If one could solve Eq. (1) with acceptable precision
as it is, all the further considerations would be unnecessary. However, in practice
direct many-body wavefunction solutions to the Schrödinger equation (1) are
restricted to rather small molecules and are in general not applicable to solids
with their ~$10^{23}$ electrons.
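
The antisymmetry requirement of Eq. (2) can be made tangible with the simplest possible case, two electrons. The sketch below builds a two-electron wavefunction as a 2x2 determinant of spin-orbitals (a Slater determinant, a construction not introduced in the text above and used here only as an illustration); the one-dimensional orbitals and the encoding of the spin coordinate as +1 or -1 are arbitrary assumptions.

```python
import math

def chi_1(x):
    """Hypothetical spin-orbital: Gaussian in space, spin up."""
    r, s = x
    return math.exp(-r * r) * (1.0 if s == +1 else 0.0)

def chi_2(x):
    """Hypothetical spin-orbital: odd function in space, spin up."""
    r, s = x
    return r * math.exp(-r * r) * (1.0 if s == +1 else 0.0)

def psi(x1, x2):
    """Two-electron determinant (chi_1(x1) chi_2(x2) - chi_2(x1) chi_1(x2)) / sqrt(2)."""
    return (chi_1(x1) * chi_2(x2) - chi_2(x1) * chi_1(x2)) / math.sqrt(2.0)

xa, xb = (0.3, +1), (-0.8, +1)  # (space, spin) coordinates of the two electrons
print(psi(xa, xb), psi(xb, xa))  # equal magnitude, opposite sign
print(psi(xa, xa))               # vanishes: two electrons cannot share all coordinates
```

The vanishing of the wavefunction for identical coordinates is the exclusion principle mentioned above, obtained here purely from the sign change under exchange.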

The vast majority of ab-initio applications for deriving ground-state properties 
of solids is based on DFT as derived by Hohenberg and Kohn (1964) and Kohn 
and Sham (1965). In the fundamental work of Hohenberg and Kohn the central 
role of the wavefunction is replaced by the electron density, which is assumed to 
determine uniquely the total energy of the ground state $E[\rho]$ of the electrons.
Given the so-called external potential $v_{\mathrm{ext}}$ as generated by the nuclei of the system,
the total energy is decomposed as in Eq. (3).

$$ E[\rho] = \int v_{\mathrm{ext}}(\mathbf{r}) \, \rho(\mathbf{r}) \, d\mathbf{r} + F[\rho] \tag{3} $$

The functional $F[\rho]$ is universal because it depends only on the electron density
and not on the external potential. Setting aside the problems of the mathematical
existence of this postulated functional, it can be shown that such an ansatz exists
for reasonable electron densities $\rho$. The functional $F$ is a sum of the
kinetic energy of the electrons $T$, the classical Coulomb interaction between
electronic charge densities $E_{\mathrm{Coul}}$, and the remaining term $E_{\mathrm{xc}}$ which comprises all the
complications of the quantum many-body interactions [Eq. (4)].

$$ F[\rho] = T[\rho] + E_{\mathrm{Coul}}[\rho] + E_{\mathrm{xc}}[\rho] \tag{4} $$


This very general formulation in combination with the variational principle re- 
sults in a very elegant relationship involving only the electron density. It is, how- 
ever, not (yet) suitable for reliable calculations because it is not accurate enough. 
A major step toward practical applications of DFT was made by Kohn and Sham
(1965) by introducing a scheme of auxiliary wavefunctions which have the
properties of single-particle wavefunctions (or orbitals). For that purpose, the kinetic
energy functional $T$ is split into a single-particle term $T_s$ and the remainder, which
is supposed to be included in the many-body exchange-correlation functional.
Although this functional is not known, the rearrangement of terms presumably
leads to cancellation of large errors when approximations are introduced.

The sole purpose of the Kohn-Sham orbitals $\phi_i$ is to build up the true
ground-state electron density by summing over a suitable number of states [Eq.
(5)], defining the total number of electrons N in the system.

$$ \rho(\mathbf{r}) = \sum_{i=1}^{N} |\phi_i(\mathbf{r})|^2 \tag{5} $$


From this concept of Kohn-Sham orbitals a stationary Schrödinger-like
equation of the form of Eq. (6) for the ith orbital is derived. The Hamilton operator is
built up as in Eq. (7).

$$ H \phi_i = \epsilon_i \phi_i \tag{6} $$

$$ H := -\frac{\hbar^2}{2m} \Delta + v_{\mathrm{eff}}(x), \qquad v_{\mathrm{eff}} = v_{\mathrm{ext}} + v_{\mathrm{Coul}} + v_{\mathrm{xc}} \tag{7} $$


The effective potential $v_{\mathrm{eff}}$ is the sum of the external potential $v_{\mathrm{ext}}$, the classical
Coulomb interactions $v_{\mathrm{Coul}}$, and a term $v_{\mathrm{xc}}$ which contains all the many-body or
exchange-correlation interactions. Like the functional $F[\rho]$, the potential $v_{\mathrm{xc}}[\rho]$
(being the functional derivative $\delta E_{\mathrm{xc}} / \delta \rho$) is universal and depends only on the
electron density $\rho$. The potentials generally depend on the spin state (in the
simplest form by spin polarization of the electronic states into majority and minority
spin components), which enables treatment of systems with spontaneous
magnetic ordering. The basic and most fundamental results of an application of
DFT are the ground-state electron density $\rho$ and its corresponding total energy $E[\rho]$.
Other important quantities are derived from the dependence of the total energy on
physical parameters. For example, the force acting on an atom with position $\mathbf{r}_i$ is
determined by the derivative $\partial E / \partial \mathbf{r}_i$.

The quality of the results of actual calculations for real systems then depends
on the quality of the approximation one has to make for $v_{\mathrm{xc}}[\rho]$. The first and
surprisingly successful approximations were based on the properties of the
homogeneous electron gas. Locally, the value of the true electron density was assumed to be
the density of the interacting homogeneous electron gas, for which the
exchange-correlation energy can be determined. Consequently, the exchange-correlation
functional is constructed as in Eq. (8), in which $\epsilon_{\mathrm{xc}}$ is the (local)
exchange-correlation energy per particle of the homogeneous electron gas of density $\rho$.
For obvious reasons, this ansatz is called the local density approximation (LDA) (e.g., Vosko et
al. 1980).

$$ E^{\mathrm{LDA}}_{\mathrm{xc}}[\rho] = \int \rho(\mathbf{r}) \, \epsilon_{\mathrm{xc}}(\rho) \, d\mathbf{r} \tag{8} $$
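
As a small numerical illustration of Eq. (8), the sketch below evaluates the exchange part only of the local density approximation for the hydrogen-atom ground-state density. Several choices are assumptions made here and not taken from the text: Hartree atomic units, the spin-unpolarized Dirac exchange expression for the homogeneous electron gas, omission of the correlation part (for which parametrizations such as that of Vosko et al. are used in practice), and a simple trapezoidal radial integration.

```python
import math

C_X = 0.75 * (3.0 / math.pi) ** (1.0 / 3.0)  # Dirac exchange constant, Hartree atomic units

def eps_x(rho):
    """Exchange energy per particle of the spin-unpolarized homogeneous electron gas."""
    return -C_X * rho ** (1.0 / 3.0)

def hydrogen_density(r):
    """Ground-state electron density of the hydrogen atom, rho(r) = exp(-2r) / pi."""
    return math.exp(-2.0 * r) / math.pi

def radial_integral(f, r_max=30.0, n=30000):
    """Trapezoidal estimate of the integral of f(r) * 4*pi*r^2 from 0 to r_max."""
    h = r_max / n
    total = 0.0
    for i in range(n + 1):
        r = i * h
        weight = 0.5 if i in (0, n) else 1.0
        total += weight * f(r) * 4.0 * math.pi * r * r
    return total * h

def lda_exchange_energy(density):
    """Exchange-only version of Eq. (8): integral of rho(r) * eps_x(rho(r))."""
    return radial_integral(lambda r: density(r) * eps_x(density(r)))

print(radial_integral(hydrogen_density))      # electron count, close to 1
print(lda_exchange_energy(hydrogen_density))  # about -0.213 Hartree
```

The exact exchange energy of the hydrogen atom is -0.3125 Hartree, so the spin-unpolarized local approximation recovers only about two thirds of it for this strongly inhomogeneous one-electron density.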


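Taking the functional derivative with respect to $\rho$, the LDA version of the
Kohn-Sham equation is derived as Eq. (9).

$$ \left( -\frac{\hbar^2}{2m} \Delta + v_{\mathrm{ext}}(\mathbf{r}) + \int \frac{\rho(\mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|} \, d\mathbf{r}' + v^{\mathrm{LDA}}_{\mathrm{xc}}(\mathbf{r}) \right) \phi_i = \epsilon_i \phi_i \tag{9} $$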


A variety of different LDAs exist which basically give rather similar results. 

As is now well known, LDAs lead to serious errors for systems with valence
electrons of a more localized nature. For example, the ground-state crystal
structure and magnetic state for Fe are wrongly predicted by LDA calculations. Since
the beginning of the 1990s extensive work has gone into the systematic
improvement of LDAs, resulting in the concepts of the (semi-local) gradient
approximations, including the gradient of the electron density $\nabla \rho$ (e.g., Perdew and Wang
1992). The potential $v^{\mathrm{LDA}}_{\mathrm{xc}}$ in the Kohn-Sham equations has then to be replaced
by the corresponding potential $v^{\mathrm{GGA}}_{\mathrm{xc}}$ of the so-called generalized gradient
approximation (GGA). Similarly to the development of LDA potentials, a variety of GGA
formulations exist, which might lead to significantly different results when
treating rather inhomogeneous systems, e.g., calculating adsorption energies and
migration barriers of molecules on surfaces. Fortunately, for most of the bulk
studies, the different GGA potentials yield the same results, which in general are in
better agreement with experimental facts, as demonstrated for Fe in Fig. 11.1.

Figure 11.1 clearly shows that LDA (right panel) would predict the wrong
ground-state structure (namely fcc) with an equilibrium volume which is far too
small. On the other hand, GGA yields the correct bcc ground state and a
reasonable volume when compared to experiment.

Fig. 11.1 Ab-initio total energies for Fe vs. atomic volume (in Å³). The
energy zero is defined by the minimum of energy of the ferromagnetic
calculation for the bcc structure. NM: not spin-polarized, FM:
ferromagnetic calculation. Left panel: GGA results, right panel: LDA
results. Calculations were made by the FLAPW code flair (as discussed
in Section 11.2.2).


Although applying GGA achieves considerable improvement in many cases in 
which LDA is just not sufficiently accurate, for some specific systems the many- 
body interactions need a more sophisticated treatment (for example, for highly 
correlated systems with strongly localized electronic states). At present, strong ef- 
forts are being made for an improved handling of the many-body interactions by 
partially going back to wavefunction-like concepts beyond the Kohn-Sham de- 
scription. A very recent survey of some elaborate methods is given in a study on 
bulk MnO (Franchini et al. 2005). These highly theoretical considerations may 
have a strong impact on the energetics and hence also on the physical properties 
of alloys and metals, for example when Mn is involved (Chen et al. 2005b). 


11.2.2 
Computational Methods 


The goal of performing ab-initio calculations is to work with a computational
machinery which, without any empirical parameters, provides quantitatively
useful numbers. This can be achieved by making use of density functional theory
and proper numerical/technical implementations of sufficient methodological
and numerical accuracy. As discussed in Section 11.2.1, within the Kohn-Sham
formulation of density functional theory Schrödinger-like equations have to be
solved for single-particle wavefunctions $\psi_i(x)$. If periodic boundary conditions
are chosen, which is the standard in solid state physics, the so-called Bloch
states are labeled in terms of the wave vector $\mathbf{k}$ of the first Brillouin zone and
the band index $\nu$. Bloch states fulfill the boundary condition given by Eq. (10)
under a translation by a real lattice vector $\mathbf{R}$.

$$ \psi^{\nu}_{\mathbf{k}}(x + \mathbf{R}) = e^{i \mathbf{k} \cdot \mathbf{R}} \, \psi^{\nu}_{\mathbf{k}}(x) \tag{10} $$
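
The boundary condition of Eq. (10) can be checked numerically for a one-dimensional toy state of the form plane wave times lattice-periodic function. The lattice constant, the particular periodic function `u`, and the chosen wave vector are arbitrary assumptions for illustration.

```python
import cmath
import math

A = 1.0  # lattice constant in arbitrary units (hypothetical)

def u(x):
    """Lattice-periodic part, u(x + A) = u(x)."""
    return 1.0 + 0.3 * math.cos(2.0 * math.pi * x / A)

def bloch_state(k, x):
    """One-dimensional Bloch-type state exp(i k x) * u(x)."""
    return cmath.exp(1j * k * x) * u(x)

k = 0.7 * math.pi / A  # a wave vector inside the first Brillouin zone
R = 3 * A              # a lattice translation
x = 0.37
lhs = bloch_state(k, x + R)
rhs = cmath.exp(1j * k * R) * bloch_state(k, x)
print(abs(lhs - rhs))  # difference at the level of rounding errors
```

The translation changes the state only by the phase factor exp(i k R), so the electron density, which depends on the squared modulus, keeps the periodicity of the lattice.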


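As is common for numerical applications, the wavefunctions are composed of linear
combinations of suitable basis functions $\Phi_{\mathbf{k},i}$ with the coefficients $c^{\nu}_{\mathbf{k},i}$ [Eq.
(11)].

$$ \psi^{\nu}_{\mathbf{k}}(x) = \sum_{i=1}^{N} c^{\nu}_{\mathbf{k},i} \, \Phi_{\mathbf{k},i}(x) \tag{11} $$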


The summation runs over a sufficiently large number of basis functions N so that
numerical convergence is reached for all important physical properties, such as
the electron density and the total energy. The total energy functional formed by
the translationally invariant Hamiltonian $\hat{H}$ describing all the interactions
between atoms and electrons in the unit cell, Eq. (12),

$$\langle \psi_{\mathbf{k}}^{\nu}(\mathbf{x}) | \hat{H} | \psi_{\mathbf{k}}^{\nu}(\mathbf{x}) \rangle \tag{12}$$


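is minimized with respect to the variational parameters $c_{\mathbf{k},i}^{\nu}$ under
the normalization condition Eq. (13).

$$\langle \psi_{\mathbf{k}}^{\nu}(\mathbf{x}) | \psi_{\mathbf{k}}^{\nu}(\mathbf{x}) \rangle = 1 \tag{13}$$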


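Following this variational procedure, the matrix eigenvalue equation (14) is
derived, in which $H_{ij} = \langle \Phi_{\mathbf{k},i} | \hat{H} | \Phi_{\mathbf{k},j} \rangle$ is the
matrix element of the Hamiltonian and $S_{ij} = \langle \Phi_{\mathbf{k},i} | \Phi_{\mathbf{k},j} \rangle$
denotes the overlap matrix element formed by the basis functions $\Phi_{\mathbf{k},j}$.

$$\sum_{j} H_{ij}\, c_{\mathbf{k},j}^{\nu} = \varepsilon_{\mathbf{k}}^{\nu} \sum_{j} S_{ij}\, c_{\mathbf{k},j}^{\nu} \tag{14}$$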


For nonorthogonal basis functions the matrix S is different from the unit matrix.
The energies $\varepsilon_{\mathbf{k}}^{\nu}$ are the eigenvalues and the coefficients
$c_{\mathbf{k},j}^{\nu}$ are the components of the eigenvectors, which have to be
calculated. Depending on the size of the system (i.e., the number of atoms per unit
cell) and the chosen numerical methods, the dimensions of the matrices $H_{ij}$ and
$S_{ij}$ can be quite large (e.g., of the order of $10^4$ for about 100 atoms in a
plane-wave based approach; see later). This means that, for many applications, most
of the computational effort goes into the diagonalization of such large matrices.
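
To make the structure of Eq. (14) concrete, the following minimal sketch solves the
generalized eigenvalue problem for a two-function nonorthogonal basis by finding the
roots of det(H - eps S) = 0. The function name and the numerical matrix elements are
illustrative assumptions and do not come from any of the codes discussed in this
chapter; production codes use iterative or library diagonalizers for matrices of
realistic size.

```python
import math


def solve_generalized_2x2(H, S):
    """Solve H c = eps S c for real symmetric 2x2 H and positive-definite S.

    Returns (eigenvalues, eigenvectors); each eigenvector is normalized
    so that c^T S c = 1, the matrix form of the condition in Eq. (13).
    """
    (h11, h12), (_, h22) = H
    (s11, s12), (_, s22) = S
    # det(H - eps S) = a*eps^2 + b*eps + c = 0
    a = s11 * s22 - s12 ** 2
    b = -(h11 * s22 + h22 * s11 - 2.0 * h12 * s12)
    c = h11 * h22 - h12 ** 2
    disc = math.sqrt(b * b - 4.0 * a * c)
    eigenvalues = sorted([(-b - disc) / (2.0 * a), (-b + disc) / (2.0 * a)])

    eigenvectors = []
    for eps in eigenvalues:
        # First row of (H - eps S) c = 0; fall back to the second row if it vanishes.
        r1, r2 = h11 - eps * s11, h12 - eps * s12
        if abs(r1) + abs(r2) < 1e-12:
            r1, r2 = h12 - eps * s12, h22 - eps * s22
        c1, c2 = -r2, r1
        norm = math.sqrt(c1 * c1 * s11 + 2.0 * c1 * c2 * s12 + c2 * c2 * s22)
        eigenvectors.append((c1 / norm, c2 / norm))
    return eigenvalues, eigenvectors


# Hypothetical matrix elements (arbitrary energy units) for two overlapping functions.
H = [[-1.0, -0.5], [-0.5, -1.0]]
S = [[1.0, 0.2], [0.2, 1.0]]
energies, vectors = solve_generalized_2x2(H, S)
print(energies)
```

For this symmetric toy case the lower eigenvalue belongs to the in-phase combination
of the two basis functions and the upper one to the out-of-phase combination.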

Once Eq. (14) has been solved, the corresponding electron density is constructed
by summing over the squares of the Kohn-Sham orbitals. Because the density must
already be known for building the Hamiltonian (in order to construct the
potentials), a self-consistency problem has to be solved. Achieving
self-consistency can be rather cumbersome, in particular for large systems with
many electronic and geometric degrees of freedom. Therefore, elaborate schemes
of mixing old and new densities and optimizing geometries have to be applied in
the computer codes.
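
The role of density mixing can be illustrated with a deliberately simple sketch.
Here the whole cycle "build the potential from the density, solve Eq. (14), rebuild
the density" is replaced by a hypothetical scalar map `toy_update`; the map, the
function names, and the mixing parameter `alpha` are assumptions made purely for
illustration. Real codes mix full density arrays and typically use more elaborate
schemes than plain linear mixing.

```python
def scf_loop(update, n_start, alpha, tol=1e-10, max_iter=200):
    """Fixed-point iteration n = update(n) with linear mixing.

    Returns (density, iterations, converged).
    alpha = 1 corresponds to no mixing (the new density replaces the old one).
    """
    n_in = n_start
    for iteration in range(1, max_iter + 1):
        n_out = update(n_in)
        if abs(n_out - n_in) < tol:
            return n_out, iteration, True
        # Linear mixing of old and new density.
        n_in = (1.0 - alpha) * n_in + alpha * n_out
    return n_in, max_iter, False


def toy_update(n):
    """Hypothetical stand-in for one Kohn-Sham cycle; its fixed point is n = 1."""
    return 2.5 - 1.5 * n


plain = scf_loop(toy_update, n_start=0.5, alpha=1.0)
mixed = scf_loop(toy_update, n_start=0.5, alpha=0.5)
print("no mixing converged:", plain[2])
print("mixing converged:   ", mixed[2], "after", mixed[1], "iterations")
```

In this toy case the unmixed iteration overshoots more strongly at every step and
never settles, whereas mixing half of the new density into the old one damps the
oscillation and reaches the fixed point.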

The most accurate and successful numerical methods are able to solve the
Kohn-Sham equations without any further modeling restrictions such as
assumptions about the geometrical shape of the electron density and potential. In
the following, we will give a very short description of pseudopotential (PP)
methods, full-potential linearized augmented plane-wave (FLAPW) applications, and
full-potential linearized muffin-tin orbital (FLMTO) approaches. These methods are
widely used in the community. For a more detailed insight the reader is referred
to the book by Martin (2004). Of course, the fundamental many-body
electron-electron interactions have to be approximated, as described in Section
11.2.1, but these approximations should be independent of the numerical technique
applied.

A natural choice of basis functions for describing Bloch functions are plane
waves $e^{i(\mathbf{k}+\mathbf{G}_i)\cdot\mathbf{x}}$, which are labeled by the reciprocal
lattice vectors $\mathbf{G}_i$. This kind of strategy is pursued in the PP approaches.
The basic problem of a pure plane-wave approach is the rapid change of the real
potential and wavefunction near the nuclei. A plain Fourier series of these
quantities converges far too slowly to be practical. Therefore, ways had to be
invented to overcome this problem, which is done by "pseudizing" the potential:
not the bare Coulomb potential of the atomic nuclei but a potential screened by
the inner-shell electronic states is introduced. Furthermore, following the
argument that for the bonding only the wavefunction in the region between the
atoms needs to be accurate, pseudopotential schemes and packages were developed.
The latest development consists in working with the so-called projector augmented
wave potentials (Blöchl 1994), by which the correct nodal structure of the
wavefunctions is reintroduced while most of the efficiency of a plane-wave basis
is maintained. Within this refined pseudopotential treatment there is no longer
any limitation on treating transition metal elements and localized states. One of
the most powerful modern pseudopotential packages is the Vienna ab-initio
simulation package (VASP),¹⁾ by which a variety of ground-state properties,
including full relaxation of atomic positions and the cell shape, can be derived
and ab-initio molecular dynamics features can also be utilized.

Close to an atomic nucleus, linear combinations of atomic-like functions
centered at the nuclear position $\mathbf{r}_i$ are a more natural choice for basis
functions, such as those given in Eq. (15) with radial functions $R_l$ and spherical
harmonics $Y_{lm}$ labeled by the orbital quantum numbers $l$, $m$.

$$\Phi_{\mathrm{at}}(\mathbf{x}) = \sum_{l}\sum_{m} A_{lm}\, R_l(|\mathbf{x}-\mathbf{r}_i|, \varepsilon)\, Y_{lm}(\widehat{\mathbf{x}-\mathbf{r}_i}) \tag{15}$$
l m 


The coefficients $A_{lm}$ are undetermined as yet. Therefore, a combination of such
atomic-like functions with plane waves in the regions between the nuclei might
be a reasonable set of basis functions, as utilized by the FLAPW method. The key
point is the division of space into (a) nonoverlapping spheres (so-called atomic
or muffin-tin spheres) centered at the nuclei and (b) an interstitial region in
between the spheres. In both regions, the natural choice of basis functions is
made, as described above. In order to construct suitable basis functions for the
whole space it is required that plane waves and atomic-like functions match at
the sphere boundaries, which is achieved by an appropriate definition of the
coefficients $A_{lm}$. The matching condition transfers the Bloch property of the
plane wave onto the atomic-like function; the overall basis function then has the
correct translational symmetry. By this construction, the plane-wave basis has
been augmented by the functions inside the spheres. In the late 1970s, the
historical assumptions of spherically symmetric charge densities and of potentials
being constant in the interstitial region were overcome when the full-potential
FLAPW approach was presented (Wimmer et al. 1981). The FLAPW method is considered
to be one of the most precise concepts in the field. However, this precision comes
at a computational price in comparison to PP methods: because of the matching and
the more complicated potential matrix elements inside the spheres, the FLAPW
method is slower than a pseudopotential approach such as VASP. As examples of
existing FLAPW codes we mention the WIEN2k code²⁾ and the flair code,³⁾ both of
which are based on the developments at the end of the 1970s mentioned above.


1) http://cms.mpi.univie.ac.at/vasp/

The Bloch property of the above two approaches was mediated by plane waves
involving the reciprocal lattice. However, Bloch functions might also be
constructed in another way by summing over the real lattice $\mathbf{R}$, in terms of
linear combinations of atomic-like functions $\phi_{lm}$ centered at the atom with
position $\mathbf{R}$ [Eq. (16)].

$$\psi_{\mathbf{k}}^{\nu}(\mathbf{x}) = \sum_{l}\sum_{m} c_{\mathbf{k},lm}^{\nu} \sum_{\mathbf{R}} e^{i\mathbf{k}\cdot\mathbf{R}}\,\phi_{lm}(\mathbf{x}-\mathbf{R}, \varepsilon) \tag{16}$$

The problem consists in the proper treatment of the tails, which overlap and
therefore carry the bonding properties. Historically, such a concept was applied
in a variety of approaches, in particular for space-filling (and therefore
overlapping) atomic spheres in terms of the so-called muffin-tin orbital method. A
suitable treatment of the tails of these functions (apart from other technical
details) originated in the derivation of the FLMTO method (Methfessel 1988;
Methfessel et al. 1989). An FLMTO code is available at the address given.⁴⁾

The three concepts described above solve the Kohn—Sham equations (i.e., DFT 
in its — at present — most useful formulation) in a quantitatively precise way. 
There are numerous other approaches which also make use of approximations 
to the potential and density for describing systems with a large number of atoms. 


2) http://www.wien2k.at/ 
3) http://www.uwm.edu/~weinert/flair.html
4) http://www.fplo.de 
11.2.3
Elastic Properties 


Elastic constants and elastic moduli are fundamental materials data providing de- 
tailed information on the mechanical properties of materials. The knowledge of 
these data may enable prediction of mechanical behavior in many different situa- 
tions. Single-crystal elastic constants can be computed reliably from first princi- 
ples with an accuracy that rivals experimental methods. For polycrystalline mate- 
rials important quantities such as Young’s moduli and shear moduli are then 
accessible from computed single-crystal elastic constants by suitable averaging 
schemes. Compared to the rather demanding experiments the ab-initio evalua- 
tion of elastic data can be automated and is extremely fast. 

The calculation of elastic data from first principles is based on the general lin- 
ear relationship connecting stress to strain as provided by the generalized 
Hooke’s law (Einstein notation), Eq. (17). 


$$\sigma_{mn} = C_{mnpr}\,\varepsilon_{pr} \tag{17}$$


Herein $\sigma_{mn}$ denotes the stress tensor and $\varepsilon_{pr}$ the strain tensor,
and the elements of the fourth-order tensor $C_{mnpr}$ are the so-called elastic
constants. Whereas $\sigma_{mn}$ and $\varepsilon_{pr}$ are symmetric and therefore have
six independent elements, the number of 81 elastic constants is reduced by symmetry
arguments to a total of 21. The elastic energy density U, which is defined as the
total energy per volume, is obtained from the stress tensor (force per unit area)
by integration of Hooke's law to give Eq. (18).

$$U = \frac{E}{V} = \frac{1}{2}\, C_{mnpr}\,\varepsilon_{mn}\,\varepsilon_{pr} \tag{18}$$


Introducing the convenient matrix-vector notation, where the six independent
elements of stress and strain are represented as vectors (denoted here as
$\sigma_i$ and $\varepsilon_j$, where i and j run from 1 to 6 in the sequence xx, yy,
zz, yz, xz, xy) and the fourth-order tensor $C_{mnpr}$ is rewritten as a 6 x 6 matrix
$c_{ij}$, these basic relationships can be simplified to Eqs. (19) (Voigt notation).

$$\sigma_i = c_{ij}\,\varepsilon_j, \qquad U = \frac{E}{V} = \frac{1}{2}\, c_{ij}\,\varepsilon_i\,\varepsilon_j \tag{19}$$
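
As a small illustration of Eqs. (19), the sketch below builds the 6 x 6 stiffness
matrix of a cubic crystal and evaluates stress and elastic energy density for a
given strain. All function names are illustrative. One assumption is made that the
text does not spell out: the shear entries of the strain vector are taken as
engineering strains (twice the tensor components), which is the usual convention
that makes the Voigt form of Eq. (19) reproduce the tensor form of Eq. (18). The
numerical constants are the first experimental Al column of Table 11.2.

```python
def strain_to_voigt(eps):
    """Map a symmetric 3x3 strain tensor to a 6-vector (xx, yy, zz, yz, xz, xy).

    Assumption: shear components are doubled (engineering strains).
    """
    return [eps[0][0], eps[1][1], eps[2][2],
            2.0 * eps[1][2], 2.0 * eps[0][2], 2.0 * eps[0][1]]


def cubic_stiffness(c11, c12, c44):
    """6x6 stiffness matrix of a cubic crystal (three independent constants)."""
    c = [[0.0] * 6 for _ in range(6)]
    for i in range(3):
        for j in range(3):
            c[i][j] = c11 if i == j else c12
        c[i + 3][i + 3] = c44
    return c


def stress_voigt(c, e):
    """sigma_i = sum_j c_ij e_j, first relation of Eq. (19)."""
    return [sum(c[i][j] * e[j] for j in range(6)) for i in range(6)]


def energy_density(c, e):
    """U = 1/2 sum_ij c_ij e_i e_j, second relation of Eq. (19)."""
    return 0.5 * sum(c[i][j] * e[i] * e[j] for i in range(6) for j in range(6))


c_al = cubic_stiffness(108.2, 61.3, 28.5)          # GPa
uniaxial = strain_to_voigt([[0.01, 0.0, 0.0],
                            [0.0, 0.0, 0.0],
                            [0.0, 0.0, 0.0]])
print(stress_voigt(c_al, uniaxial))    # stresses in GPa
print(energy_density(c_al, uniaxial))  # energy density in GPa (= GJ/m^3)
```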


Taking into account additional symmetry arguments imposed by the crystal
lattice, the number of elastic constants decreases further. The total number of
independent elastic constants for all crystal systems is summarized in Table 11.1.

Table 11.1 The number of independent elastic constants for different
crystal systems and point groups (from Le Page and Saxe 2001).

Crystal system (point group)                                   Number of independent
                                                               elastic constants

Triclinic                                                      21
Monoclinic                                                     13
Orthorhombic                                                    9
Tetragonal (4, $\bar{4}$, 4/m)                                  7
Tetragonal (422, 4mm, $\bar{4}2m$, 4/mmm)                       6
Hexagonal and rhombohedral (3, $\bar{3}$)                       7
Hexagonal and rhombohedral (32, 3m, $\bar{3}2/m$)               6
Hexagonal (6, $\bar{6}$, 6/m, 622, 6mm, $\bar{6}2m$, 6/mmm)     5
Cubic                                                           3

There are two direct ways of computing single-crystal elastic constants from
ab-initio methods: they may be called the energy-strain approach and the
stress-strain approach. The energy-strain approach is based on the computed total
energies of appropriately selected strained states of the crystal. The strains are
chosen so that the corresponding stiffness values can be extracted while preserving
as much symmetry as possible. For each strain type several magnitudes of strain are
applied and the corresponding total energies are computed with an ab-initio
approach. The stiffness is then derived from the curvature of the energy-strain
parabola by means of least-squares fits making use of Eq. (18). Some of the
imposed strains may be related to a single elastic constant while others are
described by a linear combination of elastic constants, from which the elastic
constant tensor is finally evaluated. The stress-strain approach, on the other
hand, relies on the ability of the ab-initio approach to calculate the stress
tensor directly, a capability that by no means all available ab-initio methods
offer. Once the stress tensor can be computed from first principles, the elastic
constants matrix can be derived directly from the generalized Hooke's law of
Eq. (17). Whereas within the energy-strain approach several magnitudes of strain
need to be evaluated to probe a particular strain type from a parabolic fit to the
energies, within the stress-strain approach just one evaluation is in principle
sufficient to obtain the same information. However, to ensure high accuracy three
strain values have been applied for all systems calculated here. Both approaches
have been implemented in a symmetry-general form;⁵⁾ the underlying concepts are
discussed in detail by Le Page and Saxe (2001) for the energy-strain approach and
by Le Page and Saxe (2002) for the stress-strain approach. As a further
alternative, elastic constants can be derived by specialized ab-initio codes within
density functional perturbation theory (see Hamann et al. 2005).
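
The energy-strain approach can be sketched as follows. Energy densities for several
strain magnitudes of one strain type are fitted to a parabola by least squares, and
the stiffness is read off from the curvature via Eq. (18). The energies used here
are synthetic values generated from an assumed stiffness, not ab-initio results,
and the function names are illustrative; this is not the implementation of Le Page
and Saxe.

```python
def fit_parabola(x, y):
    """Least-squares fit y = a0 + a1*x + a2*x**2; returns [a0, a1, a2]."""
    scale = max(abs(xi) for xi in x)          # rescale strains for conditioning
    xs = [xi / scale for xi in x]
    s = [sum(xi ** p for xi in xs) for p in range(5)]
    t = [sum(yi * xi ** p for xi, yi in zip(xs, y)) for p in range(3)]
    a = [[s[i + j] for j in range(3)] + [t[i]] for i in range(3)]
    # Gaussian elimination with partial pivoting on the 3x3 normal equations.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for row in range(col + 1, 3):
            factor = a[row][col] / a[col][col]
            for k in range(col, 4):
                a[row][k] -= factor * a[col][k]
    coeffs = [0.0, 0.0, 0.0]
    for i in (2, 1, 0):
        tail = sum(a[i][j] * coeffs[j] for j in range(i + 1, 3))
        coeffs[i] = (a[i][3] - tail) / a[i][i]
    return [coeffs[p] / scale ** p for p in range(3)]


def stiffness_from_energy_strain(strains, energy_densities):
    """Curvature of U(strain): U = U0 + 1/2 * C * strain**2, so C = 2*a2."""
    return 2.0 * fit_parabola(strains, energy_densities)[2]


# Synthetic data: assumed stiffness of 108.2 GPa and an arbitrary energy offset.
strains = [-0.010, -0.005, 0.0, 0.005, 0.010]
energies = [-3.0 + 0.5 * 108.2 * d ** 2 for d in strains]   # GPa = GJ/m^3
print(stiffness_from_energy_strain(strains, energies))
```

In the stress-strain approach the same constant would follow from a single strained
state as the ratio of computed stress to applied strain, which is why fewer
calculations are needed there.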


5) MedeA MT, Materials Design Inc., Angel Fire (NM), USA, http://www.materialsdesign.com 
Table 11.2 Elastic constants $c_{ij}$ (in GPa) of metals and intermetallic
compounds from experiment and ab-initio calculations.

       Ti                            Al                            TiAl                 NiAl
       exp.[a]  exp.[b]  calc.[c]    exp.[d]  exp.[e]  calc.[c]    exp.[f]  calc.[c]    exp.[g]  calc.[c]

c11    162.4    171.6    176.2       108.2    114.30   110.2       186      193.8       204.6    202.9
c12     92.0     86.6     90.5        61.3     61.92    54.8        72       68.5       135.4    140.3
c13     69.4     72.6     69.6        -        -        -           74       85.1       -        -
c33    180.7    190.6    188.2        -        -        -          176      170.8       -        -
c44     46.7     41.1     39.6        28.5     31.62    30.4       101      111.5       116.8    112.6
c66     -        -        -           -        -        -           77       46.6       -        -

[a] Allard 1969.
[b] Ikehata et al. 2004.
[c] X.-Q. Chen, unpublished.
[d] Huntington 1958.
[e] Kamm and Alers 1964.
[f] He et al. 1995.
[g] Davenport et al. 1999.


By macroscopic averaging, elastic moduli of polycrystalline materials can also 
be derived from the computed single-crystal elastic constants. There are several 
averaging procedures available to derive the elastic moduli of a quasi-isotropic 
polycrystalline material from its single-crystal elastic constants. The averaging 
runs over all possible orientations of the crystal, and there is a well-defined lower 
and upper limit for the elastic moduli. Based on the averaging procedures, the
ab-initio treatment for single crystals can be extended to polycrystalline samples.
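
One common choice of averaging procedure, given here only as a hedged sketch, uses
the Voigt (uniform strain) and Reuss (uniform stress) averages as upper and lower
limits and takes their arithmetic mean (Hill average). The text does not say which
scheme was used for the results in this chapter, so this selection is an
assumption. The closed-form expressions below hold for cubic crystals only; the
input values are the first experimental Al column of Table 11.2.

```python
def cubic_polycrystal_moduli(c11, c12, c44):
    """Voigt-Reuss-Hill estimates for a quasi-isotropic cubic polycrystal."""
    bulk = (c11 + 2.0 * c12) / 3.0              # identical in both limits for cubic
    g_voigt = (c11 - c12 + 3.0 * c44) / 5.0     # upper limit of the shear modulus
    g_reuss = 5.0 * (c11 - c12) * c44 / (4.0 * c44 + 3.0 * (c11 - c12))  # lower limit
    g_hill = 0.5 * (g_voigt + g_reuss)
    young = 9.0 * bulk * g_hill / (3.0 * bulk + g_hill)
    return {"B": bulk, "G_Voigt": g_voigt, "G_Reuss": g_reuss,
            "G_Hill": g_hill, "E": young}


moduli = cubic_polycrystal_moduli(108.2, 61.3, 28.5)   # GPa
for name, value in moduli.items():
    print(f"{name:8s} {value:7.2f} GPa")
```

For a nearly isotropic crystal such as Al the two limits lie close together, so the
choice of averaging scheme matters little; for strongly anisotropic crystals the
gap between the limits is a useful measure of the uncertainty of the average.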

In order to demonstrate the capability of calculating elastic properties from first 
principles without relying on any empirical input data, experimental and com- 
puted single-crystal elastic constants for simple metals and intermetallic com- 
pounds are listed in Table 11.2. The selected materials are of strategic importance 
for high-temperature, lightweight applications. In the case of TiAl it is noted
that the experimental sample has a stoichiometry of Ti44Al56, whereas the
calculation was performed for exact stoichiometry. Typically, the computed data
are well within the experimental error bars. As further examples, computed and
experimental elastic data of the rather complex minerals quartz and schorl are
compared in Table 11.3. Quartz is used technologically in clocks because of its
piezoelectric properties, and in addition plays a key role in semiconductor devices
as gate oxide. Its elastic properties are therefore well known. Schorl has a
complex tourmaline structure of stoichiometry NaFe3B3Al6Si6O27(OH)3F with 53 atoms
per unit cell. Summarizing, computed elastic constants are very reliable and are
typically within the errors of experimental determinations.
Table 11.3 Elastic constants $c_{ij}$ (in GPa) of quartz and schorl from
experiment and ab-initio calculations.

       Quartz                  Schorl
       exp.[a]   calc.[b]      exp.[c]   calc.[d]

c11     86.5      81.5         306       322
c12      6.7       2.0         109       109
c13     11.7       9.6          53        56
c14    −18.0     −19.7           8       −16
c33    105.7      97.9         174       182
c44     58.2      63.3          65        62

[a] Every and McCurdy 1992.
[b] Le Page et al. 2002.
[c] Helme and King 1978.
[d] P. Saxe, unpublished.


11.2.4 
Vibrational Properties 


A detailed knowledge of lattice vibrations is critical for the understanding and
quantitative prediction of a wide variety of physical properties of solids. The
interpretation of optical, infrared, Raman, and synchrotron spectroscopic methods
as well as of neutron scattering experiments relies on an accurate theoretical
approach to lattice dynamics. The fundamental thermodynamic functions of internal
and free energy, entropy, and heat capacity, as well as nonlinear properties such
as thermal expansion and heat conduction, are to a considerable extent determined
by the vibrations of the constituent atoms in the lattice. Further phenomena such
as the electrical resistivity of metals, superconductivity, and the temperature
dependence of optical spectra are governed by electron-phonon coupling.
Fortunately, the quantum theory of lattice dynamics is well developed and has
proven to be one of the most successful theories of solid-state physics; it thereby
provides impressive reassurance of the validity and applicability of the concepts
of quantum theory in general.

From a historical point of view, the fundamental theory of lattice dynamics was 
developed in the 1930s and was reviewed in the book by Born and Huang (1954). 
In those early days, the main interest focused on general properties of the dynam- 
ical matrix, e.g., its symmetry and analytical properties, but less emphasis was 
put on its underlying physical origin based on the interactions of the constituent 
ions and electrons. The first approaches to theoretically determine phonon
dispersion relationships were based on lattice-dynamical models with adjustable
parameters to be fitted to experimentally measured frequencies, e.g., from inelastic
neutron scattering experiments, and were mainly used as a means to interpolate
between experimental data points. These model potentials either did not account for
electronic polarization at all (force-constant model, Born-von Kármán model,
Born-Mayer and Lennard-Jones potentials; Born and Huang 1954), or polarization
effects were approximated by electric dipoles generated by displacements of the
shells of valence electrons relative to the ion cores (shell models: Dick and
Overhauser 1958; Cochran 1959; Schröder 1966). For highly anisotropic
electron distributions as found in covalently bonded crystals, models relying on 
angular forces (Keating models — Keating 1966) or bond charge models (Martin 
1969; Weber 1977) have been developed. The predictive capabilities of these 
methods turned out to be quite limited. Although experimental phonon disper- 
sions were accurately reproduced close to the data used for fitting, predictions 
along lower-symmetry directions frequently failed due to the lack of a rigorous de- 
scription of the interactions among constituent atoms. The first attempts to inves- 
tigate the connections between the electronic structure and the dynamical proper- 
ties were made as late as in the early 1970s and were based on dielectric matrices 
or the electron polarizability (Pick et al. 1970; De Cicco and Johnson 1969). 

The advent of DFT (Hohenberg and Kohn 1964; Kohn and Sham 1965) and the 
progress of numerical methods for solving quantum physical equations together 
with the emergence of more and more powerful computers made it feasible to 
describe accurately the interatomic interactions in crystals and molecules based 
on quantum mechanics. With methods as described in the previous sections it 
is nowadays possible to calculate a large variety of properties without relying on 
input data from experiments. In fact, ab-initio methods revolutionized the practi- 
cal approaches to lattice dynamics, and accurate theoretical phonon dispersion 
curves can be calculated completely independently of any experimental knowl- 
edge. 

Three different techniques for ab-initio evaluation of vibrational properties have 
been developed, namely (a) direct methods based on total energy changes or 
forces calculated for atoms displaced from their equilibrium position, (b) analyti- 
cal calculation of force constants based on a perturbative expansion around the 
equilibrium geometry, and (c) Fourier transform of the atomic velocity autocorre- 
lation function obtained from a molecular dynamics trajectory (Rahman 1964). 
The third option usually suffers from difficulties in reaching equilibrium within 
reasonable simulation times, although several technical tricks have been devel- 
oped to address these problems (e.g., the Nosé-Hoover thermostat (Martyna et al. 
1992) and multiple signal classification (MUSIC) algorithm (Marple 1987; Kohan- 
off et al. 1992; Kohanoff 1994)). The second option makes use of the fact that the 
lattice distortion associated with a phonon is a static perturbation acting on the 
electrons, causing a linear response of the electron density which determines 
the energy variation up to third order (2n+1 theorem: Gonze and Vigneron 
1989). The linear variation of the electron density is determined in terms of the 
linear response of the Kohn—Sham orbitals by applying first-order density func- 
tional perturbation theory (Baroni et al. 1987; Giannozzi et al. 1991). This method 
is quite accurate, efficient, and elegant, but it requires highly specialized ab-initio 
computer codes and considerable implementation efforts. For a review of the
formalism and implementation of density functional perturbation theory and its
application see Baroni et al. (2001).

Direct methods, option (a), require the evaluation of total energy and forces for 
the equilibrium geometry as well as of several distorted geometries from which 
the force constant matrix can be assembled. Several different techniques are sub- 
sumed under the title direct methods: in the early 1980s, the first attempts at ab- 
initio lattice dynamics simulation were based on frozen phonon total energy cal- 
culations to derive accurate phonon frequencies for specific high-symmetry 
points in the Brillouin zone (e.g., Yin and Cohen 1982). Phonon dispersion 
curves along specific high-symmetry directions in reciprocal space were deter- 
mined by the method of interplanar force constants (Kunc and Martin 1982), 
where planes perpendicular to these directions are displaced within an elongated 
supercell. Since these one-dimensional force constants between high-symmetry 
planes are linear combinations of the three-dimensional interatomic force con- 
stants, the phonon dispersion can be derived, in principle, for general directions 
(Wei and Chou 1992, 1994). The most general direct approach to lattice dynamics 
is based on the ab-initio evaluation of forces on all atoms, produced by a set of 
finite displacements of a few atoms within an otherwise perfect crystal. The per- 
fect crystal environment has to be sufficiently large to ensure that interactions of 
the perturbation with all its translational symmetry equivalent copies are small, 
which usually requires construction of suitable supercells. The techniques for se- 
lecting suitable supercells and atomic displacements, assembling force constant 
matrices from the calculated forces, and calculating phonon dispersion relations 
via Fourier transform are described by Kresse et al. (1995), Frank et al. (1995), and 
Parlinski et al. (1997). Phonon dispersion curves determined from direct
approaches are very accurate, provided the supercell is larger than the interaction
range of all constituent atoms. The main advantage of direct methods is that no
specialized ab-initio codes are required: any code capable of deriving reasonably
accurate forces can be applied. The need to use supercells even for simple crystal
structures with small unit cells is one of the main drawbacks compared with the
linear response methods.

All applications presented later in this chapter were obtained by the general
direct method as developed by Kresse et al. (1995), Frank et al. (1995), and
Parlinski et al. (1997). Therefore, the method is outlined here in somewhat more
detail. The ground-state energy E of a crystal as a function of atom positions
$\mathbf{R}(n,\mu)$ (where n and m denote the unit cell and $\mu$ and $\nu$ the atom
index) can be written as a Taylor expansion in terms of atomic displacements
$\mathbf{u}$ around the equilibrium positions [Eq. (20)].

$$E(\ldots,\mathbf{R}(n,\mu),\ldots,\mathbf{R}(m,\nu),\ldots) = E_0 + \frac{1}{2}\sum_{n,\mu,m,\nu} \Phi(n,\mu;m,\nu)\,\mathbf{u}(n,\mu)\,\mathbf{u}(m,\nu) + O(u^3) \tag{20}$$
$E_0$ is the equilibrium energy. Terms linear in $\mathbf{u}$ are absent because at
equilibrium all forces on the atoms vanish. The elements of the atomic force
constant matrix $\Phi$ are defined by Eq. (21), with the derivatives taken at the
minimum energy configuration, for which all first-order derivatives vanish.

$$\Phi_{ij}(n,\mu;m,\nu) = \left.\frac{\partial^2 E}{\partial R_i(n,\mu)\,\partial R_j(m,\nu)}\right|_0 \tag{21}$$

The frequently used harmonic approximation consists in retaining in Eq. (20) 
only terms up to quadratic order in the displacements. 

Within the harmonic approximation the classical equations of motion for each
atom are given by Eq. (22), where $M_\mu$ is the mass of atom $\mu$.

$$M_\mu\,\ddot{\mathbf{u}}(n,\mu) = -\sum_{m,\nu} \Phi(n,\mu;m,\nu)\,\mathbf{u}(m,\nu) \tag{22}$$


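Solutions are required to have Bloch wave form because of translational
invariance. The $\mathbf{k}$ vectors are chosen to fulfill Born-von Kármán periodic
boundary conditions [Eq. (23)].

$$\mathbf{u}(n,\mu) = \frac{1}{\sqrt{M_\mu}}\,\mathbf{e}_\mu(\mathbf{k},j)\,e^{i[\mathbf{k}\cdot\mathbf{R}(n,\mu) - \omega(\mathbf{k},j)\,t]} \tag{23}$$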


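The equation of motion can thus be cast into the simple form of Eq. (24), where
for each mode j the squared phonon frequencies $\omega^2(\mathbf{k},j)$ are the
eigenvalues and the polarization vectors $\mathbf{e}(\mathbf{k},j)$ are the eigenvectors
of the dynamical matrix $D(\mathbf{k})$, which is introduced as a discrete Fourier
transform in Eq. (25).

$$D(\mathbf{k})\,\mathbf{e}(\mathbf{k},j) = \omega^2(\mathbf{k},j)\,\mathbf{e}(\mathbf{k},j) \tag{24}$$

$$D(\mathbf{k};\mu\nu) = \frac{1}{\sqrt{M_\mu M_\nu}} \sum_{m} \Phi(0,\mu;m,\nu)\, e^{-i\mathbf{k}\cdot[\mathbf{R}(0,\mu)-\mathbf{R}(m,\nu)]} \tag{25}$$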


The summation over m runs over all atoms of the crystal. The forces
$\mathbf{F}(n,\mu)$ on all atoms generated by the displacement $\mathbf{u}(m,\nu)$ of
atom $\nu$ can be written as Eq. (26), which relates the generated forces to the
force constant matrices and atomic displacements. This is the central relationship
used in the direct method.

$$F_i(n,\mu) = -\sum_{m,\nu,j} \Phi_{ij}(n,\mu;m,\nu)\,u_j(m,\nu) \tag{26}$$


The complete quantum mechanical description for a system of ions and elec- 
trons requires additional approximations. Since the nuclear mass is much larger 
than that of an electron, it is reasonable to consider the nuclei in their equilib- 
rium position while dealing with the electronic motion. In mathematical terms 
this so-called adiabatic (or Born-Oppenheimer) approximation yields a separation 
of the general Schrödinger equation into one for the motion of the nuclei and one
for the electrons. The nuclear motion is determined by the potential field
generated by the average motion of the electrons, and the corresponding nuclear
Schrödinger equation yields formally the same results as the classical theory. The
above equations therefore remain valid, provided the exact effective potential is
used. The purely electronic Schrödinger equation can be solved within density
functional theory, as discussed in previous sections.

In general, the ab-initio calculations for the direct approach are performed for a 
supercell constructed as a multiplication of the original cell large enough to en- 
sure that forces arising from a displaced atom diminish inside the supercell. In 
a first step all atom positions of the unperturbed supercell are optimized. In prin- 
ciple, any computational method providing atomic forces is suitable for the direct 
approach, but usually first-principles methods are employed. After the initial total 
energy minimization process all forces should vanish, and the resulting structure 
is the reference and starting point for all further steps. In order to obtain com- 
plete information on all force constants it is necessary to displace each nonequi- 
valent atom of the supercell along three nonequivalent directions, and to calculate 
for each of these perturbed supercells the forces on all other atoms generated by 
this displacement. The number of displacements may be considerably reduced by 
making use of symmetry relationships. For instance, for a cubic site symmetry, a 
single displacement along a single fourfold axis is adequate. If the site point 
group is tetragonal, one displacement along the fourfold symmetry axis and one 
perpendicular to it are sufficient. From this set of displacements and all the
resulting forces, the force constant matrix can be evaluated from a
multidimensional fit to Eq. (26). From the force constants the supercell dynamical
matrix is obtained by discrete Fourier transform. Finally, the squared frequencies
$\omega^2(\mathbf{k}, j)$ and polarization vectors $\mathbf{e}(\mathbf{k}, j)$ of the phonon
modes j are obtained by diagonalization of the supercell dynamical matrix for each
wave vector $\mathbf{k}$.
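
The workflow just described can be traced on the simplest possible system, a
one-dimensional monatomic chain. In the sketch below a hypothetical nearest-neighbor
spring model (`chain_forces`) stands in for the ab-initio force calculation; one
atom of a periodic supercell is displaced, the force constants follow from Eq. (26),
and the one-by-one dynamical matrix is the Fourier sum of Eq. (25). All names and
parameter values are illustrative assumptions.

```python
import cmath
import math


def chain_forces(displacements, spring):
    """Toy force calculator: periodic chain with nearest-neighbor harmonic springs."""
    n = len(displacements)
    return [spring * (displacements[(i + 1) % n] - 2.0 * displacements[i]
                      + displacements[i - 1]) for i in range(n)]


def force_constants_from_displacement(n_atoms, spring, delta=1e-3):
    """Displace atom 0 of the supercell and invert Eq. (26): Phi(n, 0) = -F(n)/u(0)."""
    u = [0.0] * n_atoms
    u[0] = delta
    return [-f / delta for f in chain_forces(u, spring)]


def phonon_frequency(phi, mass, k, a=1.0):
    """Frequency from the 1x1 dynamical matrix D(k) = (1/M) sum_m Phi(0, m) e^{ik R_m}."""
    n = len(phi)
    d = 0.0 + 0.0j
    for m, phi_m in enumerate(phi):
        # Use the periodic image of atom m that lies closest to atom 0.
        shift = m if m <= n // 2 else m - n
        d += phi_m * cmath.exp(1j * k * shift * a)
    d /= mass
    return math.sqrt(max(d.real, 0.0))


phi = force_constants_from_displacement(n_atoms=8, spring=10.0)
for k in (0.0, math.pi / 4, math.pi / 2, math.pi):
    omega = phonon_frequency(phi, mass=2.0, k=k)
    exact = 2.0 * math.sqrt(10.0 / 2.0) * abs(math.sin(0.5 * k))
    print(f"k = {k:5.3f}   direct method: {omega:8.5f}   analytic: {exact:8.5f}")
```

Because the toy interaction extends only to nearest neighbors, any supercell with
more than two atoms already contains the full interaction range and the dispersion
is reproduced exactly; for longer-ranged interactions the supercell size would have
to be converged.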

Vibrational properties are conveniently expressed by phonon dispersion curves 
along a selected high-symmetry path through the Brillouin zone and as the pho- 
non density of states (DOS), i.e., the frequency distribution of all normal modes. 
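The contributions of lattice vibrations to thermodynamic functions such as
internal energy, free energy, entropy, heat capacity, and thermal displacements are
derived by integration over the phonon density of states $g(\omega)$ as a function
of frequency $\omega$. From the phonon DOS the phonon free energy $F_{\mathrm{ph}}$ as
a function of temperature can be derived from Eq. (27), where the phonon internal
energy is given by Eq. (28), and the phonon entropy is evaluated as in Eq. (29).

$$F_{\mathrm{ph}}(T) = U_{\mathrm{ph}}(T) - T\,S_{\mathrm{ph}}(T) \tag{27}$$

$$U_{\mathrm{ph}}(T) = \frac{1}{2}\int_0^{\infty} g(\omega)\,\hbar\omega\,\coth\!\left(\frac{\hbar\omega}{2k_{\mathrm{B}}T}\right) d\omega \tag{28}$$

$$S_{\mathrm{ph}}(T) = k_{\mathrm{B}}\int_0^{\infty} g(\omega)\left\{\frac{\hbar\omega}{2k_{\mathrm{B}}T}\left[\coth\!\left(\frac{\hbar\omega}{2k_{\mathrm{B}}T}\right) - 1\right] - \ln\!\left[1 - \exp\!\left(-\frac{\hbar\omega}{k_{\mathrm{B}}T}\right)\right]\right\} d\omega \tag{29}$$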
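
A minimal numerical sketch of Eqs. (27)-(29) is given below. It assumes that the
phonon DOS has been sampled as a list of (phonon energy, weight) pairs, so that the
integrals reduce to sums; this discretization, the function names, and the example
mode energies are illustrative assumptions. Energies are in eV and temperatures in K.

```python
import math

K_B = 8.617333262e-5  # Boltzmann constant in eV/K


def _coth(y):
    return 1.0 / math.tanh(y)


def phonon_internal_energy(modes, temperature):
    """Eq. (28) for a discretized DOS given as (hbar*omega, weight) pairs."""
    return sum(w * 0.5 * e * _coth(e / (2.0 * K_B * temperature))
               for e, w in modes)


def phonon_entropy(modes, temperature):
    """Eq. (29) for a discretized DOS."""
    total = 0.0
    for e, w in modes:
        x = e / (K_B * temperature)
        total += w * (0.5 * x * (_coth(0.5 * x) - 1.0)
                      - math.log1p(-math.exp(-x)))
    return K_B * total


def phonon_free_energy(modes, temperature):
    """Eq. (27): F = U - T*S."""
    return (phonon_internal_energy(modes, temperature)
            - temperature * phonon_entropy(modes, temperature))


# Hypothetical spectrum: one mode at 20 meV and a doubly weighted mode at 35 meV.
modes = [(0.020, 1.0), (0.035, 2.0)]
for temperature in (1.0, 300.0, 1000.0):
    print(temperature,
          phonon_internal_energy(modes, temperature),
          phonon_free_energy(modes, temperature))
```

At low temperature the internal energy approaches the zero-point energy, half the
weighted sum of the mode energies, while the entropy term lowers the free energy
increasingly as the temperature rises.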
Ab-initio calculations have been largely viewed as being restricted to 0 K, or
more precisely, as being condemned to operate without the concept of temperature
at all. Based on Eqs. (27)-(29), this major constraint has been lifted, and
routine methods to evaluate vibrational properties and temperature-dependent
thermodynamic functions from first principles have become available.⁶⁾ A few
applications of this will be presented in the following sections. Another
contribution from thermodynamics has not yet been taken into account: configuration
space and configurational entropy. This part, which is particularly important for
alloy physics, will also be addressed in the later section on multiscale modeling
(Section 11.3.5).


11.3 
Applications 


11.3.1 
Structural and Phase Stability 


The aim is to find the thermodynamic ground state of an ordered solid, i.e., the
phase with the lowest energy. Restricting the considerations to T = 0 K (or
sufficiently low temperatures), all the information needed is included in the total
energy $U_{\mathrm{DFT}}$ of the DFT calculation. This quantity depends on the actual
atomic distribution, i.e., on the crystal structure and the volume. Therefore,
energies of different structures can be compared and the equilibrium volume found.
An important elastic ground-state property is the bulk modulus B, which is derived
from the second derivative of the volume-dependent DFT energy. It can also be
expressed as a linear combination of elastic constants; see Section 11.2.3. It
should be noted that the bulk modulus may also be denoted by the symbol K, as for
example in Chapter 4. Phase stabilities are usually compared in terms of energies
of formation $\Delta H(C, V_0)$, for which the energies of the pure phases serve as
a reference. For example, the formation energy (or enthalpy of formation at zero
pressure) of the compound $A_nB_m(C, V_0)$ with crystal structure C at its
equilibrium volume $V_0$ is defined by Eq. (30).

$$\Delta H(C, V_0) = U_{\mathrm{DFT}}(A_nB_m(C, V_0)) - [\,n\,U_{\mathrm{DFT}}(A) + m\,U_{\mathrm{DFT}}(B)\,] \tag{30}$$


The reference states for the constituent atoms A and B are in their respective 
ground states, corresponding to the experimental conditions. For example, for 
the fabrication of crystals of intermetallic compounds the pure metals have to be 
melted and put together for reaction. 
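
Eq. (30) amounts to a simple piece of bookkeeping, sketched below together with the
selection of the most stable candidate structure. All total energies in the example
are invented placeholder numbers chosen only to exercise the functions; they are
not DFT results for any real compound, and the function names are illustrative.

```python
def formation_energy(e_compound, n, m, e_a, e_b, per_atom=False):
    """Eq. (30): energy of A_n B_m minus the energies of its pure constituents.

    e_compound is the total energy per formula unit; e_a and e_b are the total
    energies per atom of the pure reference phases.
    """
    delta_h = e_compound - (n * e_a + m * e_b)
    return delta_h / (n + m) if per_atom else delta_h


def most_stable(candidates, n, m, e_a, e_b):
    """Return (structure, formation energy) with the lowest formation energy."""
    energies = {name: formation_energy(e, n, m, e_a, e_b)
                for name, e in candidates.items()}
    best = min(energies, key=energies.get)
    return best, energies[best]


# Placeholder energies (eV) for a hypothetical AB2 compound in three structures.
candidates = {"C14": -27.58, "C15": -27.60, "C36": -27.59}
print(most_stable(candidates, n=1, m=2, e_a=-8.50, e_b=-9.50))
```

A negative formation energy indicates that the compound is stable with respect to
decomposition into the chosen reference phases, which is why the choice of
reference states matters.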

As an example of structural and phase stability, the Laves phase compounds
XCr2 (X = Ti, Zr, Hf) (Chen et al. 2005a) are discussed. Three different crystal
structures are considered for each compound. The reference energies are
calculated for Zr metal in its hexagonal ground state, and for Cr in its bcc magnetic
ground state, although Cr in the compound is nonmagnetic. Figure 11.2 shows
the trend of volumes and bulk moduli B, revealing the largest B values for the C15
structures, which are the stable ground-state phases according to the calculated
energies and also according to experiment. One notices the exceptional nature
of the ZrCr2 compound, for which the volumes of all three structures are the
largest but the bulk moduli the smallest when compared to TiCr2 and HfCr2.
Only two experimental results for B exist, namely for TiCr2 in the C14 structure
(159 GPa; the DFT value is 199 GPa), and for ZrCr2 in the C15 structure (162 GPa;
the DFT value is 182 GPa). The difference between experiment and calculation is due
to the smaller calculated volumes (see Table 11.4). Concerning the volumes, the
differences are attributed to the temperature dependence of the lattice
parameters. This holds in particular for the C14 phase of TiCr2, which only exists at
high temperatures.


6) Materials Design Inc., MedeA Phonon, Angel Fire (NM), USA, http://www.materialsdesign.com

Fig. 11.2 DFT equilibrium volumes $V_0$ and bulk moduli B for the C14,
C15, and C36 structures of TiCr2, ZrCr2, and HfCr2.

The calculated equilibrium enthalpies of formation with respect to the
equilibrium bulk phases of the pure constituents are defined as the difference between
the corresponding DFT total energies $U_{\mathrm{DFT}}$ according to Eq. (30), and are
listed in Table 11.5 and illustrated in Fig. 11.3. The C15 structure is always the
most stable one, in accordance with the experimental findings up to rather high
temperatures. The energy differences between the different structures are rather
uniform, with C36 being less stable than C15 by about 5 kJ mol⁻¹, and C14 being
even less stable than C36 by about 2 to 3 kJ mol⁻¹.

Currently, three experimental values for the enthalpy of formation of TiCr2 are
published, showing considerable discrepancies. The experimental values seem to
be rather questionable. For HfCr2 an estimate of −5.4 kJ mol⁻¹ at 298 K is
known. However, for both ZrCr2 and HfCr2 no measured values are available.
As recently demonstrated for TiMn2 (Chen et al. 2003), for which calorimetric
experimental data were also measured, a high-quality DFT approach yields reliable
results for the enthalpy of formation, and can be used as a predictive tool.
Furthermore, the DFT result of −141.2 kJ mol⁻¹ for ScAl2 (Chen et al. 2004a) is in
very good agreement with a very recent experimental value of −141.6 kJ mol⁻¹
obtained by reaction calorimetry (Cacciamini et al. 1999). Therefore, the DFT en-

Table 11.4 DFT and experimental structural parameters for TiCr2, ZrCr2,
and HfCr2 with C14, C15, and C36 crystal structures. Lattice parameters
a and c are in Å, equilibrium volumes $V_0$ per formula unit in Å³, and
bulk moduli B in GPa.

Compound  Structure  a      c       V0    B    Method
TiCr2     C15        6.857          40.3  208  DFT
                     6.910          41.2       expt.
                     6.993          42.7       expt.
          C14        4.885  7.830   40.5  199  DFT
                     4.932  8.005   42.2       expt.
                     4.900  7.927   41.2       expt.
                                          159  expt.
          C36        4.869  15.748  40.4  199  DFT
                     4.932  16.001  42.1       expt.
ZrCr2     C15        7.145          45.6  182  DFT
                     7.204          46.7       expt.
                     7.207          46.8       expt.
                                          162  expt.
          C14        5.094  8.103   45.5  176  DFT
                     5.106  8.292   46.8       expt.
          C36        5.065  16.409  45.6  180  DFT
                     5.100  16.611  46.7       expt.
HfCr2     C15        7.079          44.4  199  DFT
                     7.140          45.4       expt.
          C14        5.046  8.095   44.6  191  DFT
                     5.067  8.237   45.9       expt.
          C36        5.022  16.301  44.5  190  DFT
                     5.064  16.470  45.7       expt.





Table 11.5 DFT enthalpies of formation $\Delta H$ in kJ mol⁻¹ as compared to
the available experimental data and values obtained by the application
of Miedema’s model for the C14, C15, and C36 structures of TiCr2,
ZrCr2, and HfCr2.

         C14     C15     C36     Miedema  Expt.
TiCr2    −30.6   −35.9   −33.0   −30.3    −27.9; −9.92;
ZrCr2    −9.0    −14.5   −12.1   −51.6    –
                 −14.6
HfCr2    −29.6   −34.9   −32.5   −29.2    –


0.3 
Fig. 11.3 DFT enthalpies of formation for TiCr2, ZrCr2, and HfCr2 and
values derived by Miedema’s model.


thalpies of formation for the XCr2 compounds in Table 11.5 can be considered to
be reliable. In addition, the enthalpy of formation for ZrCr2 of −14.5 kJ mol⁻¹
agrees well with the value of −14.6 kJ mol⁻¹ obtained by another DFT application
(Krcmar and Fu 2003).

Figure 11.3 shows the trend of DFT enthalpies of formation, with ZrCr2 being
less stable by a factor of about two compared to TiCr2 and HfCr2. Also shown is
the result obtained from Miedema’s semiempirical model, which is totally
unreliable. Miedema’s model fails very badly for ZrCr2, reversing the trend of the
DFT results. Similar failures were also found for ZrCo2, ZrPd, and ZrPd3.

Utilizing a simple model, the variation of the average electron density as
expressed by the number of valence electrons per atom is correlated to the cohesive
properties of compounds which have a comparable type of chemical bonding. For
the XCr2 compounds the numbers of valence electrons are $n_X = 4$ and
$n_{\mathrm{Cr}} = 6$, which, by application of the relation
$c = (n_X + 2n_{\mathrm{Cr}})/(3V_0) = 16/(3V_0)$, for the C15 structures results in
values of c = 0.132, 0.117, and 0.120 e Å⁻³ for TiCr2, ZrCr2, and HfCr2,
respectively, for the equilibrium volumes $V_0$ as listed in Table 11.4.
Clearly, this results in the correct trend compared to the formation energies, due
to the opposite trend of the equilibrium volumes.
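The arithmetic behind these electron densities can be checked with a few lines
of Python. This is only an illustrative sketch: the function and variable names
are assumptions introduced here, while the valence electron counts and the C15
DFT volumes are those quoted in the text and in Table 11.4.

```python
# Average valence electron density c = (n_X + 2 n_Cr) / (3 V0) for XCr2.
# V0 values: C15 DFT equilibrium volumes per formula unit (Table 11.4), in A^3.

def valence_electron_density(n_x, n_cr, v0):
    """Electron density in e/A^3 as defined by the relation in the text."""
    return (n_x + 2 * n_cr) / (3.0 * v0)


V0_C15 = {"TiCr2": 40.3, "ZrCr2": 45.6, "HfCr2": 44.4}

densities = {
    name: valence_electron_density(n_x=4, n_cr=6, v0=v0)
    for name, v0 in V0_C15.items()
}

for name, c in densities.items():
    print(f"{name}: c = {c:.3f} e/A^3")
```

Rounded to three decimals, the values agree with those given above, and the
smallest density is obtained for ZrCr2, which has the largest volume.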

So far, nonmagnetic compounds have been discussed. For deriving enthalpies
of formation for magnetic compounds the concept is the same (Chen et al.
2005b). However, it should be realized that the gain of bonding energy due to the
formation and ordering of magnetic moments (i.e., due to spin polarization and
orientations of moments) could be substantial. Even the structural stability could
be changed, as is the case for ZrMn2 and ZrFe2 (Chen et al. 2005b). Therefore,
contributions to the total energy due to magnetism could strongly affect
thermodynamic properties and, consequently, the determination of the ground state.

Vibrational properties are of particular importance for temperature-dependent
phase and structural stabilities. In the following, some corresponding examples
for the Laves phase compounds ZrCr2 and ZrMn2 (Chen et al. 2005a) are
presented. All these results are derived by applying the concept presented in Section
11.2.4. Figure 11.4 shows the phonon dispersion relations and the density of
states (DOS) for ZrCr2 calculated at the equilibrium volumes of the C14 and C15
structures. There are 18 (for C15) and 36 (for C14) phonon branches, i.e., three
per atom in the unit cell (6 and 12 atoms for C15 and C14, respectively).
Because of the higher symmetry, the features of the DOS for the C15 structure are
much sharper. A further significant difference between the dispersions of the two
structures is seen in the frequency range between 2 and 3.5 THz. There, no
modes exist for the C15 structure, in contrast to C14, for which highly dispersive
bands are seen, in particular around the point A, and close to A a soft mode arises.

Fig. 11.4 Ab-initio phonon dispersion relations and density of states for
ZrCr2 in the C15 (a) and C14 (b) structure.

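From the phonon DOS the phonon free energy $F_{\mathrm{ph}}$ can be derived from Eq.
(31), which combines the phonon internal energy $U_{\mathrm{ph}}(T)$ and the phonon
entropy $S_{\mathrm{ph}}(T)$.


$$F_{\mathrm{ph}}(T) = U_{\mathrm{ph}}(T) - T\,S_{\mathrm{ph}}(T) \qquad (31)$$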
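Eq. (31) can be illustrated with the standard harmonic-oscillator expressions
for the internal energy and entropy of a set of phonon modes. The sketch below
is not the procedure of the cited work; it sums over a short, hypothetical list
of mode frequencies (an assumption standing in for a sampled phonon DOS) and
requires all frequencies to be real and positive.

```python
import math

H_EV_S = 4.135667696e-15   # Planck constant in eV s
K_B = 8.617333262e-5       # Boltzmann constant in eV/K


def phonon_thermo(freqs_thz, temperature):
    """Return (U_ph, S_ph, F_ph) in eV, eV/K, eV for harmonic modes.

    freqs_thz: positive mode frequencies in THz (imaginary modes excluded)
    """
    u_ph = 0.0
    s_ph = 0.0
    for nu in freqs_thz:
        energy = H_EV_S * nu * 1.0e12            # mode quantum h*nu in eV
        x = energy / (K_B * temperature)
        occupation = 1.0 / math.expm1(x)          # Bose-Einstein occupation
        u_ph += energy * (0.5 + occupation)       # includes zero-point energy
        s_ph += K_B * (x * occupation - math.log1p(-math.exp(-x)))
    return u_ph, s_ph, u_ph - temperature * s_ph  # Eq. (31)


# Two hypothetical modes at 5 and 10 THz, evaluated at 300 K.
u_ph, s_ph, f_ph = phonon_thermo([5.0, 10.0], 300.0)
print(f"U_ph = {u_ph:.5f} eV, S_ph = {s_ph:.3e} eV/K, F_ph = {f_ph:.5f} eV")
```

Evaluating the same function for two structures and subtracting the results
gives the phonon contribution to a structural free energy difference.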




Fig. 11.5 Differences in total free energies $\Delta F = F_{\mathrm{C14}} - F_{\mathrm{C15}}$.
Results with (solid line) and without (broken line) temperature-dependent
electronic contributions (Chen et al. 2004b). Left panel: ZrMn2; right
panel: ZrCr2. Positive values: C15 is the stable structure; negative
values: C14 is the stable structure.


The differences in total free energies $\Delta F = F_{\mathrm{C14}} - F_{\mathrm{C15}}$
between the C14 and C15 structures of ZrCr2 and ZrMn2 are shown in Fig. 11.5. The
energy scales in the two panels are different because of the very different
structural energies: for ZrCr2 the energy $U_{\mathrm{DFT}}$ for the C15 structure is
much lower than for C14, whereas for ZrMn2 the corresponding energies are close to
being degenerate. Furthermore, the temperature dependence of the structural free
energy difference (Chen et al. 2004b) for ZrMn2 is totally opposite to that for
ZrCr2. Although for ZrMn2 C15 is stable at very low temperatures, the C14 phase is
stabilized above a transition temperature of 160 K due to the phonon free energy
difference. The structural stability of ZrCr2 is quite different: at very low
temperature C15 is already much more stable than is the case for the Mn compound.
Figure 11.6 shows the specific heat for the C14 and C15 structures of ZrCr2.
Whereas for the full phonon calculation the curves nearly coincide, for the results
based on the Debye approximation a significant splitting between C14 and C15 is
observed, with all the Debye results being larger than the full phonon results. The
largest differences occur at about 200 K.

Fig. 11.6 Lattice specific heat for C14 and C15 structures of ZrCr2. Full
phonon calculation is denoted by ph; Debye results are marked by D.


11.3.2 
Point Defects 


Here, point defects will be discussed for binary compounds in terms of
supercells. The basic concepts can be easily extended to more complicated cases. The
corresponding defect energies are derived from total energies of suitable
supercells. The supercells containing the defect should be sufficiently large in order
to avoid defect-defect interactions. Such supercells can therefore be large and the
calculations costly, depending on the type of material and defects under study.
The advantage of the supercell concept is that all the benefits of a standard
DFT calculation are available (such as accuracy, freedom of choice of crystal
symmetry, geometry relaxations, spin polarization). Ideally, a true point defect
would be modeled by one defect atom within a matrix of an infinite number of host
atoms. In that case, other techniques based on Green functions have to be applied
which, however, are less generally applicable than a supercell approach. The
Green function approach can also be costly, depending on the actual system
and the accuracy required.

Returning to the treatment of point defects by supercells, within a sufficiently
large unit cell for the perfectly ordered compound $A_mB_n$, defects are introduced
by replacing one atom of type A or B by a defect species X, resulting in the
compositions $A_{m-1}B_nX$ or $A_mB_{n-1}X$. Species X might be a third atomic
species, a vacancy, or an A or B atom sitting at a “wrong” lattice site (a so-called
antisite defect). Another important type of defect is an X atom at an interstitial
position. Considering as an example the case of $A_{m-1}B_nX$, a defect formation
energy can be formulated in terms of the differences in DFT total energies by
Eq. (32), including suitable reference states for the pure phases. This type of
formation energy is a formation energy for a ternary compound.


$$\Delta E_f(X) = E_{\mathrm{DFT}}(A_{m-1}B_nX) - \left[ (m-1)\,E_{\mathrm{DFT}}(A) + n\,E_{\mathrm{DFT}}(B) + E_{\mathrm{DFT}}(X) \right] \qquad (32)$$


Quite often, one wants to know the energy for the formation of a defect X in the
perfect compound of composition $A_mB_n$. Therefore, one might express another
type of formation energy by Eq. (33).


$$\Delta E_f(X) = E_{\mathrm{DFT}}(A_{m-1}B_nX) - \left[ E_{\mathrm{DFT}}(A_mB_n) - E_{\mathrm{DFT}}(A) + E_{\mathrm{DFT}}(X) \right] \qquad (33)$$

Immediately the question arises about the physical state of the pure phases of A
and X: should $E_{\mathrm{DFT}}(A)$ and $E_{\mathrm{DFT}}(X)$ be calculated for the
solid phase or the gas phase, depending on the history of how A is removed from the
compound and is replaced by X? Clearly, such a concept is not very useful. The way
out consists in constructing a thermodynamic model with suitable particle reservoirs
described by chemical potentials for the species A and B, as derived from suitable
supercell calculations. The formulation of a concept based on statistical mechanics
is more complicated for a compound than for a one-atom type of system because the
overall stoichiometry has to be preserved, which means that various
kinds of defects have to coexist. Therefore, one has to work with a grand canonical
ensemble for which the number of particles can be adjusted by chemical
potentials.

For low concentrations of substitutional defects, statistical models can be
formulated (Meyer and Fähnle 1999; Rasamny et al. 2001) which make use of
supercell energies $\varepsilon$ labeled by the defect species (subscript) and the
sublattice it occupies (superscript). For example, $\varepsilon_A^B$ denotes the
supercell energy of an antisite defect for which an A atom is sitting on
a site of the B sublattice. The supercell energy for a vacancy, e.g., at the A
sublattice, is denoted by $\varepsilon_{\mathrm{vac}}^A$. For noninteracting defects
and sufficiently low concentrations of defects these energies are independent of the
concentration. The defect systems should also be at their ground states. However,
this assumption can be lifted and more general expressions can be formulated
including pressure-volume terms (Meyer and Fähnle 1999; Rasamny et al. 2001).

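By minimization of the Gibbs free energy with respect to the number of point
defects, expressions for the concentrations of point defects are derived. For
example, for a vacancy at an A site one derives Eq. (34), in which M represents the
total number of available sites, $M^A$ the number of available sites of the A
sublattice, and $\beta = 1/(k_B T)$.


$$c_{\mathrm{vac}}^A = \frac{M^A}{M}\,\frac{e^{-\beta(\varepsilon_{\mathrm{vac}}^A + \mu_A)}}{1 + e^{-\beta(\varepsilon_{\mathrm{vac}}^A + \mu_A)} + e^{-\beta(\varepsilon_B^A + \mu_A - \mu_B)}} \qquad (34)$$


In a similar way, the concentration for an antisite defect is expressed by Eq. (35).


$$c_B^A = \frac{M^A}{M}\,\frac{e^{-\beta(\varepsilon_B^A + \mu_A - \mu_B)}}{1 + e^{-\beta(\varepsilon_{\mathrm{vac}}^A + \mu_A)} + e^{-\beta(\varepsilon_B^A + \mu_A - \mu_B)}} \qquad (35)$$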


The concentrations for the A-rich defects (B vacancy and A antisite) are
formulated correspondingly. The so-far unknown chemical potentials obey the
condition expressed by Eq. (36).


$$G = \mu_A N_A + \mu_B N_B \qquad (36)$$


The general composition of the compound, including the defects in a suitable way,
is given by $N_A/N_B = x/(1 - x)$, where $N_i$ is the total number of atoms of
species i = {A, B}, and x varies in a small range around the stoichiometric
composition. Considering the conditions discussed and including the supercell energy
$\varepsilon_0$ for the perfect compound, the chemical potentials can be calculated
from a set of coupled equations. As visualized by Eqs. (34) and (35), the defects are
intermixed in order to maintain the overall composition, which is kept fixed. Of
course, only small deviations of the overall composition from the perfect
stoichiometry are reasonable, typically of the order of 1-2%. In general, the
chemical potentials depend on pressure, temperature, and concentration x. For
example, the energy cost for the formation of an A vacancy for a given concentration
x amounts to $\Delta H_{\mathrm{vac}}^A = \varepsilon_{\mathrm{vac}}^A + \mu_A$, in
which the chemical potential $\mu_A(x)$ represents the energy for adding an A atom to
the system.

The thermodynamic model presented contains some simplifications because at 
high temperatures more complicated defect combinations may contribute signifi- 
cantly to the configurational entropy. Furthermore, vibrational effects are not 
included. 

As an example, calculated results for point defects in the compound ZrMn2 for
two crystal structures are presented, as published very recently (Chen et al.
2005b). Vacancies V(Zr) at the Zr sites and V(Mn) at the Mn sites, and antisite
defects $\mathrm{Zr^{Mn}}$ for Zr occupying a Mn site and $\mathrm{Mn^{Zr}}$ for Mn
occupying a Zr site were considered. Because of symmetry this results in four
possible point defects for the high-symmetry cubic C15 structure, and in six
different point defects for the hexagonal C14 structure due to crystallographically
inequivalent Mn sites. Further complications arise because the Mn atoms carry
magnetic moments, which depend on their local environment. For the DFT calculations,
however, this is easy to handle: the total energy is minimized with respect to spin
polarization.

The enthalpies of formation for $\mathrm{Zr}_x\mathrm{Mn}_{1-x}$ compounds derived
from the total energies of the supercells are summarized in Fig. 11.7. For both
structures the formation of antisites is clearly preferred when compared to
vacancies, and the $\mathrm{Mn^{Zr}}$ antisite defects are the most favorable ones.
For the C14 compound the supercell formation energy for a $\mathrm{Mn^{Zr}}$ defect
is so low that it even falls below the broken line in Fig. 11.7 marking the boundary
of thermodynamic stability with respect to the perfect compound ZrMn2 and pure Mn.
This means that the ordered compound with the supercell structure for the
$\mathrm{Mn^{Zr}}$ defect is energetically more stable than a mixture of pure Mn and
ZrMn2 phases, which seems not realistic. This result indicates the possible existence
of other ordered Zr-Mn compounds being richer in Mn, and it explains a broadening of
the homogeneity range of the ZrMn2 phase toward Mn-rich compositions. Up to now, the
only experimentally prepared solid compound of the Zr-Mn system is the Laves phase
ZrMn2. Therefore, thermodynamic experiments on the Mn-rich part of the
phase diagram are encouraged.

Table 11.6 lists the defect formation energies at 1500 K for the stoichiometric 
composition. Antisite defects are clearly preferred, as is also demonstrated by 
Table 11.7 showing the temperature dependence of defect concentrations. 
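As a rough plausibility check of how formation energies translate into
concentrations, a bare Boltzmann factor can be evaluated. This is a
deliberately simplified sketch, not the coupled model of Eqs. (34) to (36): it
assumes that the energies in Table 11.6 are given in eV (the table states no
unit), ignores the sublattice prefactor, and uses the 1500 K energy at all
temperatures. The function name is hypothetical.

```python
import math

K_B = 8.617333262e-5  # Boltzmann constant in eV/K


def boltzmann_concentration(e_form_ev, temperature):
    """Dilute-limit estimate c ~ exp(-E_form / (k_B T)); prefactor set to 1."""
    return math.exp(-e_form_ev / (K_B * temperature))


# Antisite formation energy for C15 from Table 11.6 (unit assumed to be eV).
E_ANTISITE_C15 = 1.19

estimates = {
    t: boltzmann_concentration(E_ANTISITE_C15, t) for t in (500, 1000, 1500)
}

for t, c in estimates.items():
    print(f"T = {t:4d} K: c(antisite, C15) ~ {c:.2e}")
```

Under these assumptions the estimates lie within a few percent of the C15
antisite entries of Table 11.7. The vacancy concentrations are not expected to
be reproduced this closely, because the prefactor and the temperature
dependence of the chemical potentials are neglected here.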

According to Fig. 11.8, with increasing temperature the minimum of the free
energy of C14 ZrMn2 no longer occurs at the stoichiometric composition but is
shifted toward Mn-richer compositions. This implies that the congruent melting
point also may not occur at the stoichiometric composition but on the Mn-rich
side. In addition, it is expected that Mn antisites may produce a broad
nonstoichiometric range toward the Mn-rich side, in agreement with the experimental
phase diagram. It is well known that ZrMn2 samples produced by the standard
melting techniques are off-stoichiometric with a slightly enriched Mn
concentration.

Fig. 11.7 DFT enthalpies of formation for $\mathrm{Zr}_x\mathrm{Mn}_{1-x}$ versus the
concentration x of Zr atoms for the C15 structure (upper panel) and the
C14 structure (lower panel). V(Zr): vacancy at Zr site, V(Mn): vacancy
at Mn (2a) and (6h) sites, $\mathrm{Zr^{Mn}}$: Zr at Mn (2a) and (6h) sites,
$\mathrm{Mn^{Zr}}$: Mn at Zr site.

Finally, the change in magnetic moments upon defect formation was
investigated. A very striking effect is observed for the Mn antisite defect
$\mathrm{Mn^{Zr}}$, which is also the dominant defect type at Mn-richer compositions:
the local moment of the Mn atom increases from about 1 $\mu_B$ for the defect-free
compound to 3.2 $\mu_B$ for the $\mathrm{Mn^{Zr}}$ defect, for both the C14 and C15
structures. This massive increase in the magnetic moments is due to the change in
local coordination: compared to the defect-free compound the Mn defect atom at a Zr
site has significantly increased nearest-neighbor distances to the surrounding Mn
atoms, which causes stronger localization of the Mn states, and therefore the local
magnetic moment is increased. Experiments should be able to detect the predicted
large magnetic moments of $\mathrm{Mn^{Zr}}$, which in reality might appear as
disordered local moments.


Table 11.6 Defect formation energies for the C14 and C15 structures of
ferromagnetic ZrMn2 at 1500 K.

       V(Mn)   V(Zr)   Mn^Zr   Zr^Mn
C15    2.64    3.74    1.19    1.19
C14    2.53    2.96    0.89    0.89

Table 11.7 Temperature-dependent defect concentrations for
ferromagnetic and stoichiometric ZrMn2.

         C15                            C14
T (K)    V(Mn)     V(Zr)     anti       V(Mn)     V(Zr)     anti
500      2.6E-27   1.8E-38   1.0E-12    2.7E-28   1.4E-30   1.1E-09
1000     5.1E-14   1.4E-19   1.0E-06    1.6E-13   1.2E-15   3.3E-05
1500     3.7E-09   2.6E-13   1.0E-04    3.0E-09   1.1E-10   1.0E-03

Fig. 11.8 Temperature-dependent defect free energy for
$\mathrm{Zr}_x\mathrm{Mn}_{1-x}$ for the ferromagnetic C14 (solid lines) and C15
(broken lines) phases.


11.3.3 
Diffusion Processes 


Solid-state diffusion processes underlie a large variety of materials
properties and processes such as segregation, phase transformation, environmental
embrittlement, and corrosion (see Chapter 5). Despite this fundamental
importance, reliable experimental mass diffusion data are rather scarce in some
areas. For instance, the diffusion coefficients for hydrogen in metals such as
titanium (Papazoglou and Heppworth 1968; Brauer et al. 1981) and aluminum
(Young and Scully 1998) are scattered over many orders of magnitude, and
diffusion data for many important systems are not reported at all. Ab-initio
calculations provide an alternative, reliable route to derive temperature-dependent
diffusion coefficients quantitatively and, in addition, are capable of providing
insight into the diffusion processes at an atomistic level. In this section the
strategies and procedures to derive diffusion coefficients quantitatively from first
principles are reviewed and, as an example of an application, results for diffusion
processes in the lithium-aluminum alloy system are presented (Müller et al. 2007).

As a first step it is necessary to identify possible diffusion mechanisms that
might be responsible for observed mass diffusion processes. This requires the
identification of stable configurations in the first place and of possible
rearrangements of atoms to reach adjacent stable configurations. In the case of
impurity diffusion, for instance, the site preference of the impurity (interstitial
or different substitutional sites) needs to be established by total energy
calculations and suitable thermodynamic models (along the lines discussed in Section
11.3.2). Once stable sites are identified, possible diffusion paths between these
sites are worked out and energy profiles along these diffusion paths are calculated.
Technically speaking, the so-called minimum energy path is required. The energy
maximum along this path, i.e., the saddle point energy, defines the transition state
that is of central importance for our further considerations in the framework of
harmonic transition state theory (see Chapter 5). Of course, it might be necessary to
investigate several possible diffusion paths, and each diffusion path may involve
multiple metastable sites and transition states. Those paths with lower-lying saddle
point energies will dominate the diffusion process.

A number of ab-initio based methods have been developed to follow reaction
paths (in chemistry) or diffusion paths (in materials physics), and to identify the
saddle points (see Henkelman et al. (2000) for a review). Some methods focus
on the initial state only and explore reaction or diffusion possibilities either by
following stepwise the path of slowest ascent, or by evaluating normal modes of a
harmonic approximation to the local potential energy surface and following each
of these modes until a saddle point is found. However, slowest ascent paths may
not lead to saddle points, and the latter approach requires evaluation and
diagonalization of second derivatives that either may not be available or may quickly
become computationally impracticable for larger systems. Other approaches take
into account both the initial and the final configurations and try to find the
minimum energy path between them. The most straightforward approach is to drag
the migrating atom step by step along the diffusion path by linear interpolation
between initial and final states. For each step this degree of freedom is kept fixed
and all other degrees of freedom are optimized (molecular statics). This approach
may work in simple cases but tends to fail for more complex energy hypersurfaces.
More generally applicable algorithms for finding minimum energy paths
between two points are the so-called chain-of-states methods. There, a chain of
images is generated between the initial and final states. All the images are
connected to trace out the diffusion path and are optimized simultaneously in a
concerted manner. Whereas the initial images are located on a line connecting initial
and final configurations, after simultaneous optimization of all images they lie
close to the minimum energy path. Several variants of these methods have been
developed; Henkelman et al. (2000) may serve as a review of this field. In a recent
study of diffusion in the lithium-aluminum alloy system (Müller et al. 2007) the
nudged elastic band (NEB) method (Jonsson et al. 1998) was applied, a variant of
the chain-of-states methods that includes springs between the images to ensure that
the images keep their distance. In addition, a minimization of the elastic band
between images is carried out by projecting out the perpendicular component of the
spring force and the parallel component of the true force along the minimum
energy path. This procedure ensures that the spring force has no component
perpendicular to the path that could drag images off the minimum energy path, and
that the true force has no component parallel to the path that could slide
images down the minimum energy path. The NEB method has been applied
successfully for a wide range of applications such as diffusion at surfaces, sputter
deposition, dissociative adsorption of molecules on surfaces, premelting of metal
clusters, cross-slip of screw dislocations, and atomic exchange processes at
semiconductor surfaces (see Jonsson et al. (1998) and references therein).
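The force projection at the heart of the NEB method can be sketched for a single
image as follows. This is an illustrative fragment only, not the implementation
used in the cited study: the function name is hypothetical, the tangent is
estimated simply from the two neighboring images (other tangent definitions are
in use), and the coordinates and forces in the example are made up.

```python
import math


def _sub(a, b):
    return [x - y for x, y in zip(a, b)]


def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def _norm(a):
    return math.sqrt(_dot(a, a))


def neb_force(prev_image, image, next_image, true_force, spring_k):
    """NEB force on one image: perpendicular true force + parallel spring force."""
    # Simple tangent estimate: unit vector from the previous to the next image.
    t = _sub(next_image, prev_image)
    t_len = _norm(t)
    tangent = [x / t_len for x in t]

    # True force with its component along the path projected out.
    f_parallel = _dot(true_force, tangent)
    f_perp = [f - f_parallel * tx for f, tx in zip(true_force, tangent)]

    # Spring force acting only along the path; it vanishes for equal spacing.
    stretch = _norm(_sub(next_image, image)) - _norm(_sub(image, prev_image))
    f_spring = [spring_k * stretch * tx for tx in tangent]

    return [a + b for a, b in zip(f_perp, f_spring)], tangent


# Hypothetical two-dimensional example with equally spaced neighbors.
force, tangent = neb_force(
    prev_image=[0.0, 0.0],
    image=[1.0, 0.5],
    next_image=[2.0, 0.0],
    true_force=[0.3, -0.4],
    spring_k=1.0,
)
print("tangent:", tangent, "NEB force:", force)
```

In this example the image is equidistant from its neighbors, so the spring
contribution is zero and only the true force component perpendicular to the
path remains, pulling the image toward the minimum energy path.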

Here we show results of the NEB method for the identification of diffusion
paths in the Li-Al alloy system (Müller et al. 2007). The calculations were
performed for 2 x 2 x 2 supercells containing about 32 atoms to determine vacancy
and antisite formation energies, site preferences, and migration profiles of Li and
Al atoms in LiAl3 and aluminum. As an example, Fig. 11.9 shows the NEB
results for the minimum energy path of an Al atom migrating into an adjacent Li
vacancy within a supercell of stoichiometry Li7Al24. The NEB identified the
minimum energy path, the saddle point energy, and the transition state geometry.
Thus, the energy barrier to be overcome by the Al atom amounts to about 65
kJ mol⁻¹ (0.67 eV) and the difference between initial and final states is about
32.5 kJ mol⁻¹ (0.34 eV). These are the most important ingredients for the
quantitative prediction of diffusion coefficients. Furthermore, the optimized
structures of initial and final states and the transition state are required for the
evaluation of temperature-dependent thermodynamic functions, which is the second
prerequisite for applying harmonic transition state theory.

The quantitative description of diffusion is based on Eyring’s concept of the
activated complex (Eyring 1935) as applied by Wert and Zener (1949) to impurity
diffusion in solids. Eyring’s theory of the activated complex assumes that the
reaction proceeds over a well-defined transition state, the so-called activated
complex, which is in thermodynamic equilibrium with its surroundings. Furthermore,
one translational degree of freedom along the diffusion path replaces the
particular vibrational motion of the initial state leading to the transition state.
The reaction rate is then given by (a) the ratio between the partition functions of
the system in the transition state and the initial state including the Boltzmann
factor $\exp[-\Delta E/(k_B T)]$, which involves the activation energy $\Delta E$,
and (b) the mean velocity of the reactants crossing the transition state. The first
term is directly obtained from ab-initio total energy and phonon calculations. The
second term is taken from the temperature-dependent velocity distribution of
classical particles at a temperature T.

Fig. 11.9 Energy profile for the minimum energy path for migration of an Al atom
into an adjacent Li vacancy in LiAl3 as obtained by the nudged elastic band method.
The diamonds represent the total energies of the optimized images. The supercell
used for the calculation is shown: blue spheres represent Li and yellow spheres Al
atoms; the small sphere in the center symbolizes the Li vacancy. The movement of the
Al atom into the Li vacancy is shown by a red arrow (Müller et al. 2007).

Applying this approach to the diffusion of an atom in a solid, Eq. (37) is
obtained for the temperature-dependent diffusion coefficient.


$$D(T) = n f d^2 \nu \exp\left[ -\frac{\Delta G(T)}{k_B T} \right] \qquad (37)$$


Herein, n is the number of nearest-neighbor stable sites, f is the probability that
a jump to a nearest-neighbor site leads forward in the direction of diffusion, and
d is the length of the jump projected onto the direction of diffusion. The
frequency $\nu$ corresponds to the mode of the stable state which leads toward the
transition state, and $\Delta G$ is the effective free energy difference between
transition state (TS) and initial state (denoted by 0), including electronic energies
$E_{\mathrm{el}}$ obtained directly from ab-initio calculations and
temperature-dependent vibrational energies $E_{\mathrm{vib}}(T)$ and entropies
$S_{\mathrm{vib}}(T)$ from phonon calculations [Eq. (38)].


$$\Delta G(T) = E_{\mathrm{el}}^{\mathrm{TS}} - E_{\mathrm{el}}^{0} + E_{\mathrm{vib}}^{\mathrm{TS}}(T) - E_{\mathrm{vib}}^{0}(T) - T\left[ S_{\mathrm{vib}}^{\mathrm{TS}}(T) - S_{\mathrm{vib}}^{0}(T) \right] \qquad (38)$$

Fig. 11.10 Computed phonon dispersions for supercells Al30Li
modeling vacancy-assisted diffusion of Li impurities in an aluminum
matrix. Left panel: initial state with a Li atom substituting for an Al
atom adjacent to an Al vacancy. Right panel: transition state of the Li
atom migrating into the adjacent vacancy site. The imaginary frequency
corresponds to the motion of the Li atom across the barrier and is
visualized by a negative frequency (Müller et al. 2007).
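Eqs. (37) and (38) can be combined in a short numerical sketch. All input
numbers below (geometry factors, jump length, attempt frequency, and the
energies entering the free energy difference) are hypothetical values chosen
for illustration, not results of the cited calculations, and the function names
are assumptions introduced here.

```python
import math

K_B = 8.617333262e-5  # Boltzmann constant in eV/K


def activation_free_energy(e_el_ts, e_el_0, e_vib_ts, e_vib_0,
                           s_vib_ts, s_vib_0, temperature):
    """Eq. (38): free energy difference between transition and initial state."""
    return ((e_el_ts - e_el_0) + (e_vib_ts - e_vib_0)
            - temperature * (s_vib_ts - s_vib_0))


def diffusion_coefficient(n, f, d, nu, delta_g, temperature):
    """Eq. (37): D = n f d^2 nu exp(-Delta G / (k_B T)); d in m, nu in Hz."""
    return n * f * d**2 * nu * math.exp(-delta_g / (K_B * temperature))


# Hypothetical inputs (eV, eV/K, K), for illustration only.
dg_500 = activation_free_energy(1.0, 0.4, 0.05, 0.06, 2.0e-4, 1.0e-4, 500.0)

# Constant Delta G of 0.6 eV assumed for a simple Arrhenius-type comparison.
d300 = diffusion_coefficient(n=12, f=1.0 / 6.0, d=2.86e-10, nu=9.3e12,
                             delta_g=0.6, temperature=300.0)
d600 = diffusion_coefficient(n=12, f=1.0 / 6.0, d=2.86e-10, nu=9.3e12,
                             delta_g=0.6, temperature=600.0)
print(f"Delta G(500 K) = {dg_500:.3f} eV")
print(f"D(300 K) = {d300:.3e} m^2/s, D(600 K) = {d600:.3e} m^2/s")
```

In an actual application $\Delta G$ would be recomputed at each temperature
from the phonon spectra of the initial and transition states rather than held
fixed as in this comparison.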


As an example, the phonon dispersions of the initial state and transition state 
for a Li impurity migrating into an adjacent Al vacancy in aluminum within a 
supercell model of 32 atoms are shown in Fig. 11.10 (Müller et al. 2007). This 
represents the most probable model for diffusion of Li impurities in Al, i.e., a 
vacancy-assisted diffusion process. The phonons have been calculated by the di- 
rect method as described in Section 11.2.4. The Li atom in the initial state, sub- 
stituting for an Al atom adjacent to an Al vacancy, is associated with three optical 
modes of high frequency (due to its smaller mass as compared to Al). Two of the 
modes are almost dispersionless and well separated from the other modes at 
10.80 and 10.65 THz and the third one is located just above the Al modes be- 
tween 9.2 and 9.4 THz and shows some dispersion. Inspection of the associated 
vibrations in real space reveals that the highest modes at 10.80 THz and 10.65 
THz correspond to two different vibrations perpendicular to the diffusion path. 
The lowest-lying Li mode between 9.2 and 9.4 THz represents a vibration along 
the diffusion path leading into the transition state. The attempt frequency v is 
therefore associated with this particular phonon mode. It is noted that all phonon 
frequencies of the initial state are positive, indicating a true minimum on the en- 
ergy hypersurface. At the transition state, the highest Li-related frequency is 
shifted toward higher frequencies (about 14 THz). This mode is a vibration of 

Fig. 11.11 Computed and experimental diffusion coefficient of lithium
impurity atoms in aluminum (Müller et al. 2007).


the Li atom perpendicular to the diffusion path. By definition, a transition state 
is characterized by the occurrence of one negative eigenvalue in the dynamical 
matrix. This corresponds to a motion of the Li atom along the diffusion path. 
This mode exhibits an imaginary frequency that is represented in Fig. 11.10 by 
a negative frequency of about −5.4 THz. By sampling phonon frequencies
throughout the whole Brillouin zone the phonon density of states is computed, 
which makes it possible to derive the temperature-dependent difference in free 
energy between the transition state and the initial state required for Eq. (37) of 
transition state theory. 
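
How sampled phonon frequencies enter such a free-energy difference can be illustrated with a minimal Python sketch. It assumes the standard harmonic expression for the vibrational free energy of independent modes and sums over a short, hypothetical list of frequencies instead of a full phonon density of states. The function names and the sample frequencies are illustrative only and are not taken from Müller et al. (2007). The imaginary mode of the transition state, plotted as a negative frequency in Fig. 11.10, is left out of the sum.

```python
import math

H = 6.62607015e-34      # Planck constant, J s
KB = 1.380649e-23       # Boltzmann constant, J/K
EV = 1.602176634e-19    # joule per electronvolt

def harmonic_free_energy(freqs_thz, temperature):
    """Harmonic vibrational free energy (eV) of a set of real modes.

    F_vib = sum_i [ h*nu_i/2 + kB*T*ln(1 - exp(-h*nu_i/(kB*T))) ]
    """
    total = 0.0
    for nu in freqs_thz:
        if nu <= 0.0:
            raise ValueError("only real (positive) frequencies contribute")
        quantum = H * nu * 1e12                 # h*nu in joule
        total += 0.5 * quantum                  # zero-point energy
        if temperature > 0.0:
            x = quantum / (KB * temperature)
            total += KB * temperature * math.log1p(-math.exp(-x))
    return total / EV

def vibrational_delta_f(freqs_initial, freqs_transition, temperature):
    """Vibrational free-energy difference TS - IS (eV).

    Negative entries stand for imaginary modes and are skipped.
    """
    real_ts = [nu for nu in freqs_transition if nu > 0.0]
    return (harmonic_free_energy(real_ts, temperature)
            - harmonic_free_energy(freqs_initial, temperature))

# Hypothetical mode lists in THz (placeholders, not computed data)
initial_state = [9.3, 10.65, 10.80]
transition_state = [-5.4, 10.7, 14.0]
print(vibrational_delta_f(initial_state, transition_state, 600.0))
```

In a real calculation the sums would run over all modes of the supercell, weighted by the phonon density of states, and the electronic energy difference would be added separately.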

The temperature-dependent diffusion coefficient as computed from Eq. (37) is 
shown in Fig. 11.11 and compared to available experimental data. Experimental 
diffusion data from Refs. (Bakker et al. 1990; Wen et al. 1980; Costas 1963; Ver- 
linden and Gijbels 1980) typically agree somewhat better than within an order of 
magnitude, and the computed diffusion coefficient constitutes an upper envelope 
to these measured data. That the calculation may provide an upper limit for the 
diffusivity is consistent with the fact that in the theoretical model calculations 
ideal crystals are considered, and neither trapping at impurity and defect sites 
nor grain boundaries that may decrease the diffusivity are included in the present 
model. Whereas at higher temperatures the model is quite consistent with
experimental determinations, at lower temperatures the experiments indicate an
increasingly lower diffusivity than the ab-initio results. At lower temperatures
the thermal energy might not be sufficient for the diffusing atoms to escape
trapping at defects and impurities.

The ab-initio approach for solid-state diffusion paradigmatically discussed here 
for the Li-Al system has so far been applied successfully to diffusion of impu- 
rities in various metals, both for interstitial impurities such as hydrogen, deute- 
rium, tritium, oxygen, carbon, and helium, as well as for self-diffusion processes 
and vacancy-assisted diffusion of substitutional impurities (Wimmer et al. 2007). 


11.3.4 
Impurity Effects on Grain Boundary Cohesion 


Ab-initio modeling has been used to investigate point defects in solids but planar 
defects such as grain boundaries or antiphase boundaries are also within its 
scope. As an application in this field we summarize here investigations on the 
effects of impurity elements He, Li, S, H, C, Zr, P, Fe, Mn, Nb, Cr, and B and 
some combinations thereof on the cohesion of a nickel Σ5{001} twist grain
boundary. These calculations were able to explain metallurgical effects (e.g., why 
boron strengthens nickel grain boundaries) and the mechanisms of irradiation 
embrittlement (e.g., how boron transmutation results in grain boundary embrit- 
tlement), and address the influence of impurities on the environmentally assisted 
cracking resistance of nickel-base alloys (for a more detailed discussion see Young 
et al. (2003)). 

For the quantitative assessment of the effect of impurities on the grain bound- 
ary cohesive strength, a Griffith-type fracture criterion [Eq. (39)] was used 
(Griffith 1920). 


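\[ \Delta E_{\mathrm{Griffith}} = \left(E_{\mathrm{surf}} - E_{\mathrm{surf}}^{\mathrm{imp}}\right) - \left(E_{\mathrm{GB}} - E_{\mathrm{GB}}^{\mathrm{imp}}\right) \tag{39} \]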


The Griffith energy E_Griffith is the energy required to cleave the grain boundary
(GB), creating two surfaces (surf). A positive value of the change in Griffith
energy as expressed by Eq. (39) indicates weakening of the grain boundary due to
the presence of impurity atoms. A negative value indicates grain boundary
strengthening by the impurity. The Griffith criterion can also be expressed in
terms of the critical energy release rate of a crack, given by Eq. (40), where σ is
the stress, a is half the crack length, and E is Young’s modulus.


\[ \text{Fracture energy per area} = \frac{\pi \sigma^{2} a}{E} \tag{40} \]
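
The sign convention of Eq. (39) and the relation in Eq. (40) can be made concrete with a short Python sketch. It reads Eq. (39) as the difference between the pure and the impurity-containing surface energies minus the corresponding difference for the grain boundary, so that a positive value means weakening. The classification threshold, Young's modulus, and crack length below are placeholders chosen for illustration, not values from Young et al. (2003).

```python
import math

def delta_griffith(e_surf, e_surf_imp, e_gb, e_gb_imp):
    """Change in Griffith energy in the convention of Eq. (39), in J/m^2."""
    return (e_surf - e_surf_imp) - (e_gb - e_gb_imp)

def classify(delta_e, tol=0.1):
    """Label an impurity; tol is an arbitrary illustrative threshold."""
    if delta_e > tol:
        return "embrittling"
    if delta_e < -tol:
        return "strengthening"
    return "little effect"

def critical_stress(fracture_energy, youngs_modulus, half_crack_length):
    """Solve Eq. (40), G = pi*sigma^2*a/E, for the stress sigma (Pa)."""
    return math.sqrt(fracture_energy * youngs_modulus
                     / (math.pi * half_crack_length))

# Changes in Griffith energy quoted in this section (J/m^2)
for element, d_e in {"He": 1.07, "P": -0.05, "B": -0.54}.items():
    print(element, classify(d_e))

# Assumed inputs: E = 200 GPa and a = 1 micrometre are placeholders;
# 3.33 J/m^2 is the fracture energy of the pure grain boundary from the text.
print(critical_stress(3.33, 200e9, 1e-6))
```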


The computational procedure to determine the fracture energy in the presence 
of a given impurity is as follows: (1) construction and optimization of the pure 
nickel grain boundary, (2) determination of the site preference of the impurity 
by optimization of corresponding models, (3) cleavage of the grain boundary, and
(4) structure optimization of the free surfaces. For the pure nickel Σ5{001} twist
grain boundary a theoretical fracture energy of 3.33 J m⁻² is computed, which is
considerably smaller than the energy of 4.30 J m⁻² required for cleaving the
nickel single crystal along an (001) plane. The determination of the site
preference yielded hydrogen, boron, and carbon as interstitial impurities, while
helium, phosphorus, sulfur, chromium, iron, manganese, and niobium are
substitutional impurities.

The main results of the calculations are summarized in Fig. 11.12. The upper 
panel compares the tendency of the impurity element to segregate from the bulk 
to the grain boundary or to the free surface while the lower panel shows the effect 
of the impurities on the grain boundary fracture energy. The impurity elements 
are ranked in the sequence He, Li, S, H, C, Zr, P, Fe, Mn, Nb, Cr, and B in order 
of most embrittling to most strengthening. Helium is strongly embrittling 
(ΔE_Griffith = 1.07 J m⁻²), phosphorus has little effect (ΔE_Griffith = −0.05 J m⁻²),
and boron offers considerable strengthening of the grain boundary
(ΔE_Griffith = −0.54 J m⁻²). These ab-initio results are consistent with
experimental observations since He, S, and H are, for instance, known embrittling
agents whereas boron is applied as a strengthener in nickel-base alloys. Boron is
embedded in the center of the grain boundary in a highly symmetric eight-fold
coordination to Ni atoms, and the analysis of the electronic structure reveals
s-to-p electron transfer and hybridization of B p and Ni d electrons. In the case
of carbon, by contrast, the s electrons lie lower in energy than the p electrons,
so that s-to-p electron transfer and subsequent hybridization with Ni d electrons
are inhibited. The strengthening of nickel-base alloys by boron can therefore be
understood in terms of the electronic structure.

In addition to single impurities, selected impurity pairs H-Li, H-B, H-C, 
H-P, and H-S have been investigated, and the results are included in Fig. 
11.12. Essentially no strong attraction between H and S, P, B, and C atoms is 
found in bulk, grain boundary, and surface environments. Since there is little in- 
teraction for the concentration tackled by the model, linear superposition of 
single impurity effects is a good approximation for the combined impurity ef- 
fects. The results from ab-initio modeling are again consistent with the experi- 
mental observation that S and H atoms as well as H and P atoms embrittle grain 
boundaries, but that phosphorus alone has little effect on intergranular fracture. 
In addition to the intrinsic effects of an impurity on hydrogen embrittlement,
effects on the hydrogen uptake have been studied. The experimental indication that
phosphorus may promote hydrogen entry by poisoning the hydrogen recombination
reaction 2H_ads → H2(g) is consistent with the calculations, since a weak
interaction energy of about 8 kJ mol⁻¹ between H and P atoms at the free surface is
computed. This weak bonding suggests that an impurity enhancing the stability 
of adsorbed H atoms at the free surface can act as a hydrogen recombination poi- 
son. For a detailed discussion on implications of the study in the field of metal- 
lurgy, irradiation embrittlement, and cracking resistance of nickel-base alloys, see 
Young et al. (2003). 

Fig. 11.12 Comparison of the tendency of impurity elements and pairs
of impurities to segregate to the free surface or to the Ni Σ5{001} twist
grain boundary (upper panel). Lower panel: Comparison of the effect of
impurity elements and pairs of impurities on the cohesive strength of a
Ni Σ5{001} twist grain boundary (Young et al. 2003).


11.3.5
Toward Multiscale Modeling: Cluster Expansion 


Although DFT-based methods as described in the earlier sections allow for a de- 
tailed prediction of structural, vibrational, magnetic, and electronic properties of 
ordered compounds, standard DFT methods can be applied for only a rather 
small number of atoms per unit cell. The largest number feasible for ab-initio
DFT calculations is a few hundred atoms in the unit cell. In this section,
it will be demonstrated how the limitation to this rather small number of atoms 
can be overcome without losing the accuracy of DFT calculations, namely by com- 
bining DFT results with the concept of the cluster expansion (CE) (Sanchez et al. 
1984), by which the atomic and mesoscopic length scales are joined together. For 
an understanding of processes such as decomposition reactions (i.e., formation of 
precipitates) it is essential that the energetics can be separated into elastic (me- 
chanical) and chemical parts, which requires the treatment of the elastic energy 
in the framework of a direction- and concentration-dependent elasticity theory, 
which will be discussed first. The extension to finite temperatures is then realized 
by Monte Carlo (MC) simulations. Besides modeling thermodynamic properties, 
the use of the CE Hamiltonian in a kinetic MC approach allows the treatment of 
the alloy problem on a time scale of hours or even days (Müller 2003). 

The starting point for modeling the elastic energy is the ideal, flat interface
connecting crystals consisting of only A or only B atoms, as sketched in Fig. 11.13:
the two “blocks” of material A and B possess a common in-plane lattice constant
a_p perpendicular to the chosen direction Ĝ. Along this direction both parts are
allowed to relax geometrically. This situation is comparable to a pseudomorphic,
epitaxial system without dislocations at the interface.

From the discussion above, it becomes clear that the epitaxial strain energy
ΔE_A^epi(a_p, Ĝ) is the energy needed to deform the elemental solid A epitaxially
(or biaxially) to the common lattice constant a_p in the two directions orthogonal
to Ĝ while relaxing along Ĝ. Forbidding relaxations along the direction Ĝ
corresponds to a hydrostatic deformation. The corresponding energy is called the
hydrostatic deformation energy ΔE_A^hydro(a_p) for the lattice parameter a_p. The
relationship


Fig. 11.13 Ideal interface perpendicular to direction Ĝ formed by two
pieces of material A and B with a common in-plane lattice constant a_p.
In the case of an epitaxial (biaxial) deformation both sides are allowed
to relax along Ĝ. The corresponding energy is called the epitaxial strain
energy of the element A or B, respectively.

ΔE_A^hydro(a_p) > ΔE_A^epi(a_p, Ĝ) is obvious, because it is always easier to
deform a material epitaxially (biaxially) than hydrostatically (triaxially). The
ratio of these two energies [Eq. (41), with 0 < q < 1] defines the epitaxial
softening function (Wood and Zunger 1988; Zunger 1994a)


\[ q(a_p, \hat{G}) = \frac{\Delta E_A^{\mathrm{epi}}(a_p, \hat{G})}{\Delta E_A^{\mathrm{hydro}}(a_p)} \tag{41} \]

Small values of q(a_p, Ĝ) indicate elastically soft directions Ĝ. As an example,
Fig. 11.14 shows the calculated softening functions q(a_p, Ĝ) for the fcc elements
Pt and Rh along the three fundamental crystal directions. The vertical lines mark
the DFT-derived equilibrium lattice parameters of Pt and Rh. For these values the
harmonic elasticity theory must be valid, i.e., the directions have to follow a
well-defined sequence with respect to their elastic softness. Indeed, in harmonic
elasticity theory, q depends only on the direction Ĝ, but not on the chosen lattice
constant a_p (Bottomley and Fons 1996; Laks et al. 1992; Zunger 1994a), as in
Eq. (42) with the cubic bulk modulus B = (C11 + 2C12)/3 and the anisotropy
parameter Δ = C44 − (C11 − C12)/2.


\[ q_{\mathrm{harm}}(\hat{G}) = 1 - \frac{B}{C_{11} + \Delta\,\gamma_{\mathrm{harm}}(\hat{G})} \tag{42} \]


As described in Section 11.2.3 the elastic constants C11, C12, and C44 can be
obtained easily from DFT calculations, and consequently Δ and B, too. The
geometric function γ_harm depends on the polar angle θ and the azimuth angle φ of
the direction vector Ĝ [Eq. (43)].


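\[ \gamma_{\mathrm{harm}}(\phi, \theta) = \sin^{2}(2\theta) + \sin^{4}(\theta)\,\sin^{2}(2\phi) = \frac{4}{5}\sqrt{4\pi}\,K_{0}(\phi, \theta) - \frac{8}{5}\sqrt{\frac{4\pi}{21}}\,K_{4}(\phi, \theta) \tag{43} \]
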
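
As a small numerical check of the harmonic expressions, the following Python sketch evaluates the geometric function of Eq. (43) in its trigonometric form and the harmonic softening function of Eq. (42) for the three fundamental cubic directions. The elastic constants are round, hypothetical numbers for a generic fcc metal and are not the DFT values of this chapter; the function names are likewise illustrative.

```python
import math

def gamma_harm(direction):
    """Geometric function sin^2(2*theta) + sin^4(theta)*sin^2(2*phi)."""
    x, y, z = direction
    r = math.sqrt(x * x + y * y + z * z)
    theta = math.acos(z / r)        # polar angle
    phi = math.atan2(y, x)          # azimuth angle
    return (math.sin(2 * theta) ** 2
            + math.sin(theta) ** 4 * math.sin(2 * phi) ** 2)

def q_harm(direction, c11, c12, c44):
    """Harmonic epitaxial softening function for a cubic crystal."""
    bulk = (c11 + 2 * c12) / 3              # cubic bulk modulus B
    delta = c44 - 0.5 * (c11 - c12)         # anisotropy parameter
    return 1 - bulk / (c11 + delta * gamma_harm(direction))

# Assumed elastic constants in GPa (placeholders for illustration)
C11, C12, C44 = 350.0, 250.0, 75.0
for g in [(1, 0, 0), (1, 1, 0), (1, 1, 1)]:
    print(g, round(gamma_harm(g), 4), round(q_harm(g, C11, C12, C44), 4))
```

For these inputs the anisotropy parameter is positive, so the direction with the smallest value of the geometric function, [100], comes out as the elastically softest one. In the harmonic picture this ordering cannot change with the lattice parameter.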
Here, the parameter K_l is the cubic harmonic for the angular momentum l. However,
this definition fails to describe the observed crossing of individual curves as
found for Pt and Rh (Fig. 11.14) for values different from the equilibrium lattice
parameter. This means that modeling strain in the framework of the harmonic
theory is not sufficient for metals. Obviously, the crystallographic order of elastic
softness can change as a function of the lattice parameter. For example, for Rh a
decrease of 1% (Fig. 11.14) from the equilibrium lattice constant leads to a
change in hierarchy with respect to the q values: the [110] direction is now
elastically harder than the [111] direction, while at equilibrium the opposite is
true. This clearly indicates that for a description of strain effects in metals,
not only the direction dependence of strain (anisotropic strain effects), but also
the dependence of strain on the lattice parameter (anharmonic strain effects) must
be taken into account (Ozolins et al. 1998a,b). This is achieved by introducing a
geometric function γ, which depends on the lattice parameter a_p, according to
Eq. (44).

Fig. 11.14 Epitaxial softening function q(a_p, Ĝ) [Eq. (41)] for Pt and Rh
calculated by DFT. The vertical lines mark the calculated equilibrium
lattice constants. Lines are to guide the eye.


\[ \gamma(a_p, \hat{G}) = \gamma_{\mathrm{harm}}(\hat{G}) + \sum_{l=0}^{l_{\max}} b_{l}(a_p)\,K_{l}(\hat{G}) \tag{44} \]


This relationship now also includes higher-order cubic harmonics, as necessary to
go beyond the harmonic approximation (for more details see, e.g., Ozolins et al.
(1998a)). Then, Eq. (42) can be reformulated as Eq. (45).


\[ q(a_p, \hat{G}) = 1 - \frac{B}{C_{11} + \Delta\,\gamma(a_p, \hat{G})} \tag{45} \]





With the function q(a_p, Ĝ) as derived from DFT calculations (Fig. 11.14), the
quantity γ(a_p, Ĝ) is defined by Eq. (45) and, in turn, the coefficients b_l(a_p)
result from applying Eq. (44). The determination of the coefficients b_l(a_p)
permits a generalization of the calculated epitaxial energies ΔE_A^epi(a_p, Ĝ) for
a discrete set of directions to a function of arbitrary directions Ĝ.

Fig. 11.15 Relationship between lattice parameter and concentration for
Pt-Rh superlattices along five different directions.


The next step is parameterization of the equilibrium constituent (or coherency)
strain energy ΔE_CS^eq(x, Ĝ), which is defined as the strain energy required to
maintain coherency between a “piece” of material A and a “piece” of material B
along an interface with orientation Ĝ. Thus, a superlattice A_nB_n along a certain
direction Ĝ with n → ∞ is constructed. In practice, the calculated elemental
epitaxial energies are used to derive the constituent strain energy, which is
determined by the equilibrium value of the composition-weighted sum of the
epitaxial energies for A and B atoms [Eq. (46)].


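\[ \Delta E_{\mathrm{CS}}^{\mathrm{eq}}(x, \hat{G}) = \min_{a_p}\left[x\,\Delta E_A^{\mathrm{epi}}(a_p, \hat{G}) + (1 - x)\,\Delta E_B^{\mathrm{epi}}(a_p, \hat{G})\right] \tag{46} \]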


The parameter a_p(x) is the lattice parameter which minimizes ΔE_CS^eq at each
concentration x. The minimization according to Eq. (46) yields a direct
relationship between a_p and the average superlattice concentration x, as shown in
Fig. 11.15. In this figure it is noticeable that, dependent on the direction,
deviations from Vegard’s law clearly occur. Figure 11.16 presents the resulting
equilibrium constituent strain as a function of concentration and direction. As
discussed above, the hierarchy of the elastic softness depends on the
concentration of the superlattice.
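
The minimization in Eq. (46) can be sketched in a few lines of Python. The example below replaces the DFT-derived epitaxial energies by toy quadratic functions of the in-plane lattice constant and scans a grid of candidate values; the stiffnesses, lattice constants, and function names are assumptions made purely for illustration.

```python
def quadratic_epi(stiffness, a_eq):
    """Toy epitaxial energy k*(a_p - a_eq)^2, a stand-in for DFT data."""
    return lambda a_p: stiffness * (a_p - a_eq) ** 2

def constituent_strain(x, epi_a, epi_b, a_min, a_max, steps=2001):
    """Grid search for the common lattice constant minimizing Eq. (46).

    Returns (a_p, energy) for the weighting x*epi_a + (1 - x)*epi_b.
    """
    best_a, best_e = None, float("inf")
    for i in range(steps):
        a_p = a_min + (a_max - a_min) * i / (steps - 1)
        energy = x * epi_a(a_p) + (1.0 - x) * epi_b(a_p)
        if energy < best_e:
            best_a, best_e = a_p, energy
    return best_a, best_e

# Hypothetical constituents: A is softer (k = 1) than B (k = 3)
epi_a = quadratic_epi(1.0, 4.0)
epi_b = quadratic_epi(3.0, 3.8)
a_p, e_cs = constituent_strain(0.5, epi_a, epi_b, 3.8, 4.0)
print(a_p, e_cs)
```

With unequal stiffnesses the optimal lattice constant is pulled toward the stiffer constituent and no longer coincides with the linear interpolation between the two equilibrium values (3.9 in this toy case), which is the kind of deviation from Vegard's law referred to above.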

The equilibrium constituent strain can be illustrated by a three-dimensional
parameterization in terms of a sum of cubic harmonics allowing for a prediction of
the constituent strain energy for any arbitrary direction, as shown in Fig. 11.17
for Pt-Rh superlattices corresponding to three different Rh concentrations.
Thereby, the distance from the surface to the center of the cube represents the
size of the strain energy in this crystallographic direction. Therefore, if the
deviation from an ideal sphere represents a depletion, as observed for
Pt0.75Rh0.25 along the square face diagonal (or along the [110] direction), then
this direction is elastically soft, while an elongated shape, such as occurs for
the [111] direction, represents an elastically hard direction. In summary,
Fig. 11.17 demonstrates the necessity to consider the concentration and direction
dependence of strain in metal alloys.

Fig. 11.16 Constituent strain energies ΔE_CS^eq [Eq. (46)] for Pt-Rh
superlattices vs. composition for five different directions. All energy
differences are given with respect to the ideal undistorted fcc phases of
Pt and Rh.

A very efficient tool to bridge the gap between atomic and mesoscopic length
scales is the concept of the cluster expansion (CE). The basic idea of CE (Sanchez
et al. 1984) is to express the geometrically relaxed energy, E(σ), for arbitrary
lattice configurations σ on an underlying lattice as a linear expansion of energies
which are characteristic for geometrical figures (or clusters) such as biatoms,
triatoms, etc. (Fig. 11.18). For practical reasons, the “alloy problem” is
transformed to an Ising model. To each atom i of the alloy A_{1−x}B_x a spin value
is assigned in such a way that S_i = −1 if i represents an A atom and S_i = +1 if
i represents a B atom. Then, the energy of each configuration can be expressed by
an Ising expansion, Eq. (47).





Fig. 11.17 Parameterized three-dimensional presentation of the
constituent strain energy ΔE_CS^eq [Eq. (46)] for the Pt-Rh system for
compositions of 25%, 50%, and 75% Rh. The distance from the surface
to the center of the cube represents the size of the strain energy.

Fig. 11.18 The concept of cluster expansions. The crystal structure is
decomposed into characteristic geometrical figures (or clusters), as
shown for the fcc lattice. The energy of any configuration can then be
written as a linear combination of the characteristic energies J_F of the
figures.


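\[ E(\sigma) = J_{0} + \sum_{i} J_{i} S_{i}(\sigma) + \sum_{j<i} J_{ij} S_{i}(\sigma) S_{j}(\sigma) + \sum_{k<j<i} J_{ijk} S_{i}(\sigma) S_{j}(\sigma) S_{k}(\sigma) + \cdots \tag{47} \]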


The first two terms on the right define the energy of the random alloy (with zero
mutual interactions), the third term contains all pair interactions, the fourth all
three-body interactions, and so on. This equation can be made more compact by
introducing a correlation function Π̄_F for each class of symmetry-equivalent
geometrical figures F (Zunger 1994b), as in Eq. (48).


\[ \bar{\Pi}_{F}(\sigma) = \frac{1}{N D_{F}} \sum_{f} S_{i_1}(\sigma)\, S_{i_2}(\sigma) \cdots S_{i_m}(\sigma) \tag{48} \]


The symbol D_F denotes the number of geometrical figures of class F per site of
the lattice. The index f sums over all the N D_F geometrical figures in class F,
whereas m denotes the number of sites of geometrical figure f. In this way, Eq.
(47) transforms to Eq. (49) (Laks et al. 1992).


\[ E(\sigma) = N \sum_{F} D_{F} J_{F}\, \bar{\Pi}_{F}(\sigma) \tag{49} \]


The key point now is how the interactions J_F can be determined. In principle, the
direct inversion method of Connolly and Williams (1983) can be applied. In this
procedure, a set of N_σ (about 15-40) mostly simple ordered compounds with
typically no more than 2-16 basis atoms is chosen, for which the energies E_DFT of
the geometrically relaxed structures are derived from DFT calculations. Then,
the N_σ energies are used to construct the N_F interactions J_F by solving Eq. (50)
with respect to the N_F values of the unknown cluster interactions J_F.


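\[ \sum_{\sigma}^{N_{\sigma}} \left[ E_{\mathrm{DFT}}(\sigma) - N \sum_{F}^{N_{F}} D_{F} J_{F}\, \bar{\Pi}_{F}(\sigma) \right]^{2} = \text{Minimum} \tag{50} \]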
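
The least-squares determination of the interactions can be sketched for a deliberately tiny toy model. The Python example below uses a periodic one-dimensional chain with only three figures (empty, point, and nearest-neighbor pair, each with one figure per site), works with energies per site, and generates its “input energies” from known interactions so that the fit can be checked. All of this is an assumption for illustration; a real application uses three-dimensional figures and DFT energies.

```python
def correlations(spins):
    """Correlation functions of a periodic 1D chain: empty, point, NN pair."""
    n = len(spins)
    point = sum(spins) / n
    pair = sum(spins[i] * spins[(i + 1) % n] for i in range(n)) / n
    return [1.0, point, pair]

def solve_linear(matrix, rhs):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(rhs)
    m = [row[:] + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("input structures do not determine all J_F")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = m[r][n] - sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = s / m[r][r]
    return x

def fit_interactions(configs, energies_per_site):
    """Least-squares J_F via the normal equations (needs N_F <= N_sigma)."""
    rows = [correlations(c) for c in configs]
    n_f = len(rows[0])
    if len(rows) < n_f:
        raise ValueError("fewer input structures than interactions")
    ata = [[sum(r[a] * r[b] for r in rows) for b in range(n_f)]
           for a in range(n_f)]
    atb = [sum(r[a] * e for r, e in zip(rows, energies_per_site))
           for a in range(n_f)]
    return solve_linear(ata, atb)

# Five toy input structures (spin -1 = A, +1 = B) and synthetic energies
configs = [[-1, -1, -1, -1], [1, 1, 1, 1], [-1, 1, -1, 1],
           [-1, -1, 1, 1], [-1, -1, -1, 1]]
true_j = [0.10, -0.02, 0.05]            # hypothetical J_0, J_point, J_pair
energies = [sum(j * p for j, p in zip(true_j, correlations(c)))
            for c in configs]
print(fit_interactions(configs, energies))
```

Because the synthetic energies are exactly representable by the three figures, the fit returns the interactions used to generate them. With real DFT input the residual of Eq. (50) is finite, and the choice of figures and input structures becomes the central question.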







Clearly, for finding solutions the relationship N_F < N_σ must hold. Obviously,
there are two remaining problems which have to be solved, namely (a) how to
select the set of input structures which describes the energetics of the system and
(b) how to select relevant geometrical figures for which the cluster interactions
will be constructed. As recently shown (Hart et al. 2005), the second problem
can be handled by applying a genetic algorithm for the selection of geometrical
figures. Typically, this algorithm can select a set of figures from all figures with
up to, say, six sites (six-body interactions) for given maximum distances between
the atomic sites. This is called the “inner loop” (for details, see, e.g., Hart et al.
2005). Regarding the first problem, one efficient tool with which to find
structures with important “structure information” for the determination of the
interactions is a ground-state search (Ferreira et al. 1991) in the early stage of the
construction: for a “starting set” of about 15-30 DFT energies of arbitrary input
structures with typically no more than 2-20 basis atoms, a fit by CE is made.
The resulting interactions J_F are then used to predict the energy of all possible
structures with, e.g., up to 16 atoms per unit cell. (This is indeed a very reasonable
restriction, since most known stable structures of binary metal alloys have fewer
than 16 atoms per unit cell.) Such an analysis based on Eq. (49) needs only a few
minutes on a standard PC. At the end, the corresponding CE formation
enthalpies ΔH_f of all the structures are plotted versus composition, and a
ground-state line is constructed. Hereby, ΔH_f(σ) is defined as the energy gain or
loss per atom with respect to the bulk constituents at their equilibrium lattice
constants. For an intermetallic compound A_mB_n (consisting of m A and n B atoms
per unit cell) ΔH_f is given by Eq. (51), with a_A and a_B being the equilibrium
lattice constants of the elements A and B in their bulk, and E_A^tot(a_A) and
E_B^tot(a_B) the respective total energies.


\[ \Delta H_{f}(A_{m}B_{n}) = \frac{1}{m+n}\left[ E^{\mathrm{tot}}(A_{m}B_{n}) - m\,E_{A}^{\mathrm{tot}}(a_{A}) - n\,E_{B}^{\mathrm{tot}}(a_{B}) \right] \tag{51} \]


The result of such a ground-state search is shown schematically in Fig. 11.19. An
individual structure σ only contributes to the ground-state line if the energy
average of the stable structures at the next higher and lower concentrations is
energetically less favorable than the formation enthalpy of σ. More precisely, for
three structures α, σ, and β with x(α) < x(σ) < x(β) which are the lowest in energy
for their individual concentrations, the structure σ has to fulfill the
thermodynamical stability condition of Eq. (52) to be the ground state for x(σ).


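\[ \Delta H_{f}(\sigma) < \frac{x(\sigma) - x(\beta)}{x(\alpha) - x(\beta)}\,\Delta H_{f}(\alpha) + \frac{x(\sigma) - x(\alpha)}{x(\beta) - x(\alpha)}\,\Delta H_{f}(\beta) \tag{52} \]
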
Fig. 11.19 Schematic groundstate diagram of a binary alloy A_{1−x}B_x
(Müller 2003). The groundstate line was constructed from 60 energies of
geometrically relaxed structures (given by dots) by means of Eq. (52).
Besides the pure elemental crystals the groundstate line is formed by the
three structures α, σ, and β for the concentrations x = 0.25, 0.50, and
0.75, respectively. If the energy of σ were above the dashed tie line
between α and β, a mixture of α and β would be more stable than σ.

If Eq. (52) holds, then a mixture of the phases α and β would be higher in
energy than structure σ. With the ground-state line constructed, one checks for
all structures which lie on or very close to it (“excited states”) whether they
have already been considered as input structures for the CE. If not, their DFT
energy is calculated and added to the input set of structures, while input
structures with ΔH_f values far above the ground-state line are removed from the
input structure list. The procedure is repeated until the ground-state line becomes
stable. This is called the “outer loop.” An example for a ground-state line is
discussed in Section 11.3.6.
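
The stability condition of Eq. (52) and the construction of a ground-state line can be sketched as follows. The Python example keeps the lowest enthalpy at each concentration and then builds the lower convex hull of the remaining points; the data are hypothetical numbers chosen for illustration, and the function names are not from any particular CE code.

```python
def is_ground_state(x_s, h_s, x_a, h_a, x_b, h_b):
    """Stability test of Eq. (52) for sigma between alpha and beta."""
    tie_line = ((x_s - x_b) / (x_a - x_b)) * h_a \
             + ((x_s - x_a) / (x_b - x_a)) * h_b
    return h_s < tie_line

def ground_state_line(points):
    """Lower convex hull of (x, dH) points, sorted by concentration."""
    lowest = {}
    for x, h in points:                      # keep lowest dH per concentration
        if x not in lowest or h < lowest[x]:
            lowest[x] = h
    hull = []
    for p in sorted(lowest.items()):
        # drop the last vertex while it lies on or above the new tie line
        while len(hull) >= 2:
            (x1, h1), (x2, h2) = hull[-2], hull[-1]
            cross = (x2 - x1) * (p[1] - h1) - (h2 - h1) * (p[0] - x1)
            if cross <= 0:
                hull.pop()
            else:
                break
        hull.append(p)
    return hull

# Hypothetical formation enthalpies (x, dH in meV/atom), pure elements at 0
data = [(0.0, 0.0), (0.25, -10.0), (0.5, -30.0), (0.5, -5.0),
        (0.75, -12.0), (1.0, 0.0)]
print(ground_state_line(data))
```

In this toy data set only the structure at x = 0.5 survives between the pure elements: the points at x = 0.25 and x = 0.75 lie above the tie lines to their neighbors and would decompose into a mixture.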

Although the concept discussed allows for a prediction of formation enthalpies
of ordered compounds and the random alloy case, it fails to predict the stability
for long-range coherent systems on a mesoscopic scale, such as coherent
nanoclusters in metal alloys or long-period superlattices: for a given superlattice
A_nB_n, Eq. (49) predicts the asymptotic formation enthalpy ΔH_f = 0 for n → ∞.
As first shown by Laks et al. (1992), this is an intrinsic fault of any finite CE and
easy to understand: if one considers an atom A of an A_nB_n superlattice “far” away
from the A/B interface (i.e., all geometrical figures f connect the A atom
exclusively to other A atoms), then according to the finite CE, atom A is in a bulk
environment and consequently ΔH_f = 0. However, the formation enthalpy of such
an infinite superlattice should then be given by the remaining elastic energy,
because in the limit n → ∞ the superlattice formation enthalpy depends only on its
strained constituents, and not on the interface properties. The problem can be
solved (Laks et al. 1992; Zunger 1994b) by transforming a group of interactions
to the reciprocal space and treating the constituent strain term explicitly as
discussed at the beginning of this section and in refs. (Wood and Zunger 1988;
Zunger 1994a). The transformation is easiest for pair interactions. For that case,
the Fourier transform of real-space pair interactions, J_pair(k), and the structure
factor S(k, σ) are introduced [Eqs. (53) and (54)].
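
\[ J_{\mathrm{pair}}(\mathbf{k}) = \sum_{j} J_{\mathrm{pair}}(\mathbf{R}_{i} - \mathbf{R}_{j}) \exp(-i\,\mathbf{k}\cdot\mathbf{R}_{j}) \tag{53} \]

\[ S(\mathbf{k}, \sigma) = \sum_{j} S_{j} \exp(-i\,\mathbf{k}\cdot\mathbf{R}_{j}) \tag{54} \]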
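
To give a feeling for the structure factor of Eq. (54), the following Python sketch evaluates it for a one-dimensional chain of spins with unit spacing. The restriction to one dimension and the function name are simplifying assumptions; in three dimensions the product of k and the site index becomes a dot product of the wave vector with the lattice vector.

```python
import cmath
import math

def structure_factor(spins, k, spacing=1.0):
    """S(k) = sum_j S_j * exp(-i*k*R_j) for sites at R_j = j*spacing."""
    return sum(s * cmath.exp(-1j * k * j * spacing)
               for j, s in enumerate(spins))

# Alternating A/B chain (period 2) and an A2B2 chain (period 4)
alternating = [-1, 1] * 4
a2b2 = [-1, -1, 1, 1] * 2
for k in (0.0, math.pi / 2, math.pi):
    print(k, abs(structure_factor(alternating, k)) ** 2,
          abs(structure_factor(a2b2, k)) ** 2)
```

The squared modulus is concentrated at the wave vector matching the period of the ordering, which is why a sum over k weighted with a k-dependent interaction can distinguish superlattices of different period.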


Then the formation enthalpy of any arbitrary atomically relaxed configuration
σ is given by Eq. (55) (Zunger 1994b).


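\[ \Delta H_{\mathrm{CE}}(\sigma) = \sum_{\mathbf{k}} J_{\mathrm{pair}}(\mathbf{k})\,|S(\mathbf{k}, \sigma)|^{2} + \sum_{F} D_{F} J_{F}\, \bar{\Pi}_{F}(\sigma) + \Delta E_{\mathrm{CS}}(\sigma) \tag{55} \]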


This relationship, as introduced by Zunger and coworkers (Laks et al. 1992;
Zunger 1994b), is the so-called mixed-space cluster expansion (MSCE). The first
term includes all the geometrical pair figures in k-space. The second term
represents many-body interactions and runs over symmetrically inequivalent clusters
consisting of three or more lattice sites. It also includes J_0 and J_i from Eq. (47).
The symbol D_F again represents the number of equivalent clusters per lattice site,
whereas Π̄_F(σ) denotes the structure-dependent geometrical coefficients of Eq.
(48). The last term represents the constituent strain energy of σ, ΔE_CS(σ), and
can be derived by expanding the equilibrium constituent strain energy [Eq. (46)],
ΔE_CS^eq(x, k̂), as in Eq. (56) (Laks et al. 1992; Wolverton et al. 2000), utilizing
Eq. (57).


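\[ \Delta E_{\mathrm{CS}}(\sigma) = \sum_{\mathbf{k}} J_{\mathrm{CS}}(x, \hat{\mathbf{k}})\,|S(\mathbf{k}, \sigma)|^{2} \tag{56} \]

\[ J_{\mathrm{CS}}(x, \hat{\mathbf{k}}) = \frac{\Delta E_{\mathrm{CS}}^{\mathrm{eq}}(x, \hat{\mathbf{k}})}{4x(1 - x)} \tag{57} \]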


In this relationship, J_CS contains the correct long-period superlattice limit,
namely the constituent strain energy.

Analogously to the real-space expansion in Eq. (49), the formation enthalpies
ΔH_f(A_mB_n) of ordered compounds are needed as an input for the determination
of the coefficients {J_pair(k)} and {J_F} of the CE. The pair and multi-body
interactions result from the fit to the N_σ formation enthalpies {ΔH_f}, minimizing
the root mean square (rms) error [Eq. (58)] (Zunger 1994b).


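\[ \Delta_{\mathrm{rms}} = \frac{1}{N_{\sigma}} \sum_{\sigma} w_{\sigma}\left[\Delta H_{\mathrm{CE}}(\sigma) - \Delta H_{\mathrm{LDA}}(\sigma)\right]^{2} + \frac{t}{\alpha} \sum_{\mathbf{k}} J_{\mathrm{pair}}(\mathbf{k}) \left[-\nabla_{\mathbf{k}}^{2}\right]^{\lambda/2} J_{\mathrm{pair}}(\mathbf{k}) \tag{58} \]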


In this equation, λ and t are free parameters and α is a normalization constant
(Laks et al. 1992). The second term on the right, i.e., the k-space smoothness
criterion, automatically selects essential short-range pair interactions. After
Eq. (58)

Table 11.8 Formation enthalpies of the Al-Zn system derived from DFT
calculations (“direct”) and cluster expansion (“CE”). Compounds are
sorted by superlattice direction and composition.[a]

Stoichiometries (x_Al): Zn (0.0), AlZn3 (0.25), AlZn2 (0.333),
Al2Zn3 (0.40), AlZn (0.50), Al2Zn2 (0.50), Al3Zn3 (0.50), Al4Zn4 (0.50),
Al3Zn2 (0.60), Al2Zn (0.667), Al3Zn (0.75), Al7Zn (0.875), Al (1.00).

Formation enthalpies [meV atom⁻¹]. Within each direction the structures
are listed in order of increasing x_Al.

Direction   Structure   direct      CE
[100]       β2            14.9    18.3
[100]       Z7*            7.0     7.2
[100]       L1₀           23.5    26.9
[100]       Z2             9.0     9.6
[100]       Z6*            6.2     5.7
[100]       Z5*            6.0     6.2
[100]       β1            17.4    20.4
[100]       Z1            10.2    14.1
[110]       Y3*           24.8    24.2
[110]       γ2            17.7    21.3
[110]       Y2            24.6    26.8
[110]       Y6*           18.8    16.4
[110]       γ1            32.6    33.7
[110]       Y1*           26.0    27.2
[111]       V3             4.3     3.7
[111]       α2             2.0     3.3
[111]       L1₁            7.4     9.9
[111]       V2             4.8     7.3
[111]       V6             2.8     3.2
[111]       V8*            1.8     1.7
[111]       α1            15.9    16.7
[111]       V1            14.6    19.7
[201]       D0₂₂b         13.3    12.5
[201]       CH(40)        24.8    28.8
[201]       D0₂₂a         27.6    31.6
[311]       W3             9.0    11.0
[311]       W2            18.6    22.3
[311]       W1*           23.0    25.8
others      pure Zn        0.0     0.6
others      L1₂            5.3    10.2
others      SQS8*         18.4    20.9
others      L1₂           35.1    34.9

Stoichiometry   x_Al    Structure   direct      CE
Al7Zn           0.875   D7            10.7    12.4
Al              1.00    fcc            0.0    −0.6

* Enthalpies not used in the CE fit; they represent predictions.

[a] Average error (standard deviation of the CE enthalpies of 25 input
structures) = 1.5 meV; average prediction error (standard deviation
of all 9 predicted structures) = 2.2 meV; maximum error (largest
deviation between the CE and DFT value of the structures
considered; Müller et al. 1999) = 4.1 meV.


has been applied to the pair interactions, a genetic algorithm is applied to select a
set of figures allowing for a quantitative modeling of the alloy’s energetics. As an
example, Table 11.8 lists the N_σ = 25 input formation enthalpies ΔH_f (“direct”)
and the MSCE enthalpies (“CE”) for Al-Zn. All enthalpies with an asterisk
denote structures not used in the fit, i.e., they represent “real” predictions. The
table gives the stoichiometry of all input structures as well as information on
whether the structures comprise a superlattice of Al and Zn layers along a certain
direction. For example, V3 is defined by an AlZn3 superlattice in the [111]
direction for which there are always three Zn(111) layers followed by one Al(111)
layer. The average error of fitting N_σ = 25 enthalpies is 1.5 meV whereas the
resulting prediction error for the nine structures not used for the fit is 2.2 meV.
Figure 11.20 shows the resulting pair and multibody interactions for the Al-Zn
system. Clearly, pair interactions converge rapidly, and therefore 15-20 pair
interactions are sufficient.

For finite temperature studies, Eq. (55) can be used in MC simulations. For
studying thermodynamic properties we applied a simple Metropolis algorithm
(Metropolis et al. 1953) allowing for mutual flipping of pairs of A and B atoms
at an arbitrary distance. The aim is to reach the equilibrium configuration as
fast as possible. In addition to the temperature dependence of the free energy,
MC simulations can be used to calculate coherent phase boundaries in the phase
diagram. Following the fluctuation-response theorem (Toda et al. 1983), the
specific heat c_v at a certain temperature can be determined from the observation
that c_v is proportional to the equilibrium fluctuations of the energy,
⟨E²⟩ − ⟨E⟩². Since the energy exhibits a point of inflection for a second-order
phase transition at the transition temperature T_trans, its response function
c_v = (∂E/∂T)_v has a

Fig. 11.20 Pair and multibody effective cluster interactions for Al-Zn
deduced from the fit of Table 11.8. The symbols characterize individual
multibody interactions: “3” denotes three- and “4” four-body
interactions. In general, the distance between atoms increases with the
letter, i.e., for example “J” means only nearest neighbors, “K” nearest
neighbors and one second-nearest neighbor, and so on.


maximum at T_trans (Fig. 11.21a). Although a phase transition is strictly defined
for an infinite system, one usually also speaks about a phase transition of a finite
system, given by the maximum of c_v at the transition temperature, as illustrated
in Fig. 11.21(a). If the MC simulations are applied for different concentrations x,
Fig. 11.21 (a) Schematic plot of the specific heat vs. temperature near a
second-order phase transition: c_v has a maximum at T_trans; (b) calculated
coherent phase boundary for Al-rich Al-Cu (Müller et al. 1999) and 
comparison to experimental data (Murray 1985) (open circles). 
the resulting temperatures T_trans can be used to construct the coherent phase
boundary of a system, as displayed in Fig. 11.21(b) for the Al-rich side of the 
Al-Cu phase diagram (Müller et al. 1999). The open circles are measured values 
(Murray 1985). A small piece of the incoherent phase boundary is also shown. 
However, this boundary cannot be calculated by the method presented because it 
is restricted to coherent problems. 
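
The Monte Carlo procedure described above can be illustrated by a minimal sketch. It does not use the MSCE Hamiltonian of Eq. (55); as a stand-in, it assumes a toy nearest-neighbor pair energy on a two-dimensional periodic square lattice, with k_B = 1 and all function and parameter names (metropolis_exchange, coupling, kt, and so on) chosen for illustration only. What it does show is the exchange of an A and a B atom at arbitrary distance with Metropolis acceptance, and the estimate of c_v from the energy fluctuations ⟨E²⟩ − ⟨E⟩².

```python
import math
import random


def bond_energy_at(spins, size, i, j, coupling):
    """Energy of the four nearest-neighbor bonds around site (i, j), periodic cell."""
    neighbors = (spins[(i + 1) % size][j] + spins[(i - 1) % size][j]
                 + spins[i][(j + 1) % size] + spins[i][(j - 1) % size])
    return coupling * spins[i][j] * neighbors


def total_energy(spins, size, coupling):
    """Toy pair energy (assumption, not Eq. (55)); each bond is counted once."""
    return sum(coupling * spins[i][j]
               * (spins[(i + 1) % size][j] + spins[i][(j + 1) % size])
               for i in range(size) for j in range(size))


def metropolis_exchange(size=8, x_b=0.5, coupling=1.0, kt=2.0, sweeps=200, seed=0):
    """Canonical Metropolis run with A-B exchanges at arbitrary distance."""
    rng = random.Random(seed)
    n_sites = size * size
    n_b = round(x_b * n_sites)
    flat = [1] * n_b + [-1] * (n_sites - n_b)  # +1 = B atom, -1 = A atom
    rng.shuffle(flat)
    spins = [flat[r * size:(r + 1) * size] for r in range(size)]
    energy = total_energy(spins, size, coupling)
    samples = []
    for sweep in range(sweeps):
        for _ in range(n_sites):
            i1, j1 = rng.randrange(size), rng.randrange(size)
            i2, j2 = rng.randrange(size), rng.randrange(size)
            if spins[i1][j1] == spins[i2][j2]:
                continue  # exchanging two like atoms changes nothing
            before = (bond_energy_at(spins, size, i1, j1, coupling)
                      + bond_energy_at(spins, size, i2, j2, coupling))
            spins[i1][j1], spins[i2][j2] = spins[i2][j2], spins[i1][j1]
            after = (bond_energy_at(spins, size, i1, j1, coupling)
                     + bond_energy_at(spins, size, i2, j2, coupling))
            delta = after - before
            if delta <= 0 or rng.random() < math.exp(-delta / kt):
                energy += delta  # accept the exchange
            else:
                # reject: undo the exchange
                spins[i1][j1], spins[i2][j2] = spins[i2][j2], spins[i1][j1]
        if sweep >= sweeps // 2:  # discard the first half as equilibration
            samples.append(energy)
    mean_e = sum(samples) / len(samples)
    variance = sum((e - mean_e) ** 2 for e in samples) / len(samples)
    c_v = variance / (kt ** 2 * n_sites)  # per site, in units of k_B
    return {"spins": spins, "energy": energy, "mean_energy": mean_e, "c_v": c_v}
```

Repeating such a run on a grid of temperatures and locating the maximum of c_v would give an estimate of T_trans for one concentration x; doing so for several x traces out a boundary in the sense of Fig. 11.21(b), although for this toy model only.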

Besides the problem of bridging the length scales, many materials properties 
require simulation times reaching from fractions of a second to weeks. One im- 
portant example is the decomposition of an alloy into its constituents by precipi- 
tation. Precipitates represent an important part of the microstructure of many 
alloy systems (see Chapter 7). The dynamic evolution of precipitates takes
place on a time scale of several hours, days, or even months. The CE Hamiltonian 
can help to solve this second scaling problem, too, by using the effective interac- 
tions in kinetic Monte-Carlo (KMC) simulations, which constitute one of the 
most successful approaches to describe diffusion, growth, and microstructure 
evolution in alloy systems (Jacobsen et al. 1995). The combination of MSCE and 
KMC simulations can be applied to simulate the aging of coherent precipitates in 
binary alloy systems. This decomposition reaction is sketched in Fig. 11.22 by a 
simplified two-dimensional presentation: a quenched solid solution (left frame) 
is aged at a given temperature. During this aging process islands are formed 
(right frame) which may show a characteristic size and shape distribution (it is 
assumed that islands are formed by black B atoms in an A-rich A_{1−x}B_x alloy).
The question is whether the distribution of these islands as a function of aging 
time can be calculated from first principles. 

Since the main focus is on the distribution of the islands as a function of 
temperature and aging time while the path of an individual B atom through the 
crystal is less interesting in this case, it is assumed that the islands are formed 
by nearest-neighbor site flips only and not by continuous atomic motion. The 
activation barrier for the exchange process can be expressed in terms of the 
temperature-dependent diffusion coefficient D(T). In order to calculate D(T) by 
a first-principles approach, it is assumed that the exchange of atoms is given by a
vacancy-controlled diffusion. Therefore, in a first step, activation barriers must be 
calculated as a function of the structural environment. In the case of precipitation 
in which the alloy contains only a tiny amount (typically 1-5%) of the precipitat- 





Fig. 11.22 Schematic crystal plane of an A_{1−x}B_x alloy with characteristic
islands formed by B (black) atoms during the aging process. 
Fig. 11.23 Basic assumption in our simulations (Müller 2003): while
the energy difference between two neighboring atoms can be derived 
easily from the MSCE, an average and temperature-dependent 
activation barrier is calculated from experimental temperature- 
dependent diffusion data. 


ing element, one often restricts the calculation of activation barriers to the case of 
the dilute limit (atom B in an A crystal) and the structural environment at the in- 
terface between solid solution and precipitate. Although such activation barriers 
can — in principle — be used directly in KMC programs, they do not allow for a 
consideration of the temperature dependence as well as a transformation to real 
time scales. For this purpose, the complete phonon spectra for the relaxed struc- 
ture corresponding to the vacancy formation, the migration, and the final config- 
uration have to be calculated. This might be used in the framework of a transition 
state theory to predict the temperature-dependent diffusion constant of the system,
D(T) (see Chapter 5 and Section 11.3.3). Following classical diffusion theory, the
exchange time (the inverse of the exchange frequency) is proportional to the square
of the atomic distance divided by the diffusion constant and the number of possible
“jump directions” (e.g., six in a simple cubic lattice). If an exchange process between
two particular neighboring atoms has already been chosen, then the frequency 1/τ_0
for this exchange process as a function of temperature T is connected to D(T) by
Eq. (59), with a_nn being the average nearest-neighbor distance between atoms.
Now, one can easily transform the KMC steps to real time.


$$\tau_0(T) = \frac{a_{nn}^2}{D(T)} \tag{59}$$


The strength of the MSCE to control a huge configuration space can now be 
utilized to calculate the energy difference for all possible exchange processes, 
even if there are millions of them. This makes it possible to force atoms to move 
and to calculate the time which corresponds to this individual exchange process. 
The more unlikely an exchange process the longer is the corresponding time for 
this process. The concept is related to the “residence time algorithm” (Bortz et al.
1975) as discussed in Chapter 12 for nearest-neighbor exchange processes only
(Müller et al. 2002). An accepted spin-flip demands a recalculation of S(k,σ) in
Eq. (55). However, as shown by Lu et al. (1994), the MSCE method helps to avoid
the necessity of recalculating S(k,σ) after each atomic movement by directly
calculating the change in J_pair(k)|S(k,σ)|² for each movement in real space.
In the applied algorithm, a single KMC step is now no longer a constant
real-time unit, but depends on the corresponding probability W_tot. A single kinetic
MC step corresponds indeed to only a single exchange of one B atom with one A 
atom and not to one trial-flip for each B atom. Since the “flip channel” i is always 
chosen randomly and usually a large number of B atoms (typically 10°-10°) is 
considered to describe real aging processes, the probability that the same B atom 
is chosen in step i — when chosen already in step (i— 1) - is extremely small. 
Therefore, due to the large system size it is not necessary to forbid certain ex- 
changes between A and B atoms, i.e., we do not have to give up the restriction 
that the algorithm should be based on the Markovian process. 
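
A single step of a residence-time scheme of this kind can be sketched as follows. This is an illustration under stated assumptions, not the implementation used in the work cited: the rate of each exchange channel is taken to be 1/τ_0 for energy-lowering exchanges and exp(−ΔE/k_BT)/τ_0 otherwise, the time increment is drawn from an exponential distribution with mean 1/W_tot, and the names exchange_time, channel_rates, and residence_time_step are hypothetical. The energy differences ΔE would come from the MSCE Hamiltonian; here they are plain input numbers.

```python
import math
import random


def exchange_time(a_nn, diffusion_coefficient):
    """tau_0(T) = a_nn^2 / D(T), cf. Eq. (59); units must be consistent."""
    return a_nn ** 2 / diffusion_coefficient


def channel_rates(delta_energies, kt, tau0):
    """Assumed rate law: downhill exchanges at 1/tau0, uphill ones Boltzmann-damped."""
    return [(1.0 if d_e <= 0 else math.exp(-d_e / kt)) / tau0
            for d_e in delta_energies]


def residence_time_step(rates, rng):
    """Pick one exchange channel with probability rate/W_tot and return
    (channel index, real-time increment) for this KMC step."""
    w_tot = sum(rates)
    target = rng.random() * w_tot
    cumulative = 0.0
    chosen = len(rates) - 1  # fallback against floating-point round-off
    for index, rate in enumerate(rates):
        cumulative += rate
        if target < cumulative:
            chosen = index
            break
    # 1 - random() lies in (0, 1], so the logarithm is always defined
    dt = -math.log(1.0 - rng.random()) / w_tot
    return chosen, dt
```

Because every step executes an exchange, no trial moves are rejected; unlikely exchanges simply carry a small weight in W_tot, which is how the real time per step grows when only unlikely processes remain.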


11.3.6 
Search for Ground-State Structures 


In general, to find the thermodynamic ground-state structure for a solid com- 
pound at low temperatures seems to be straightforward: start from a given struc- 
ture, relax all atomic positions and the cell shape, and finally select the structure 
with the lowest total energy. Although this procedure can be made to work (e.g., 
by a suitable implementation in a computer code such as VASP minimizing 
forces and relaxing the components of the stress tensor), it does not guarantee 
that the deepest energy minimum can be found. Because of symmetry, fixed 
numbers of atoms and large energy barriers, certain paths in structural configu- 
ration space could just be forbidden or prohibited, and — consequently — the 
search for the true ground-state structure is restricted to a certain class of structures.
However, strategies have been developed which could at least substantially
enlarge the configuration space one is searching through. We will discuss such a
procedure for binary compounds.

The search for the ground state of binary compounds by the standard proce- 
dure and diagrammatic approaches suffers largely from the requirement that the 
small group of crystallographic configurations considered must include the solu- 
tion. No room is left for “surprises”, but the set of possible ground states is re- 
stricted to the “usual suspects.” The MSCE as introduced in Section 11.3.5 partly 
permits us to circumvent this problem by restricting the search to compounds 
with a limited number of basis atoms and calculating the formation enthalpy of 
all possible configurations via the MSCE Hamiltonian. In particular for metal alloys, 
where most of the observed long-range ordered ground states do not contain 
more than 16 basis atoms, such an ansatz appears to be a very reasonable 
approach. One should keep in mind that this restriction still allows for more 
than about 130000 structures, a number which could never be treated by direct 
DFT calculations for all compounds. The calculation of all these formation enthal- 
pies corresponding to the geometrically fully relaxed structures by applying the 
MSCE Hamiltonian can be done within about an hour on a high-performance 
PC. 
Fig. 11.24 Ground-state diagram for T = 0 K for the Pt-Rh system.
(a) Enthalpies directly calculated from DFT calculations; (b) MSCE
results considering all possible compounds with up to 16 basis atoms. 
The effective interactions were constructed from the DFT energies as 
shown in (a). The broken line represents the energetics of the random 
alloy. 


As an example, Fig. 11.24 compares the ground-state diagram for the fcc-based 
Pt-Rh alloy system constructed (a) by simply plotting the formation enthalpies as 
derived from DFT calculations and (b) by using these enthalpies to construct ef- 
fective interactions in order to consider all compounds with up to 16 basis atoms. 
Each point in the figure represents the formation enthalpy of a given structure. 
In case (a), the constructed ground-state line (for technical details, see Section 
11.3.5) consists of two long-range ordered ground-state structures, namely the 
so-called D1a and CH structures (names correspond to the Strukturbericht). These
two intermetallic compounds are displayed in Fig. 11.25: both structures are
superlattices along [201] with the stacking sequence PtRh4 (D1a) and Pt2Rh2
(CH). If the ground-state search is now extended to all compounds with up to 16 
basis atoms, a number of new ground states is predicted for the Rh-rich, but not 
for the Pt-rich side. Without going into detail, it should be emphasized that the 
consideration of more than 130000 compounds leads to a drastically different 





Fig. 11.25 The two ground-state structures as predicted directly by DFT
enthalpies: D1a (PtRh4 superlattice along [201]) and CH (Pt2Rh2
superlattice along [201]). As can be seen from the MSCE-based ground-
state diagram (Fig. 11.24b) the extension of the configuration space to
16 basis atoms identifies D1a no longer as the ground state.


ground-state line. In particular, the D1a structure, which was predicted to be stable
(Fig. 11.25a) is not the ground state any longer. Instead, no long-range ordered
structure is found to be a ground state for a concentration of 20%. This result 
makes it clear that predictions with respect to the stability and formation of inter- 
metallic compounds are only trustworthy if they are based on a huge configura- 
tion space. Besides the energetics of ordered compounds, the MSCE is also able 
to predict the enthalpy of the random alloy as a function of concentration in a 
quite straightforward manner. The corresponding curve is shown in Fig. 
11.24(b) by a broken line, demonstrating that the energy of the random alloy 
clearly lies above the constructed ground-state line. One should, however, keep 
in mind that such a ground-state diagram describes the situation for T = 0 K (or 
sufficiently low temperatures). Considering the fact that all energies displayed in
Fig. 11.24 are within a range of 20 meV atom⁻¹ it is not surprising that for the
Pt-Rh system no intermetallic compounds are observed at finite temperatures. 
Nevertheless, such a ground-state diagram can provide important hints about 
what kind of substitutional short-range order is to be expected, because the 
short-range order in the solid solution is strongly influenced by the correspond- 
ing T = 0 K ground-state structure. Finally, it should be mentioned that - as dis- 
cussed in Section 11.3.5 — the application of the MSCE Hamiltonian in MC sim- 
ulations allows for calculations of mixing enthalpies for any arbitrary temperature 
and concentration. As recently shown (Kolb et al. 2006), the mixing enthalpies of 
Pt-Rh are negative for all temperatures, indicating that the assumed miscibility 
gap in the phase diagram of Pt-Rh for low temperatures does not exist. This re- 
sult is supported by very recent experimental studies (Steiner et al. 2005). 
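
The construction of a ground-state line from a set of formation enthalpies amounts to finding the lower convex hull of the points (concentration, enthalpy). The following sketch shows one standard way of doing this (a monotone-chain hull); it is not taken from the work discussed here, the function name ground_state_line is hypothetical, and the numbers in the usage note are invented for illustration.

```python
def ground_state_line(points):
    """Return the vertices of the lower convex hull of (x, enthalpy) points,
    ordered by concentration x. Structures lying above this line are
    unstable with respect to a mixture of the neighboring hull vertices."""
    # keep only the lowest enthalpy found at each concentration
    lowest = {}
    for x, enthalpy in points:
        if x not in lowest or enthalpy < lowest[x]:
            lowest[x] = enthalpy
    hull = []
    for point in sorted(lowest.items()):
        while len(hull) >= 2:
            (x1, y1), (x2, y2) = hull[-2], hull[-1]
            # cross product <= 0: the middle point lies on or above the
            # chord from hull[-2] to the new point, so it is not a vertex
            cross = (x2 - x1) * (point[1] - y1) - (y2 - y1) * (point[0] - x1)
            if cross <= 0:
                hull.pop()
            else:
                break
        hull.append(point)
    return hull
```

With the invented input [(0.0, 0.0), (0.2, -10.0), (0.5, -24.0), (1.0, 0.0)] the structure at x = 0.2 is a hull vertex; adding a further candidate (0.25, -18.0) removes it from the hull. This is the same mechanism by which enlarging the configuration space can unseat a structure that looked stable within a small set of candidates.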


11.3.7 
Ordering and Decomposition Phenomena in Binary Alloys 


As discussed in Section 11.3.5, the combination of the MSCE with MC simula- 
tions allows for a parameter-free description of materials systems on a meso- 
scopic scale for any arbitrary temperature. In the case of metal alloys this includes 
two important and characteristic properties which cannot be treated by DFT di- 
rectly, namely short-range order and precipitation. This section will present exam- 
ples of both phenomena. Regarding ordering, so far no definition has been intro- 
Fig. 11.26 The dilemma in describing ordering (see Ziman 1979): does
the atom marked by a large arrow belong to “a cluster of pure A 
atoms,” or to a “region of perfect AB order”? 


duced of how to quantify the distribution of the different types of atoms over the 
lattice. Even more importantly a predictive model for the most probable distribu- 
tion is missing. Ziman (1979) illustrated the difficulty of handling ordered zones 
in a disordered matrix with the help of Fig. 11.26: for the given configuration, one
cannot decide whether the atom marked by an arrow belongs to a “cluster of pure 
A atoms” or to a “region of perfect AB order.” By applying percolation theory Zi- 
man demonstrated that almost every A atom belongs to an infinite cluster of A 
atoms. Paradoxically, if one is looking for ordered domains (Fig. 11.26), then al- 
most every atom belongs to an infinite domain with perfect AB ordering. In this 
situation, statistical concepts (Muto and Tagaki 1955; Guttman 1956; Ziman 
1979) may help: for a system consisting of N sites each surrounded by M neighbors,
the probability of a bond being of AB-type is given by Eq. (60), with N_AB
being the total number of AB-type bonds.


$$P_{AB} = \lim_{N \to \infty} \frac{N_{AB}}{\tfrac{1}{2} M N} \tag{60}$$


The denominator gives the total number of bonds in the system. It is now easy to
introduce a nearest-neighbor correlation parameter Γ_AB. Assuming that each site of
the system is independently occupied by an A or B atom with probability x_A or
x_B (x_A + x_B = 1), P_AB would be 2x_A x_B. Then, Γ_AB is defined as the difference in
Eq. (61).
$$\Gamma_{AB} = \tfrac{1}{2} P_{AB} - x_A x_B \tag{61}$$


Dividing Γ_AB by −x_A x_B leads to the well-known Warren–Cowley short-range order
(SRO) parameter [Eq. (62)] (Cowley 1950); see also Chapter 7.


$$\alpha_j = 1 - \frac{P_{AB}^{\,j}}{2 x_A x_B} \tag{62}$$


Here, α is already extended to arbitrary neighbor distances j. The sign of α_j indicates
whether atoms at a given distance j prefer AB ordering (α_j < 0) or clustering
(α_j > 0). The SRO parameter is normalized so that −1 ≤ α_j ≤ +1; α_j = 0 for
all j stands for a perfect random alloy, i.e., an alloy without any atomic correlations.
Since the parameters α_j (being the Fourier coefficients of the diffuse scattering
intensity) can be determined from diffuse X-ray and neutron diffraction
experiments (Sparks and Borie 1966; Krivoglaz 1996; Schönfeld 1999), a
quantitative comparison between calculation and measurement is possible. For
this purpose, Eq. (62) is reformulated as Eq. (63), where P_lmn^A(B) is the conditional
probability that, given an A atom at the origin, there is a B atom at site (lmn).
For comparison with experiment, the so-called “shells” lmn are introduced which
are defined by the distance between A and B atoms in terms of half lattice parameters,
(l a/2, m a/2, n a/2). For example, for an fcc lattice the nearest-neighbor (NN) distance
would be described by the shell (110), the second-neighbor (NNN) distance
by (200) and so on. As already mentioned, the sign of α indicates whether atoms
in a given shell prefer to order (α < 0) or to cluster (α > 0). The SRO parameter
may be written in terms of the CE pair correlations as Eq. (64) (Ozolins et al.
1998a),


$$\alpha_{lmn}(x) = 1 - \frac{P_{lmn}^{A(B)}}{x} \tag{63}$$

$$\alpha_{lmn}(x) = \frac{\langle \bar{\Pi}_{lmn} \rangle - q^2}{1 - q^2} \tag{64}$$


where q = 2x − 1 and ⟨Π̄_lmn⟩ denotes the pair correlation function (according to
Eq. (48)) for shell (lmn). In diffraction experiments the diffuse scattering due to
SRO is proportional to the lattice Fourier transform of α_lmn(x) (Sparks and Borie
1966; Krivoglaz 1996).


$$\alpha(x, \mathbf{k}) = \sum_{lmn}^{n_R} \alpha_{lmn}(x)\, e^{i \mathbf{k} \cdot \mathbf{R}_{lmn}} \tag{65}$$


In this relationship, n_R is the number of real-space shells used in the transform.
Equation (64) in combination with Eq. (65) allows the comparison of both the 
Table 11.9 Warren–Cowley SRO parameters α_lmn as derived from
combined CE and MC calculations (α_theo), in comparison with experimental
results (α_exp; Reinhard et al. 1990) for Cu0.689Zn0.311 at T = 473 K.

(lmn)   α_exp     α_theo      (lmn)   α_exp     α_theo      (lmn)   α_exp     α_theo

000      1.0831    1.0000     400      0.0296    0.0279     510     −0.0107   −0.0186
110     −0.1373   −0.1689     330     −0.0134   −0.0211     521     −0.0019   −0.0092
200      0.1490    0.1863     411      0.0141    0.0306     440     −0.0050   −0.0104
211      0.0196    0.0196     420      0.0050    0.0825     433      0.0038    0.0092
220      0.0358    0.0883     332     −0.0005   −0.0050     530     −0.0066   −0.0057
310     −0.0404   −0.0453     422     −0.0050   −0.0050     442     −0.0084   −0.0145
222     −0.0077    0.0371     431      0.0068    0.0148     600      0.0130    0.0017
321     −0.0036   −0.0132





measured and calculated diffuse diffraction patterns (in reciprocal space) and the 
SRO parameters (in real space). 
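
The equivalence of the bond-counting form, Eq. (62), and the pair-correlation form, Eq. (64), can be checked numerically on a small model. The sketch below is an illustration only: it assumes a two-dimensional periodic square lattice (M = 4 nearest neighbors) rather than an fcc lattice, codes A as 0 and B as 1, and uses the hypothetical function name warren_cowley_nn. It requires a genuinely binary configuration (0 < x < 1).

```python
def warren_cowley_nn(occ):
    """Nearest-neighbor Warren-Cowley parameter of a periodic square lattice.

    occ is a square list of lists with 0 = A atom and 1 = B atom.
    Returns (alpha from bond counting, alpha from the pair correlation)."""
    size = len(occ)
    n_sites = size * size
    x_b = sum(map(sum, occ)) / n_sites
    x_a = 1.0 - x_b
    n_ab = 0        # number of AB-type bonds
    spin_sum = 0    # sum of S_i * S_j over bonds, with S = -1 (A), +1 (B)
    for i in range(size):
        for j in range(size):
            here = occ[i][j]
            # right and lower neighbor: every bond is visited exactly once
            for other in (occ[i][(j + 1) % size], occ[(i + 1) % size][j]):
                n_ab += here != other
                spin_sum += (2 * here - 1) * (2 * other - 1)
    n_bonds = 2 * n_sites                 # (1/2) M N with M = 4
    p_ab = n_ab / n_bonds                 # bond probability, cf. Eq. (60)
    alpha_bond = 1.0 - p_ab / (2.0 * x_a * x_b)       # cf. Eq. (62)
    q = 2.0 * x_b - 1.0
    pair_corr = spin_sum / n_bonds
    alpha_corr = (pair_corr - q * q) / (1.0 - q * q)  # cf. Eq. (64)
    return alpha_bond, alpha_corr
```

For a checkerboard arrangement both expressions give α = −1 (perfect AB ordering), for a lattice split into two pure halves they give a positive value (clustering), and for a large random occupation they scatter around zero.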

For a quantitative comparison of the computed SRO with experiment, the
Warren–Cowley SRO parameters α_lmn are calculated [Eq. (64)] for the first 22
shells of a Cu0.689Zn0.311 alloy at T = 473 K and compared to data from neutron
scattering experiments (Reinhard et al. 1990) (see also Chapter 12), shown in
Table 11.9. Considering the fact that the experimental error of α_000 amounts to
as much as 8% (since α_000 = 1.000 by definition), the predicted and measured
values agree very well. As can be seen, α_110 is negative, indicating that Zn atoms
prefer Cu atoms as nearest neighbors. Furthermore, all α_(2n,0,0) are positive, while
all α_(2n−1,1,0) are negative.
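
The sign pattern stated above can be read off directly from the relevant entries of Table 11.9. As a small sketch, the six shells of type (2n,0,0) and (2n−1,1,0) listed there are copied into a dictionary and checked; the variable and function names are chosen for illustration only.

```python
# (l, m, n): (alpha_exp, alpha_theo), values copied from Table 11.9
TABLE_11_9_SUBSET = {
    (2, 0, 0): (0.1490, 0.1863),
    (4, 0, 0): (0.0296, 0.0279),
    (6, 0, 0): (0.0130, 0.0017),
    (1, 1, 0): (-0.1373, -0.1689),
    (3, 1, 0): (-0.0404, -0.0453),
    (5, 1, 0): (-0.0107, -0.0186),
}


def sign_pattern_holds(table):
    """True if all (2n,0,0) shells are positive and all (2n-1,1,0) shells
    are negative, for experiment and theory alike."""
    for (l, m, n), values in table.items():
        if (m, n) == (0, 0):
            if not all(v > 0 for v in values):
                return False
        elif (m, n) == (1, 0):
            if not all(v < 0 for v in values):
                return False
    return True
```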

In experiments, the SRO parameters, Table 11.9, were used to construct a real- 
space image of the alloy by using them as input for an inverse MC approach in 
order to obtain characteristic interactions (Reinhard et al. 1990). In the case of 
α-brass, real-space images are of special interest for the following reason: as apparent
from Table 11.9, all SRO parameters described by (lmn) = (2n,0,0) are
positive, while all SRO parameters described by (2n − 1,1,0) are negative. This
property should lead to characteristic chains of Zn atoms along the [100] direc- 
tion. Figure 11.27 compares the real-space structure deduced from experiment 
(Reinhard et al. 1990) and from “CE plus MC” calculations (Zn atoms are dis- 
played as dark atoms). In both cases, chains of Zn atoms are visible along [100], 
indicating that short-range order is essential for a quantitatively correct description
of the physical properties of the disordered solid solution of α-brass.

Quenching a solid solution of a metal alloy deep into the two-phase region of 
the phase diagram, followed by sample aging, leads to the formation of character- 
istic nanoclusters, the nuclei of the precipitating phase, which define an impor- 
tant part of the microstructure of many alloy systems (see Chapter 7). The early 
stage of these reactions typically involves the formation of coherent precipitates 
that adopt the crystallographic lattice of the alloy from which they emerge (Guin- 
Fig. 11.27 Visualization of a (001) plane of α-brass (cut through the crystal)
for T = 473 K. While the picture on the left results from a model crystal based
on diffuse neutron scattering experiments (Reinhard et al. 1990), the right-hand
picture is the result of MC simulations using ΔH_CE. In both cases, chains of Zn
atoms (dark atoms) along [100] can be seen, indicating that SRO is present, and
therefore the observed solid solution cannot be described as a random alloy.


ier 1959). Coherent precipitates have practical relevance, as they impede disloca- 
tion motion, and thus lead to “precipitation hardening” in many alloys (Guinier 
1959; Khachaturyan 1983; Cohen 1986). Therefore, their size versus shape distribution
as a function of temperature and aging time is of particular interest. Despite
their importance, precipitate microstructures became amenable to first-principles
theories only a few years ago (Wolverton 1999), since their
description requires “unit cells” containing 10°-10° atoms or more. The concept 
presented in Section 11.3.5 gives access to such huge model systems. 

The shape of precipitates is controlled by two competing energies (Khacha- 
turyan 1983): interfacial or chemical energy Echem and strain energy Ecs. While 
the former leads to a compact shape, the latter leads to a flattening along the elas- 
tically soft direction of the precipitate. The MSCE Hamiltonian allows for a sepa- 
ration into these two characteristic energy parts: as already mentioned in Section 
11.3.5, the first two terms of Eq. (55) include information about the strength and 
importance of the individual pair and multibody interactions and therefore represent
the chemical energy of the system. The last term, however, reflects the elastic
properties of the alloy. This separation is used to analyze the ratio between chemical
energy Echem and strain energy Ecs as a function of the precipitate size. As
examples, Al-rich fcc-based Al-Li and Al—Zn alloys are discussed. Their precipi- 
tate distributions for a given aging time and temperature are shown in the upper 
part of Fig. 11.28, where only the Li and Zn atoms are displayed. The MC cell 
used for the simulations consists of 56 x 56 x 56 = 175616 atoms in order to 
achieve sufficient statistics. 

In the case of Al-Li, precipitates of the size considered up to 5 nm never flat- 
ten, but always possess a spherical shape, in excellent agreement with experimen- 
tal results (Lendvai et al. 1989; Sato and Kamino 1991). This behavior becomes 
understandable by analyzing Echem/Ecs for different precipitate sizes as shown in 
Fig. 11.28 Size and shape distributions of precipitates in Al-rich fcc-
based Al-Li and Al-Zn alloys (no Al atoms are shown) and their 
corresponding percentages of strain and chemical energy as functions 
of the precipitate size. Precipitates of Al-Li form the L12 structure as 
shown in the top left-hand corner. 


the lower part of Fig. 11.28 for a sample containing 9.7% Li atoms: for all sizes 
the chemical energy Echem (white bars) clearly predominates over the strain energy
Ecs (black bars). Following experimental results, it must be expected that
this dominance of Echem over Ecs holds even for precipitates about 50 nm in diameter,
because they still possess a spherical shape. The Li atoms seem to form
a simple cubic lattice. This is due to the fact that the precipitates themselves show
an Al3Li stoichiometry forming the L12 structure sketched at the top left-hand
corner of Fig. 11.28: while all the corners of the unit cell are occupied by Li
atoms, all the faces are occupied by Al atoms. Since only the Li atoms are displayed
in the real-space image, they form a simple cubic lattice. This observation
also makes clear that there are practically no antiphase boundaries within the
Al3Li precipitates which would demand the occupation of Al sites by Li atoms.
In contrast to Al-Li, for Al-Zn a strong size dependence of the precipitate
shape is found: Zn precipitates below 2.5 nm are more spherical, i.e., chemically 
dominated, while larger precipitates become more and more ellipsoidal (strain- 
dominated) (Müller et al. 2000). It is interesting to note that for low temperatures 
a third component becomes important, namely the anisotropy of the chemical 
Fig. 11.29 Shape (c/a) vs. size relationship of Zn precipitates for
two different temperatures. The lines denote the results from calcula- 
tions (Müller et al. 2001), the open points are taken from different 
experimental studies: expts. 1-5 correspond to previous reports 
(Laslaz and Guyot 1977; Bubeck et al. 1985; Fumeron et al. 1980; 
Deguercy et al. 1982; Gerold et al. 1987, respectively). 


part which controls quantitatively to what extent the precipitates facet at low tem- 
peratures. 

Another remarkable feature of the coherent Zn precipitates is the fact that their 
short axis is always along the [111] (and symmetrically equivalent) directions. Indeed,
at first glance, this appears a bit of a surprise, because most fcc elements
are elastically soft along the [100] direction, and consequently hard along [111]. 
However, fcc Zn is an unusual phase. While Zn is stable in the hcp structure, 
fcc Zn shows an instability when deformed rhombohedrally along [111] (Müller 
et al. 1999). As a consequence, fcc Zn precipitates flatten along this direction 
(Müller et al. 2000). This feature allows the definition of a c/a ratio, and therefore 
a quantitative measure for the description of the precipitate shape as used in 
many experimental studies and shown schematically on the left-hand side of 
Fig. 11.29: while a represents the long axis of the ellipsoid (perpendicular to 
[111]), c is its thickness (parallel to [111]). The size is given by the radius of the 
associated sphere having the same volume as the corresponding precipitate. Fig- 
ure 11.29 compares the experimental size-shape relationship for two different 
aging times and concentrations (Laslaz and Guyot 1977; Fumeron et al. 1980; De- 
guercy et al. 1982; Bubeck et al. 1985; Gerold et al. 1987) with those predicted by 
the calculations. For both temperatures, the agreement is excellent, i.e., the calcu- 
lational approach allows for a quantitative prediction of the size versus shape ver- 
sus temperature relationship of coherent precipitates (Müller et al. 2001). 
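
The “size” used in such a comparison follows from equating volumes. As a worked relation, and assuming that the precipitate is idealized as a spheroid whose full axes are a (twice, perpendicular to [111]) and c (parallel to [111]), the radius R of the sphere of equal volume is

```latex
\[
  V = \frac{\pi}{6}\, a^{2} c = \frac{4\pi}{3}\, R^{3}
  \quad\Longrightarrow\quad
  R = \frac{1}{2}\left(a^{2} c\right)^{1/3}
    = \frac{a}{2}\left(\frac{c}{a}\right)^{1/3}
\]
```

For illustration only (the numbers are not taken from the experiments cited), a precipitate with a = 4 nm and c/a = 0.5 would have R ≈ 1.59 nm. If a and c are instead understood as semi-axes, the factor 1/2 drops out.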

The examples above make it clear that the combination of DFT with MSCE and 
MC is probably one of the most successful approaches to studying binary alloy 
properties without any empirical parameters. At the moment, the limitation of
the approach presented is given by the underlying lattice which, for example, does
not permit the study of melting processes. Regarding ordering and decomposition
phenomena in the solid phase, the method allows for a quantitative prediction 
of experimental data. The method presented is by no means restricted to ordering 
phenomena of a bulk phase, but can also be applied to, e.g., surface problems, 
such as adsorbate systems (Drautz et al. 2003), or surface segregation (Wieck- 
horst et al. 2004). 


11.4 
Outlook 


The intention of the present chapter has been to give a flavor of density func- 
tional theory methods and some recent applications for studying properties of 
materials, in particular of intermetallic compounds. The central quantity we 
have focused on was the total energy of a system of atoms of a periodic solid. 
One should, however, keep in mind that for each evaluation of the total energy 
the complete electronic structure is available, because the Kohn-Sham orbitals 
(according to Eq. (10)) defining the electron density must always be calculated. 
The electronic structure can be used to analyze the bonding, the magnetic struc- 
ture, or — going further — to calculate matrix elements and more intricate proper- 
ties such as susceptibilities, optical spectra, and so on. 

As demonstrated, for the actual calculation of “hard numbers” the approxima- 
tions to the many-body electron-electron interactions (i.e., to the exchange- 
correlation functional) must be reasonably accurate, which — concerning ground- 
state energies — in many cases they are. For example, to derive enthalpies of 
formation for intermetallic compounds with a reasonably large number of atoms 
per unit cell (about 100 or fewer, to be conservative) the calculated enthalpies of 
formation (as discussed in Section 11.3.1) are at least as accurate as reliable ex- 
periments by calorimetric methods. Usually, the calculation is much faster and 
cheaper provided a reliable and fast computational package — as mentioned in 
Section 11.2.2 - is at hand. Standard density functional theory does not include 
the concept of temperature and is therefore applicable at T = 0 K only or, prag- 
matically speaking, at low temperatures. However, the most important tempera- 
ture effects (at least for temperatures well below the melting point) can now be 
included by the determination of the vibrational states and derived quantities. As 
discussed in Section 11.2.4, within the harmonic approximation the phonon spec- 
tra may be derived from density functional theory potentials by calculating suit- 
able force fields. Thus, consistent temperature-dependent structural phase stabil- 
ities are derived. 

A first natural step beyond strict stoichiometries of particular ordered com- 
pounds are intrinsic defects, such as vacancies and antisites. Based on supercell 
calculations, as elaborated in Section 11.3.2 for binary compounds, chemical potentials
can be derived which are then included into a statistical mechanics machinery
in order to obtain temperature- and concentration-dependent properties
of point-like defects. The migration of defects and impurities is discussed in Section
11.3.3 and temperature-dependent diffusion coefficients are derived within
transition state theory based on ab-initio total energy and phonon calculations. 
Extended defects such as grain boundaries are also within the scope, which is 
demonstrated in Section 11.3.4 by analyzing impurity effects on grain boundary 
cohesion and intergranular fracture. 

A rather challenging task is to study structural phase stabilities over a wide 
range of compositions. For binary alloys, a very successful strategy is at hand in 
terms of the cluster expansion, described in Section 11.3.5. The basic concept is 
to structurally decompose the solid into small building blocks — clusters of atoms 
- considered to be glued together by cluster interactions. Provided these interac- 
tions are known and a finite cluster expansion reaches convergence, then a large 
variety of different structures can be built with numerically well defined energies. 
In fact, the cluster interactions can be derived from a suitably large database of 
supercell calculations performed in the framework of density functional theory. 
Such a strategy could help to find the ground-state phase diagram as a function
of composition, as discussed in Section 11.3.6. Once the cluster interactions are 
known, one could go far beyond the ground-state properties by making use of 
Monte Carlo techniques. Systems of millions of atoms might be treated without 
giving up the accuracy and predictive power of ab-initio methods. Thus, the most 
probable atomic distribution for a given temperature and concentration could 
be found, enabling for instance the study of precipitation formation, as discussed 
in Section 11.3.7. System sizes have already reached such large dimensions that 
they are tractable by continuum theory approaches. Therefore, with the atomistic- 
based strategy of the cluster expansion in combination with density functional 
theory approaches, length and time scales can be bridged, reaching from nano- 
scale atomic bonds to mesoscale microstructures and from picosecond atomic 
movements to realistic time scales of microstructural evolution. We are convinced 
that this development will be one of the hot topics of the future, both for the de- 
velopment of advanced simulation techniques and for applications of density 
functional theory based methods in alloy physics.


12
Simulation Techniques 


Ferdinand Haider, Rafal Kozubski, and T.A. Abinandanan 


12.1 
Introduction 


Availability of computers opened completely new directions for the development 
of computational methods in all fields of science, but in particular in physics and 
in materials science. Using quantum mechanical models and refined computational
methods, one is able to predict the electronic structure of complicated alloys
and thus the total binding energy of a given structure and of defects therein.
Additionally, it is possible to compute the forces between atoms when they are
displaced from equilibrium. Nevertheless, until now it has been difficult to
predict high-temperature stability, dynamics, and kinetics in an alloy.

In this chapter we present simulation techniques on a somewhat coarser scale, 
that is, on an atomistic or even continuum level. In molecular dynamics simula- 
tions (presented in Section 12.2) the ensemble of atoms forming the material is 
treated as a set of interacting classical particles. The interactions are often derived 
from quantum mechanical considerations, but are taken as given in the form of 
so-called phenomenological potentials. In this framework the dynamics of atoms
(e.g., lattice vibrations) is treated correctly, but any deviations from the
Born-Oppenheimer approximation, that is, any influence of the atomic positions
on the electronic system, are neglected.

A further level of coarse graining is taken in Monte Carlo simulations (pre- 
sented in Section 12.3 of the chapter). Here the atoms are fixed to rigid lattice 
positions, and the microstructure evolves via atomic jumps onto neighboring 
sites. The microscopic degrees of freedom are taken into account only as a heat 
bath, i.e., as a given temperature or thermal energy of the atoms. 

Section 12.4 describes phase field models, a continuum technique which has 
become very popular during the last few years. Here, instead of considering dis- 
crete atoms, only fields of parameters describing the microstructure of an alloy 
are studied. Examples are the local concentration and local order parameter, but 
also crystallinity, orientation, or others. To follow the spatial and/or temporal
evolution of a sample, one has to make assumptions on the underlying
thermodynamics and mobility of the system.

The methods described here show limitations in the range of times accessible 
to the simulation as well as the maximum system size, and thus also limit the 
applicability of the respective method to “real world” problems. These limitations 
get less strict in the sequence of methods — while molecular dynamics imposes 
the strictest limits on both time and size, in Monte Carlo methods studies of 
the long-term evolution of an alloy are feasible, and in phase field models even 
complicated phenomena like solidification, sintering, or recrystallization can be 
studied. 

The hierarchy of methods described in this chapter is far from being complete: 
on the fine scale, methods which couple electronic and structural degrees of
freedom like the Car-Parrinello method (Car and Parrinello 1985) are missing, while
on the continuum scale, the very powerful finite element (FEM) techniques 
for the study of external or internal fields like stress, temperature, or others are 
missing. 


12.2 
Molecular Dynamics Simulations 


Molecular dynamics was originally developed in the 1950s to understand the 
properties of hard-sphere systems (Alder and Wainwright 1957, 1959). The first
studies using continuous potentials were published in the 1960s, where problems 
of radiation damage were studied. Today it is a very common technique in com- 
puter simulation of materials — a literature database search using the keywords 
“molecular dynamics” and “alloys” gave 1080 hits, and for the period of January 
2005-July 2005 still nearly 70. It is applied to all kinds of problems (see below), 
examples being radiation damage, properties of molten alloys, defect properties, 
and many more. General introductions can be found in many papers (e.g.,
Ercolessi 1997) and books (Rapaport 1995; Allen and Tildesley 1987).


12.2.1 
Basic Ideas 


Molecular dynamics is a simulation technique for studying the motion of a set of
particles (in most cases single atoms) on a classical level. This may sound
contradictory, since, as is well known, the behavior of atoms and their
interaction are governed by quantum mechanics. Nevertheless, a simple estimate
shows that, even though the interaction of atoms via their valence electrons is in
principle a quantum mechanical problem, thanks to the Born-Oppenheimer
approximation one may to a good approximation decouple the changes in the
electronic system (which occur on a time scale of femtoseconds) from those of the
atoms, which occur on a time scale of picoseconds. The basic explanation for this
decoupling is the very small mass ratio of electrons to atoms, roughly speaking
$10^{-5}$. The characteristic times of each process scale like the square root of
the respective masses, so they differ by a factor of $10^{2}$-$10^{3}$.
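
As a rough numerical check of this estimate, the ratio of the two time scales can be worked out for a light and a heavy metal. The relative atomic masses of aluminium and tungsten used here are standard values, not taken from the text, and the square-root scaling is the one stated above.

```latex
\frac{\tau_\mathrm{el}}{\tau_\mathrm{atom}} \sim \sqrt{\frac{m_e}{M}},
\qquad \frac{m_e}{M} \approx \frac{1}{1823\,A}
\quad (A = \text{relative atomic mass})

\text{Al } (A \approx 27):\quad
\frac{m_e}{M} \approx 2.0 \times 10^{-5}, \qquad
\sqrt{\frac{m_e}{M}} \approx 4.5 \times 10^{-3}

\text{W } (A \approx 184):\quad
\frac{m_e}{M} \approx 3.0 \times 10^{-6}, \qquad
\sqrt{\frac{m_e}{M}} \approx 1.7 \times 10^{-3}
```

Under these assumptions the electronic and atomic time scales differ by a factor of a few hundred, which lies within the range quoted above.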

The elementary idea of molecular dynamics is to integrate numerically the
classical equations of motion (Newton’s equations) for a large set of atoms, as in
Eq. (1), where $x_i$ denotes the location of atom $i$ and $m_i$ its mass. The force
$f(x_i)$ acting on atom $i$ is the sum of all the forces exerted by other atoms on
the atom under consideration and leads to an acceleration of the atom.


$$m_i \ddot{x}_i = f(x_i) \qquad (1)$$


After integration of the set of equations (1), one obtains a trajectory $x_i(t)$ in
space and time for all atoms (which naturally also includes the time derivatives,
i.e., the momenta $p_i$ or velocities $\dot{x}_i$ for all atoms; thus it is a
complete description in a 6N-dimensional phase space for a system containing N
atoms). Of course, the integration includes any microscopic motion of an atom,
i.e., all lattice vibrations and relaxations. To realize this, the time step for
integration must be short compared to any atomic motion in the lattice, that is,
on the order of femtoseconds.

In contrast to Monte Carlo methods, which introduce an element of stochasticity
to simulate a heat bath in the broadest sense, MD is (at least in principle) a
completely deterministic simulation technique. Nevertheless, since a simulation
yields the evolution of the system in phase space, it can be used to compute any
kind of thermodynamic average of observables. Besides this, MD methods are also
used, though perhaps not primarily, for simulations of nonequilibrium processes
such as irradiation effects, plastic deformation, and fracture.

Molecular dynamics is a powerful technique, but of course shows certain limi- 
tations. The first is a restriction of the method itself — treating atoms as classical 
particles requires that any quantum mechanical effects can be neglected. This 
holds in most cases, but can be a problem, for example, for light atoms like hy- 
drogen, where tunneling can play a role in diffusion properties. The next problem 
(discussed in more detail below) is whether the forces give a realistic description 
of the physics of the system. The conventional approach - using a given interac- 
tion force or potential between the atoms - is restricted to cases where the elec- 
tronic structure of the solid under consideration does not change during the pro- 
cess studied. This can be violated for cases where, for example, during a 
structural phase transition the electronic structure changes. An extreme case
would be the transition of graphite to diamond, with a transition from sp$^2$
bonds to sp$^3$ bonds.

More technical restrictions are limits in space and time. The typical system
usually consists of several thousand atoms (up to millions) and, more severely,
since a time step is typically on the order of femtoseconds, the time span which
can be treated in MD is limited to nanoseconds (up to microseconds). Restrictions
in space (that is, finite size effects) can be partly overcome by using periodic
boundary conditions, if the problem under consideration is appropriate. To
overcome the limitations in time, there have been proposals for the use of
“accelerated algorithms,” which try to bridge the potentially very long time
between “important” events (for example, the many lattice vibrations that pass
before a diffusive jump occurs) either by an artificially increased temperature
or by a lowered barrier (Voter 1997; Sorensen and Voter 2000; Voter et al. 2002).


12.2.2 
Atomic Interaction, Potential Models 


In order to perform the MD integration, the most important step is to set up a
model for the interaction forces. These in most cases are derivatives of the
potential energy with respect to atomic displacements, as expressed by Eq. (2),
where $f(x_i)$ denotes the force on the atom at position $x_i$, $V(\{x\})$ the
potential energy of the system with atoms at positions
$\{x\} = (x_1, x_2, \ldots, x_N)$, and $\nabla_{x_i}$ the derivative with respect
to atomic position $x_i$. The potential energy can be obtained with different
degrees of sophistication, i.e., with more or less physical justification.


$$f(x_i) = -\nabla_{x_i} V(\{x\}) \qquad (2)$$


12.2.2.1 Pairwise Interaction 
The simplest choice for the potential is to assume pairwise interactions [Eq. (3)]. 


$$V(\{x\}) = \frac{1}{2} \sum_{i} \sum_{j \neq i} \Phi(|x_i - x_j|) \qquad (3)$$


An example of a simple pair potential is the so-called Lennard-Jones (LJ) potential 
(Lennard-Jones 1931), [Eq. (4)], which gives a reasonably realistic description for 
the interaction in solid noble gases; otherwise (in most cases of interest) it is a 
model potential, suitable for qualitative but not for quantitative studies: 


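$$V_\mathrm{LJ}(r) = V_0 \left[ \left(\frac{r_0}{r}\right)^{12} - 2 \left(\frac{r_0}{r}\right)^{6} \right] \qquad (4)$$


The minimum of the potential is at $r = r_0$, and the depth is given by $V_0$. It is
attractive for large distances and strongly repulsive at short distances. For
practical reasons, a truncated LJ potential is often used, which is artificially
forced to zero beyond a certain distance $R_c$ [Eq. (5)].


$$V(r) = \begin{cases} V_\mathrm{LJ}(r) - V_\mathrm{LJ}(R_c) & \text{if } r < R_c \\ 0 & \text{else} \end{cases} \qquad (5)$$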
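
The pair-potential model can be illustrated with a short sketch. It assumes the 12-6 Lennard-Jones form with a minimum of depth V0 at r = r0, as described in the text, and a cutoff at which the potential is shifted to zero. The function names, the reduced units (V0 = r0 = 1), and the cutoff value 2.5 are illustrative choices, not taken from the original.

```python
import math


def lj(r, v0=1.0, r0=1.0):
    """12-6 Lennard-Jones potential with minimum -v0 at r = r0."""
    s6 = (r0 / r) ** 6
    return v0 * (s6 * s6 - 2.0 * s6)


def lj_truncated(r, rc, v0=1.0, r0=1.0):
    """LJ potential shifted by its value at the cutoff rc, zero beyond rc."""
    if r < rc:
        return lj(r, v0, r0) - lj(rc, v0, r0)
    return 0.0


def pair_energy(positions, rc, v0=1.0, r0=1.0):
    """Total pairwise energy. Each pair is visited once, which is equivalent
    to the double sum over i and j != i with the prefactor 1/2."""
    energy = 0.0
    for i in range(len(positions)):
        for j in range(i + 1, len(positions)):
            r = math.dist(positions[i], positions[j])
            energy += lj_truncated(r, rc, v0, r0)
    return energy


if __name__ == "__main__":
    # Equilateral triangle with side r0: three bonds at the potential minimum.
    trimer = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0), (0.5, math.sqrt(0.75), 0.0)]
    print(pair_energy(trimer, rc=2.5))
```

Because the energy is a plain sum over bonds, the energy per bond in this model does not depend on how many neighbors an atom has, which is the limitation the many-body potentials below address.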


One particular problem of pair potentials (which makes them rather unsuitable
in most real cases) is that they fulfill the Cauchy relation for elastic constants,
meaning that in Voigt notation $c_{12} = c_{44}$, while in real alloys $c_{12}$ and
$c_{44}$ generally differ considerably.

Additionally, other physical quantities are only poorly modeled within the 
Lennard-Jones model or other pairwise interactions. For this reason in the 1980s 
many-body potentials were developed for alloys. 


12.2.2.2 Many-Body Potentials, the EAM Method 

Most of the many-body potentials for alloys in use today are tailored following a 
similar scheme: the interaction between a solute atom and its surroundings is as- 
sumed to be given by the density of neighbors (or, more precisely, by the elec- 
tronic density acting due to the neighbors at the location of the atom). Qualita- 
tively, using arguments from a tight binding approximation (Finnis and Sinclair 
1984), one expects a dependence of the binding energy $e_i$ attributed to atom $i$
such as is given by Eq. (6), where $\langle i|H|j \rangle$ denotes the overlap
integral between electronic states of atom $i$ and its neighbors $j$, and $Z_i$ is
its coordination number.
This implies that the interaction energy should rather follow a square-root depen- 
dence of coordination instead of a linear dependence found in pair interaction. 


$$e_i \propto \sqrt{\sum_j \langle i|H|j \rangle^2} \propto \sqrt{Z_i} \qquad (6)$$


In more general schemes, this square-root dependence has been liberated in 
favor of an adjustable function, which led to the so-called embedded atom 
method (EAM) potentials. The total potential energy of the crystal [Eq. (7)] is 
made up of two parts, one being a pair potential, the other the so-called
embedding part (Daw and Baskes 1984; Foiles et al. 1986; Baskes 1987; Voter and
Chen 1987).


$$V = \frac{1}{2} \sum_{i=1}^{N} \sum_{j \neq i} \Phi(r_{ij}) + \sum_{i=1}^{N} U(\rho_i) \qquad (7)$$


The variable $\rho_i$ denotes the electron density at the position of atom $i$,
given in Eq. (8) again as the sum over the neighboring atoms.


$$\rho_i = \sum_{j=1,\, j \neq i}^{N} \Delta\rho_j(r_{ij}) \qquad (8)$$


The embedding function U describes the contribution of this density to the total 
potential energy. In practice, these three functions are fitted to experimental data 
or to data obtained in ab-initio calculations, using suitable parameterizations 
such as combinations of polynomials and exponential functions. 
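
How the pair part and the embedding part combine can be sketched for a single atomic species. The three model functions below (an exponential pair repulsion, an exponential density contribution, and a square-root embedding function) are purely illustrative assumptions and are not fitted to any material; only the structure of the energy sum follows the text.

```python
import math


def eam_energy(positions, pair, density, embed, rc):
    """Total energy = sum of pair terms + sum of embedding energies U(rho_i)."""
    n = len(positions)
    rho = [0.0] * n              # host electron density at each atom
    e_pair = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            r = math.dist(positions[i], positions[j])
            if r < rc:
                e_pair += pair(r)        # each pair once = (1/2) * double sum
                rho[i] += density(r)     # contribution of atom j at site i
                rho[j] += density(r)     # contribution of atom i at site j
    return e_pair + sum(embed(x) for x in rho)


# Hypothetical model functions, chosen only to make the example concrete.
def pair(r):
    return 2.0 * math.exp(-3.0 * (r - 1.0))


def density(r):
    return math.exp(-2.0 * (r - 1.0))


def embed(rho):
    return -math.sqrt(rho)       # square-root dependence on the local density


if __name__ == "__main__":
    dimer = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    trimer = dimer + [(0.5, math.sqrt(0.75), 0.0)]
    print(eam_energy(dimer, pair, density, embed, rc=3.0))
    print(eam_energy(trimer, pair, density, embed, rc=3.0))
```

In this toy model the embedding energy per atom grows only as the square root of the number of neighbors, so each additional bond contributes less cohesion than the previous one, in contrast to a pure pair potential.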

For binary alloys, one needs in total seven functions: three pair potentials
$\Phi_{XY}$ for AA, BB, and AB pairs, two density functions $\Delta\rho_X$ for A
and B atoms, and two embedding functions $U_X$. In other words, one needs the full
set of three functions for both atomic species and additionally only one (pair)
function, describing the A-B interaction. A recent example derived from a fit to
ab-initio data is shown in Fig. 12.1.

Fig. 12.1 EAM potential for tungsten and thorium (Eberhard 2006).

It must be emphasized that this type of potential contains no angular-dependent
terms, so it will be suitable only for cases where bonds are more or less
undirected. To some extent this holds for metals, but certainly not for ionic or
covalently bound materials. For these there are other approaches, such as the
Tersoff or the Stillinger-Weber potential for semiconductors. Problems can also
arise for 4d and 5d transition metals and, even more pronouncedly, for f metals
(e.g., the rare earth elements).

Another problem arising in the use of EAM and related potentials is the question
of transferability, that is, whether it is really justified to use the same
single-element parts of a potential in different alloys. Because of electron
transfer, these single-element parts may themselves change when a second element
is alloyed, so that describing the A-B interaction by just one additional pair
potential is too simple.

A completely different approach, which does not refer to empirical potentials, is 
that introduced by Car and Parrinello (Car and Parrinello 1985), where the motion
of the atoms is still treated as a classical problem, but where the forces are ob- 
tained directly by solving (in an adiabatic approximation) the electronic structure. 
This method of course avoids many of the problems related to empirical poten- 
tials, but at the cost of a much higher numerical effort, which then drastically 
limits the size and time scale of problems which can be treated. 

Another approach which also refers to ab-initio data, the so-called force-matching
method (Ercolessi and Adams 1994), uses a somewhat similar idea: forces
computed by ab-initio methods are parameterized for different atomic
configurations and then used in MD. In some sense, this method can be seen as an
“off-line version” of the Car-Parrinello method, but of course it is difficult to
predict all the relevant configurations appearing in the course of a simulation.


12.2.3
Practical Considerations 


Before one applies a potential from literature or other sources, one should verify 
its suitability for the problem under consideration. Usually, different measured 
quantities are reproduced with different precision. Quantities often used are: lat- 
tice constant, cohesive energy, bulk and other elastic moduli, stacking fault en- 
ergy, vacancy formation energy, and others. The fitting of these quantities acts 
on different parts of the potential; elastic coefficients, for example, are essentially 
sensitive to small deviations from equilibrium positions, while vacancy properties 
reflect the long-distance tail of the potential. 


12.2.4 
Different Thermodynamic Ensembles: Thermostats, Barostats 


In discussing the integration of Newton’s equations of motion above, it was as- 
sumed implicitly that we are interested in a microcanonical ensemble, that is, a 
situation where total energy, volume, and number of particles are conserved 
(therefore it is often also called an NVE ensemble). For a typical experimental 
setup, it is usually much easier to control temperature (instead of energy) and 
pressure (instead of volume), and so to work in an NPT ensemble. This can be 
achieved by changing dynamically the average kinetic energy of the system (thus 
adjusting the temperature) or the system size (to adjust pressure) (Andersen 
1980). An even more refined scheme was introduced by Parrinello and Rahman 
(1980, 1981), who allowed not only for changes in the system size but also for 
changes in the box shape. This can be important if phase transformations with a 
change in symmetry occur during the simulation. 

Temperature in terms of microscopic quantities of the simulation is given 
through the equipartition formula, which states that in equilibrium the average 
of the kinetic energy K(t) [Eq. (9)] 


$$K(t) = \frac{1}{2} \sum_{i=1}^{N} m_i v_i(t)^2 \qquad (9)$$


is equally distributed over the 3N degrees of freedom, each carrying
$\frac{1}{2} k_B T$ [Eq. (10)].


$$\langle K \rangle = \frac{3}{2} N k_B T \qquad (10)$$


The simplest implementations of a thermostat just rescale the particle velocities 
periodically in order to keep temperature constant. This method, despite its ap- 
pealing simplicity, suffers (at least theoretically) from a severe drawback: it 
does not respect the fluctuation-dissipation theorem [Eq. (11)], which relates the
fluctuations of one thermodynamic observable (here: the temperature) with a
corresponding response function (here: the specific heat).


$$\langle (\Delta T)^2 \rangle = \frac{k_B T^2}{C_V} \qquad (11)$$
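
The equipartition relation and the simple velocity-rescaling thermostat described above can be sketched as follows. This is only a minimal illustration in reduced units (k_B = 1 by default); the function names are hypothetical, and, as noted in the text, plain rescaling does not reproduce the correct temperature fluctuations.

```python
import math


def kinetic_energy(masses, velocities):
    """K = (1/2) * sum_i m_i v_i^2 for a list of (vx, vy, vz) tuples."""
    return 0.5 * sum(m * (vx * vx + vy * vy + vz * vz)
                     for m, (vx, vy, vz) in zip(masses, velocities))


def temperature(masses, velocities, k_b=1.0):
    """Instantaneous temperature from K = (3/2) N k_B T."""
    return 2.0 * kinetic_energy(masses, velocities) / (3.0 * len(masses) * k_b)


def rescale_velocities(masses, velocities, t_target, k_b=1.0):
    """Scale all velocities by one factor so that the temperature equals t_target."""
    scale = math.sqrt(t_target / temperature(masses, velocities, k_b))
    return [tuple(scale * c for c in v) for v in velocities]


if __name__ == "__main__":
    masses = [1.0, 2.0]
    velocities = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
    print(temperature(masses, velocities))
    print(temperature(masses, rescale_velocities(masses, velocities, 2.0)))
```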





Better algorithms have been introduced, such as the Nosé-Hoover thermostat and
its variants (Hoover 1985). In the Nosé-Hoover scheme, the system is coupled to an
external heat bath, which (via one extra degree of freedom) itself follows an
evolution equation similar to that of the particle trajectories.

Similar methods were introduced to keep pressure constant (Nosé and Klein
1983; Nosé 1984). Pressure follows from the virial function, as in Eq. (12), where
$r_i$ is the position of atom $i$ and $F_i^\mathrm{tot}$ denotes the total force
acting on it. Its average is given by Eq. (13).


$$W(r_1, \ldots, r_N) = \sum_{i=1}^{N} r_i \cdot F_i^\mathrm{tot} \qquad (12)$$

$$\langle W \rangle = -3 N k_B T \qquad (13)$$


The total force acting on particle i can be decomposed into an internal force 
(from the other atoms) and an external force (due to pressure) [Eq. (14)]. 


$$F_i^\mathrm{tot} = F_i + F_i^\mathrm{ext} \qquad (14)$$


Similarly, the virial can be decomposed into an internal and an external
contribution, where the external part can be evaluated by Eq. (15).


$$\langle W^\mathrm{ext} \rangle = -3pV \qquad (15)$$

Combining Eqs. (13)-(15), one gets Eq. (16).

$$pV = N k_B T + \frac{1}{3} \left\langle \sum_{i=1}^{N} r_i \cdot F_i \right\rangle \qquad (16)$$

Again, the simplest idea, periodically readjusting the system size to keep pressure
constant, violates the fluctuation-dissipation theorem (here relating the volume
fluctuations $\Delta V$ with the isothermal compressibility $\kappa$).


$$\langle (\Delta V)^2 \rangle = k_B T V \kappa \qquad (17)$$


More appropriate is the method of Andersen (1980), which introduces the box 
size as an extra degree of freedom and integrates this in order to keep pressure 
constant. The Parrinello-Rahman scheme (Parrinello and Rahman 1980, 1981) 
in principle works similarly, but uses six additional degrees of freedom, the sizes
and angles of the simulation box. The equations of motion are then integrated in
particle coordinates scaled by the box vectors. The extra degrees of freedom
all need a virtual mass (used in the Newton equation of motion), which in prin- 
ciple can be chosen arbitrarily but in a real application can cause problems by in- 
troducing oscillations or slow relaxation toward equilibrium. 
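
The virial route to the pressure used by these barostats can be sketched for a single configuration, assuming the standard relation pV = N k_B T + (1/3) sum_i r_i · F_i with F_i the internal force on atom i. The function name and the reduced units are illustrative assumptions; in a real simulation the expression is averaged over time, and with periodic boundaries the sum is usually rewritten over pairs using minimum-image separations.

```python
def virial_pressure(positions, forces, volume, temperature, k_b=1.0):
    """Instantaneous pressure from the ideal-gas term plus the internal virial.

    positions, forces: lists of (x, y, z) tuples; forces are the internal
    (interatomic) forces only. Valid as written for a finite cluster; for
    periodic systems use the pair form sum_{i<j} r_ij . f_ij instead.
    """
    n = len(positions)
    w_int = sum(rx * fx + ry * fy + rz * fz
                for (rx, ry, rz), (fx, fy, fz) in zip(positions, forces))
    return (n * k_b * temperature + w_int / 3.0) / volume


if __name__ == "__main__":
    positions = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    attractive = [(1.0, 0.0, 0.0), (-1.0, 0.0, 0.0)]   # atoms pull on each other
    print(virial_pressure(positions, attractive, volume=1.0, temperature=1.0))
```

Attractive forces give a negative virial and lower the pressure below the ideal-gas value, while repulsive forces raise it.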


12.2.5 
Implementation of MD Algorithms 


The essential part of any MD code is the time integration algorithm. For a time
step $\Delta t$, new positions, velocities, and forces have to be calculated. The
simplest (but worst) way to do the integration would be to write Eq. (18).


$$x_i(t + \Delta t) = x_i(t) + v_i(t)\,\Delta t + \frac{F_i(t)}{2 m_i}\,\Delta t^2 \qquad (18)$$


Such a simple forward integration finite difference scheme is known to be nu- 
merically unstable and would require extremely short time steps to give reason- 
able accuracy. 

Two popular and more efficient methods often applied in MD codes are the
Verlet algorithm (Verlet 1967) and different kinds of predictor-corrector methods.
The Verlet algorithm uses a forward and a backward Taylor expansion of positions,
Eqs. (19) and (20), which can be added to give Eq. (21).

$$x_i(t + \Delta t) = x_i(t) + v_i(t)\,\Delta t + \frac{F_i(t)}{2 m_i}\,\Delta t^2 + \frac{1}{6}\,\dddot{x}_i(t)\,\Delta t^3 + O(\Delta t^4) \qquad (19)$$

$$x_i(t - \Delta t) = x_i(t) - v_i(t)\,\Delta t + \frac{F_i(t)}{2 m_i}\,\Delta t^2 - \frac{1}{6}\,\dddot{x}_i(t)\,\Delta t^3 + O(\Delta t^4) \qquad (20)$$

$$x_i(t + \Delta t) = 2 x_i(t) - x_i(t - \Delta t) + \frac{F_i(t)}{m_i}\,\Delta t^2 + O(\Delta t^4) \qquad (21)$$


The truncation error is of the order of $\Delta t^4$, so it is called a fourth-order
method. A variant is the velocity Verlet method (Swope et al. 1982) with the time
steps described by Eqs. (22)-(25).

$$x_i(t + \Delta t) = x_i(t) + v_i(t)\,\Delta t + \frac{1}{2}\,a_i(t)\,\Delta t^2 \qquad (22)$$

$$v_i\left(t + \frac{\Delta t}{2}\right) = v_i(t) + \frac{1}{2}\,a_i(t)\,\Delta t \qquad (23)$$

$$a_i(t + \Delta t) = \frac{F_i(t + \Delta t)}{m_i} \qquad (24)$$

$$v_i(t + \Delta t) = v_i\left(t + \frac{\Delta t}{2}\right) + \frac{1}{2}\,a_i(t + \Delta t)\,\Delta t \qquad (25)$$

The second algorithm, the predictor-corrector method, also called the Gear
algorithm (Gear 1966, 1971), consists of three steps: from a Taylor expansion
similar to that above, in the first step new positions, velocities, and
accelerations are predicted. With these predicted quantities one evaluates the
forces and computes the difference between the predicted accelerations and those
following from the forces. These differences are then used to correct positions,
velocities, etc. The Gear scheme can be used to quite a high order (often to order
six), but the efficiency is not necessarily better than that of the Verlet method.
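
The velocity Verlet variant described above (position update, half-step velocity, new acceleration from the new forces, second half-step velocity) can be sketched in a few lines. The harmonic test force, the single common mass, and all names are illustrative assumptions; a production code would use per-atom masses and a real interatomic potential.

```python
import math


def velocity_verlet(x, v, force, mass, dt, n_steps):
    """Propagate flat lists of coordinates x and velocities v by n_steps.

    force(x) must return the list of force components for the coordinates x.
    """
    a = [f / mass for f in force(x)]
    for _ in range(n_steps):
        # new positions from current velocities and accelerations
        x = [xi + vi * dt + 0.5 * ai * dt * dt for xi, vi, ai in zip(x, v, a)]
        # velocities at the half step
        v_half = [vi + 0.5 * ai * dt for vi, ai in zip(v, a)]
        # accelerations from the forces at the new positions
        a = [f / mass for f in force(x)]
        # velocities at the full step
        v = [vh + 0.5 * ai * dt for vh, ai in zip(v_half, a)]
    return x, v


def harmonic_force(x, k=1.0):
    """Illustrative test force: independent harmonic springs, F = -k x."""
    return [-k * xi for xi in x]


if __name__ == "__main__":
    x, v = velocity_verlet([1.0], [0.0], harmonic_force, 1.0, 0.01, 628)
    print(x[0], math.cos(6.28))                  # numerical vs exact position
    print(0.5 * v[0] ** 2 + 0.5 * x[0] ** 2)     # total energy, initially 0.5
```

Only one force evaluation per step is needed, and positions and velocities are available at the same instant, which makes it easy to monitor the total energy.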


12.2.6 
Practical Aspects: Time Steps 


As the first step in an MD simulation one has to choose a suitable atomic interac- 
tion potential which is known to fulfill the conditions discussed above. Addition- 
ally, one has to select a thermodynamic ensemble (e.g., NPT) and an algorithm to 
control the thermodynamic variables. 

The next step is to set up initial atomic positions which define the problem, 
e.g., atoms in a perfect lattice or atoms at a grain boundary. In order to define a 
starting temperature, initial velocities (taken from a Maxwell distribution with 
zero average, corresponding to the temperature) have to be assigned randomly to 
the atoms. 
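
Assigning the initial velocities can be sketched as follows, assuming that each Cartesian velocity component is drawn from a Gaussian with variance k_B T / m (the Maxwell distribution), that any net drift is then removed, and that the result is rescaled to the exact target temperature. The function name, the fixed random seed, and the reduced units are illustrative choices.

```python
import math
import random


def initial_velocities(masses, t_target, k_b=1.0, seed=0):
    """Random velocities with zero total momentum and temperature t_target.

    Requires at least two atoms, since removing the drift of a single atom
    would leave it at rest.
    """
    rng = random.Random(seed)
    n = len(masses)
    vel = [[rng.gauss(0.0, math.sqrt(k_b * t_target / m)) for _ in range(3)]
           for m in masses]
    # Remove the centre-of-mass velocity so that the total momentum vanishes.
    m_tot = sum(masses)
    for c in range(3):
        v_cm = sum(m * v[c] for m, v in zip(masses, vel)) / m_tot
        for v in vel:
            v[c] -= v_cm
    # Rescale so that K = (3/2) N k_B T holds exactly.
    kin = 0.5 * sum(m * sum(c * c for c in v) for m, v in zip(masses, vel))
    scale = math.sqrt(1.5 * n * k_b * t_target / kin)
    return [[scale * c for c in v] for v in vel]


if __name__ == "__main__":
    masses = [1.0] * 50 + [2.0] * 50
    vel = initial_velocities(masses, t_target=0.7)
    print(sum(m * v[0] for m, v in zip(masses, vel)))   # x momentum, close to 0
```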

Rather crucial for the efficiency and correctness of an MD simulation is the
proper choice of the integration time step. On the one hand, it has to be chosen
as large as possible in order to increase the computation speed; on the other
hand, it must not be too large, so as to avoid instabilities of the integration
algorithm, and, more importantly, to avoid spurious overshooting of the particle
motion across, e.g., local potential minima. The time step must be short compared
to vibrational periods in the solid, which are of the order of
$10^{-13}$-$10^{-12}$ s. A fairly pragmatic but efficient way to choose an adequate
time step is to start with a value of, say, 1 fs, and then to test, in a short run
in a small simulation box under constant energy, whether energy is conserved. If
not, one has to reduce the time step; otherwise, one can try to enlarge it.
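
This pragmatic test can be sketched on the simplest possible system, a single harmonic oscillator integrated with velocity Verlet in reduced units (period 2π). The oscillator, the run length, the candidate time steps, and the tolerance are illustrative assumptions; in practice the same check is run on a small box of the real system.

```python
def max_energy_error(dt, t_total=20.0, k=1.0, m=1.0):
    """Largest relative deviation of the total energy during a short
    constant-energy run of a harmonic oscillator (velocity Verlet)."""
    x, v = 1.0, 0.0
    a = -k * x / m
    e0 = 0.5 * m * v * v + 0.5 * k * x * x
    worst = 0.0
    for _ in range(int(round(t_total / dt))):
        x += v * dt + 0.5 * a * dt * dt
        v_half = v + 0.5 * a * dt
        a = -k * x / m
        v = v_half + 0.5 * a * dt
        e = 0.5 * m * v * v + 0.5 * k * x * x
        worst = max(worst, abs(e - e0) / e0)
    return worst


def choose_time_step(candidates, tolerance):
    """Return the largest candidate whose energy error stays below tolerance,
    or None if no candidate qualifies."""
    for dt in sorted(candidates, reverse=True):
        if max_energy_error(dt) < tolerance:
            return dt
    return None


if __name__ == "__main__":
    for dt in (0.5, 0.1, 0.01):
        print(dt, max_energy_error(dt))
    print(choose_time_step([0.5, 0.2, 0.1, 0.05, 0.01], tolerance=1e-3))
```

For this integrator the energy error grows roughly with the square of the time step, so halving the step reduces the error by about a factor of four.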


12.2.7 
Evaluation of Data: Use of Correlation Functions 


The first and sometimes most important results of an MD simulation are the
atomic positions after a simulation run. Often it is useful not to take the
instantaneous positions as obtained for a certain endpoint but to relax the
positions to local minima of the potential energy (“quenching”). There are
different ways to do this: either all particle velocities are set to zero whenever
the total potential energy would increase during the next step, or the velocity
of an individual particle is set to zero whenever the potential energy of that
particular atom would increase. Another way is to use minimization techniques
(such as steepest descent or conjugate gradients) to find a local minimum of the
potential energy.

Here good visualization techniques can be important, perhaps supported by an
overlay of other quantities, such as local energy or coordination, onto the atomic
positions.

Related to the inspection of atomic positions is the study of displacements and 
relaxations, where the difference between a reference and the actual position is 
computed. 

Statistical averages over the whole simulation cell which are easily obtained as 
functions of simulation time are the average total, kinetic, and potential energy, 
total cell volume (in an NPT ensemble), or total pressure (in NVT). These quanti- 
ties make it possible to monitor changes and equilibration. 

At equilibrium one can perform time averages over the trajectories of single 
atoms or sets of atoms to obtain mean values of observables. The fluctuations of 
such observables again can be used to determine quantities like elastic moduli or 
specific heat by applying the fluctuation-dissipation theorem.

A further useful but somewhat more involved evaluation of simulation data is 
the computation of correlation functions in space and time. Real-space correla- 
tions are, for example, the density-density correlation g(r), which gives the prob- 
ability of finding an atom at a distance r away from another atom [Eq. (26)], or its 
spherical average. The density-density correlation function is especially useful for 
the study of liquid and amorphous structures, where it decays for large r. Partial 
correlation functions, taken only between one species of atoms, can also be of 
interest. 


$$g(r) \propto \left\langle \sum_{i=1}^{N} \sum_{j=1,\, j \neq i}^{N} \delta(r - r_{ij}) \right\rangle \qquad (26)$$
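
In practice the spherically averaged correlation function is estimated as a histogram of pair distances. The sketch below assumes a cubic periodic box with the minimum-image convention and normalizes by the ideal-gas expectation, so that the result tends to 1 at large distances in a disordered system. This normalization, the function name, and the binning are assumptions of the example; r_max should not exceed half the box length.

```python
import math


def radial_distribution(positions, box, r_max, n_bins):
    """Histogram estimate of g(r) for one configuration in a cubic periodic box."""
    n = len(positions)
    dr = r_max / n_bins
    counts = [0] * n_bins
    for i in range(n):
        for j in range(i + 1, n):
            d2 = 0.0
            for a, b in zip(positions[i], positions[j]):
                d = a - b
                d -= box * round(d / box)        # minimum-image convention
                d2 += d * d
            r = math.sqrt(d2)
            if r < r_max:
                counts[min(int(r / dr), n_bins - 1)] += 1
    density = n / box ** 3
    g = []
    for k, c in enumerate(counts):
        shell = 4.0 / 3.0 * math.pi * (((k + 1) * dr) ** 3 - (k * dr) ** 3)
        # 2*c/n is the mean number of neighbours per atom in this shell
        g.append(2.0 * c / (n * density * shell))
    return g


if __name__ == "__main__":
    # Simple cubic lattice, 4 x 4 x 4 atoms, lattice constant 1.
    lattice = [(float(i), float(j), float(k))
               for i in range(4) for j in range(4) for k in range(4)]
    print(radial_distribution(lattice, box=4.0, r_max=2.0, n_bins=8))
```

For the perfect lattice the histogram is zero except in the bins containing the neighbor shells; in a liquid or amorphous structure these peaks broaden and merge into a smooth curve.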


Comparison with scattering experiments is possible by the use of reciprocal-space
correlation functions like the (static) structure factor, Eq. (27), where $\rho(k)$
[Eq. (28)] is the spatial Fourier transform of the atomic distribution. It is easy
to show that the structure factor is the Fourier transform of the density-density
correlation function. The static structure factor is the quantity measured in an
elastic scattering experiment from a diffraction pattern.


$$S(k) = \frac{1}{N} \langle \rho(k)\,\rho(-k) \rangle \qquad (27)$$

$$\rho(k) = \sum_{i=1}^{N} \exp(\mathrm{i}\, k \cdot r_i) \qquad (28)$$
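
For a single configuration the static structure factor can be evaluated directly from the atomic positions, using the fact that ρ(-k) is the complex conjugate of ρ(k) for real positions. The function name and the lattice test case are illustrative; the thermal average in the definition would be obtained by averaging this quantity over many stored configurations.

```python
import cmath
import math


def structure_factor(positions, k):
    """S(k) = |rho(k)|^2 / N for one configuration and one wave vector k."""
    rho_k = sum(cmath.exp(1j * (k[0] * x + k[1] * y + k[2] * z))
                for x, y, z in positions)
    return abs(rho_k) ** 2 / len(positions)


if __name__ == "__main__":
    # Simple cubic lattice, 4 x 4 x 4 atoms, lattice constant 1.
    lattice = [(float(i), float(j), float(k))
               for i in range(4) for j in range(4) for k in range(4)]
    print(structure_factor(lattice, (2.0 * math.pi, 0.0, 0.0)))  # Bragg peak
    print(structure_factor(lattice, (math.pi, 0.0, 0.0)))        # between peaks
```

At a reciprocal lattice vector all atoms scatter in phase and S(k) equals the number of atoms, whereas halfway between reciprocal lattice points the contributions cancel.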


The dynamic structure factor, Eq. (29), contains information about vibrational
excitations and is measured experimentally in inelastic scattering (compare Chapter
13.1). The positions of peaks in the $(k, \omega)$-plane show the dispersion of an
excitation like a phonon.


$$S(k, \omega) = \frac{1}{2\pi N} \int \langle \rho(k, t)\,\rho(-k, 0) \rangle \exp(\mathrm{i}\omega t)\, dt \qquad (29)$$


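A final important correlation function is the velocity autocorrelation function,
whose Fourier transform, Eq. (30), is directly proportional to the phonon density
of states.


$$g(\omega) \propto \int_0^\infty \frac{\langle v(t) \cdot v(0) \rangle}{\langle v(0) \cdot v(0) \rangle} \cos(\omega t)\, dt \qquad (30)$$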
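
The connection between the velocity autocorrelation function and the vibrational spectrum can be sketched as follows, assuming the velocities are stored at equal time intervals and the spectrum is taken as the cosine transform of the normalized autocorrelation function. The storage layout, the function names, and the single-oscillator test signal are assumptions of the example.

```python
import math


def vacf(velocities, max_lag):
    """Normalized velocity autocorrelation function.

    velocities[t][i] is the (vx, vy, vz) tuple of atom i at time step t;
    the average runs over all atoms and all available time origins.
    """
    n_steps = len(velocities)
    result = []
    for lag in range(max_lag + 1):
        total = 0.0
        for t0 in range(n_steps - lag):
            for v0, v1 in zip(velocities[t0], velocities[t0 + lag]):
                total += v0[0] * v1[0] + v0[1] * v1[1] + v0[2] * v1[2]
        result.append(total / (n_steps - lag))
    return [c / result[0] for c in result]


def cosine_transform(c, dt, omega):
    """Trapezoidal estimate of the integral of c(t) cos(omega t) over the lags."""
    terms = [ck * math.cos(omega * k * dt) for k, ck in enumerate(c)]
    return dt * (sum(terms) - 0.5 * (terms[0] + terms[-1]))


if __name__ == "__main__":
    dt = 0.05
    # One "atom" vibrating at angular frequency 2 along x.
    traj = [[(math.cos(2.0 * k * dt), 0.0, 0.0)] for k in range(1000)]
    c = vacf(traj, max_lag=200)
    for omega in (1.0, 2.0, 3.0):
        print(omega, cosine_transform(c, dt, omega))
```

The transform peaks at the vibration frequency of the test signal; for a real crystal the same procedure gives a spectrum whose shape follows the phonon density of states.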


12.2.8 
Applications to Alloys, Alloy Dynamics, and Alloy Kinetics 


MD methods allow the study of static and dynamic properties of alloys, but are
rather limited in the study of kinetic processes in alloys, since the total time
accessible to MD simulations is limited to microseconds at most, while
(diffusion-controlled) kinetic processes in alloys rather occur on time scales of
seconds, minutes, or hours. Prominent examples of static applications are the
study of defects like interfaces, especially grain boundaries (as shown in Fig.
12.2; see also Sutton and Balluffi 1995), or of vacancies (Fig. 12.3). Figure 12.2
shows an MD computation of grain boundary energies for symmetrical tilt and twist
boundaries. Here the MD approach essentially serves as an efficient method to
relax the structure into a stable configuration. The example shown in Fig. 12.3
used finite-temperature calculations to measure not only the internal energy but
also the entropy by applying the method of overlapping distributions (Bennett
1976) (see also below).

As described above, phonon densities of states can be obtained by using velocity
autocorrelation functions. An alternative way to determine phonon dispersion
relations is to determine the dynamical force matrix and to compute the phonon
spectra in a harmonic approximation. An example is shown in Fig. 12.4.

Other important quantities which need to be determined are diffusion properties.
In principle, MD simulations offer the possibility of measuring diffusion
coefficients in alloys directly by following the mean square displacements [Eq.
(31)], but since the time scale in MD simulation is limited to microseconds at
best, this possibility is limited to either high temperature (and of course to
liquids) or to situations with increased diffusivities, for example short-circuit
diffusion in grain boundaries (compare Chapter 5).


$$\langle x^2(t) \rangle = \frac{1}{N} \sum_{i=1}^{N} (x_i(t) - x_i(0))^2 = 6Dt \qquad (31)$$
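
Extracting a diffusion coefficient from stored trajectories can be sketched as follows, assuming three-dimensional diffusion, for which the mean square displacement grows as 6Dt at long times, and unwrapped coordinates (no periodic jumps). The storage layout, the function names, and the fit through the origin are illustrative choices.

```python
import math


def mean_square_displacement(trajectory):
    """MSD of every frame relative to frame 0.

    trajectory[t][i] is the unwrapped (x, y, z) position of atom i at frame t.
    """
    start = trajectory[0]
    msd = []
    for frame in trajectory:
        total = 0.0
        for p, p0 in zip(frame, start):
            total += sum((a - b) ** 2 for a, b in zip(p, p0))
        msd.append(total / len(start))
    return msd


def diffusion_coefficient(msd, dt):
    """Least-squares slope through the origin of MSD versus time, divided by 6."""
    times = [k * dt for k in range(len(msd))]
    slope = sum(t * m for t, m in zip(times, msd)) / sum(t * t for t in times)
    return slope / 6.0


if __name__ == "__main__":
    # Synthetic check: displacements constructed so that MSD = 6 D t exactly.
    d_true, dt = 0.3, 0.1
    traj = [[(math.sqrt(6.0 * d_true * k * dt), 0.0, 0.0),
             (0.0, -math.sqrt(6.0 * d_true * k * dt), 5.0)] for k in range(50)]
    print(diffusion_coefficient(mean_square_displacement(traj), dt))
```

With real MD data the early, non-diffusive part of the curve (vibrations and ballistic motion) should be excluded from the fit, and the run must be long enough for many jumps to occur.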


Studying diffusional phase transformations in solids is more or less impossible 
since the kinetics of such transformations are too slow. More promising are stud- 
ies of nondiffusional phase transitions such as martensitic transformations 
(Schneider and Stoll 1978).

Fig. 12.2 Energy of symmetric tilt (above) and twist (below) grain
boundaries in tungsten (Eberhard 2006).

Fig. 12.3 Left: vacancy formation entropy and right: energy and free
energy in tungsten (Eberhard 2006).

Fig. 12.4 Phonon dispersion curves for tungsten (upper panel) and
thorium (lower panel), computed using a force matrix derived from MD
simulations (Eberhard 2006). The right-hand inset shows the phonon
density of states.


Nevertheless it is at least possible to study single-jump processes, the jump
path, and the jump energetics as well as other quantities relevant for diffusion,
such as vacancy formation enthalpies and entropies, solute-vacancy binding
energies, etc. Figure 12.5 shows another example in tungsten, the energy profile of
an atom displaced towards a vacancy along two different jump paths, one along a
⟨111⟩ and the other along a ⟨100⟩ direction.


[Figure 12.5: two panels plotting the energy barrier (eV) against the reaction coordinate (Å), one along ⟨111⟩ and one along ⟨100⟩; curves labeled “static” and “relaxed”.]

Fig. 12.5 Energy along the jump path of a tungsten self-diffusional
jump along a ⟨111⟩ (upper panel) and a ⟨100⟩ direction (lower panel)
(Eberhard 2006).


12.3 
Monte Carlo Simulations 


12.3.1 
Foundations of Stochastic Processes — Markov Chains and the Master Equation 


Statistical physics, dealing with macroscopic systems often composed of a number
of components comparable to Avogadro’s number, proved to be an important
beneficiary of new computer facilities. Each such system may be found in one of
its macroscopic states, determined by particular microscopic states $\{\sigma\}$
of all the components and classically represented by points in a 6N-dimensional
phase space (N is the number of system components). The macroscopic states,
characterized by macroscopic parameters (observables $A$) such as energy, volume,
degree of chemical order, magnetization, etc., are usually highly degenerate with
respect to the microscopic ones $\{\sigma\}$. The values of observables $A$
measured under particular conditions (temperature, pressure, external field, etc.)
are identified with the corresponding averages $\langle A \rangle$ over all
microscopic states $\{\sigma\}$ in which the system may be found under these
conditions.


12.3.2 
The Idea of Sampling 


The central problem of statistical physics (statistical thermodynamics) is the
calculation of the values of ⟨A⟩. The averaging is performed over an appropriate
ensemble of macroscopic systems representing the related microscopic states. An
ensemble is characterized by a so-called density ρ(σ), defined in such a way that
P(σ) [Eq. (32)] is the probability that a system in the ensemble is in the
microscopic state σ (see, e.g., Huang 1963). The principal achievement of the
founders of statistical physics was the derivation of formulae for the densities
ρ_eq(σ) corresponding to ensembles of systems in the equilibrium state.


$$P(\sigma) = \frac{\rho(\sigma)}{Z} \tag{32}$$





A complete description of the system thermodynamics is derivable from the sum 
Z [Eq. (33)], called a partition function. 


$$Z = \sum_{\sigma} \rho(\sigma) \tag{33}$$


The types of ensembles and the rules concerning their choice when solving
particular problems are discussed in textbooks on statistical thermodynamics
(e.g., Huang 1963). Here, we only summarize the problem by listing the
ensembles and the corresponding densities ρ_eq in Table 12.1.

In particular, in the case of a microcanonical ensemble of isolated systems in
equilibrium, all microstates σ with the energy E appear with uniform
probability.

If A(σ) denotes the value of the observable A in the microscopic state σ, then
Eq. (34) applies, where the sum covers all possible microscopic states σ (whose
number is most often extremely large or even infinite), so that its strict
calculation is usually unfeasible.¹⁾ The basic idea is to approximate the
complete sum (34)


1) In an isobaric-isothermal ensemble

$$\langle A \rangle = \frac{\sum_{\sigma} \int dV \, [A(\sigma, V) \times \rho(\sigma, V)]}{\sum_{\sigma} \int dV \, [\rho(\sigma, V)]}$$





Table 12.1 Ensembles and the corresponding densities ρ_eq.

Type of ensemble              Usage                                        Density function ρ_eq [a]

Microcanonical ensemble       Isolated systems with fixed energy E         δ_{H(σ_i), E}

Canonical ensemble            Systems with fixed volume V and number       exp[−H(σ_i)/(k_B T)]
                              of particles N, studied at fixed
                              temperature T determined by a thermal bath

Isothermal-isobaric           Systems with fixed number of particles N,    exp{−[H(σ_i) + PV]/(k_B T)}
ensemble                      studied at fixed pressure P and
                              temperature T determined by a thermal bath

Grandcanonical ensemble       Open systems with fixed volume V, studied    exp{−[H(σ_i) − Σ_i μ_i N_i]/(k_B T)}
                              at fixed temperature T determined by a       N is the number of particles,
                              thermal bath                                 μ is the chemical potential

[a] H denotes the Hamiltonian of the system. H(σ_i) used in Monte
Carlo simulations does not contain kinetic energy terms.


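by a partial one performed over some subset {σ_i, i = 1, ..., M} of the
microscopic states σ [Eq. (35)].

$$\langle A \rangle = \sum_{\sigma} [P(\sigma) \times A(\sigma)] = \frac{1}{Z} \sum_{\sigma} [\rho(\sigma) \times A(\sigma)] \tag{34}$$

$$\langle A \rangle \approx \frac{\sum_{i=1}^{M} [\rho(\sigma_i) \times A(\sigma_i)]}{\sum_{k=1}^{M} \rho(\sigma_k)} \tag{35}$$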


It may be demonstrated that an efficient way to select the states σ_i, called
sampling, is a random choice realized numerically by means of random number
generators (see, e.g., Binder and Heermann 1997; Landau and Binder 2000 for
numerous references).


Two variants of sampling may be applied:
• Simple sampling, where the random choice of the states σ_i
  follows a uniform distribution. The drawback of this option
  is the possibility that many of the states σ_i correspond to
  low values of the density ρ, making the approximation
  [Eq. (35)] poor.
• Importance sampling. The procedure is essentially improved
  when the so-called importance sampling is applied. In this
  case the states σ_i are chosen randomly with a nonuniform
  distribution Π(σ). The average ⟨A⟩ is then given by
  Eq. (36) (Heermann 1986), which is simplified dramatically
  by assuming Eq. (37), which leads to Eq. (38).


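$$\langle A \rangle \approx \frac{\sum_{i=1}^{M} [\rho(\sigma_i) \times A(\sigma_i) \times \Pi^{-1}(\sigma_i)]}{\sum_{k=1}^{M} [\rho(\sigma_k) \times \Pi^{-1}(\sigma_k)]} \tag{36}$$

$$\Pi(\sigma) = P(\sigma) = \frac{\rho(\sigma)}{\sum_{\sigma} \rho(\sigma)} \tag{37}$$

$$\langle A \rangle \approx \frac{1}{M} \sum_{i=1}^{M} A(\sigma_i) \tag{38}$$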
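
The difference between the two sampling variants can be illustrated with a
minimal Python sketch. It is not taken from the text: the toy system (N
independent spins s = ±1 in a field h, with energy E = −h Σ s), the function
names, and all parameter values are assumptions chosen only because the exact
average ⟨E⟩ = −N h tanh(h/k_B T) is known. Simple sampling applies Eq. (35) with
uniformly drawn states; importance sampling draws the states directly from the
Boltzmann distribution (possible here because the spins are independent) and
applies the plain average of Eq. (38).

```python
import math
import random


def energy(spins, h=1.0):
    """Energy of non-interacting spins in a field h (toy model, an assumption)."""
    return -h * sum(spins)


def simple_sampling(n_spins, beta, m, rng, h=1.0):
    """Eq. (35): uniformly drawn states, weighted by the density exp(-beta*E)."""
    numerator = 0.0
    denominator = 0.0
    for _ in range(m):
        spins = [rng.choice((-1, 1)) for _ in range(n_spins)]
        e = energy(spins, h)
        weight = math.exp(-beta * e)
        numerator += weight * e
        denominator += weight
    return numerator / denominator


def importance_sampling(n_spins, beta, m, rng, h=1.0):
    """Eq. (38): states drawn with the equilibrium probability, plain average."""
    p_up = math.exp(beta * h) / (2.0 * math.cosh(beta * h))
    total = 0.0
    for _ in range(m):
        spins = [1 if rng.random() < p_up else -1 for _ in range(n_spins)]
        total += energy(spins, h)
    return total / m


def exact_energy(n_spins, beta, h=1.0):
    """Exact canonical average for the toy model."""
    return -n_spins * h * math.tanh(beta * h)
```

For the same number M of sampled states, the importance-sampling estimate
typically scatters much less around the exact value, because most uniformly
drawn states carry a very small weight in Eq. (35).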


12.3.3 
Markov Chains as a Tool for Importance Sampling 


The problem of generating a set of system states showing the distribution
[Eq. (37)] was solved by Metropolis et al. (1953), who indicated that such a set
may be constructed by generating a Markov chain (see, e.g., van Kampen 1987)
with appropriate transition rates W(σ_i → σ_j) (i.e., transition probabilities
per time unit) between the states {σ_i}.

If the states of the systems in an ensemble change due to Markov processes, the
time evolution of the probability distribution P(σ) is given by a master
equation, Eq. (39) (compare Chapter 5).





$$\frac{\partial P(\sigma_i)}{\partial t} = -\sum_j W(\sigma_i \to \sigma_j) \times P(\sigma_i) + \sum_j W(\sigma_j \to \sigma_i) \times P(\sigma_j) \tag{39}$$

The evolution leads to a stationary distribution P_s(σ) (for which
∂P_s/∂t = 0)²⁾ provided that Eq. (40) holds.

$$\sum_j W(\sigma_i \to \sigma_j) \times P_s(\sigma_i) = \sum_j W(\sigma_j \to \sigma_i) \times P_s(\sigma_j) \tag{40}$$

The condition ∂P_eq/∂t = 0 must obviously hold for the equilibrium distribution
P_eq(σ), for which a sufficient, though not necessary, condition is that the
transition frequencies W satisfy the so-called detailed balance of Eq. (41).

$$\frac{W(\sigma_i \to \sigma_j)}{W(\sigma_j \to \sigma_i)} = \frac{P_{eq}(\sigma_j)}{P_{eq}(\sigma_i)} = \frac{\rho_{eq}(\sigma_j)}{\rho_{eq}(\sigma_i)} \tag{41}$$

2) P_s(σ) defines a stationary state, which differs from the equilibrium
one by the possible appearance of systematic flows.


It can be proven (see, e.g., Binder 1976) that detailed balance [Eq. (41)]
guarantees the convergence of a Markov chain to P_eq(σ).

The goal of generating a set of states {σ_i} with the distribution of Eq. (37)
is thus attainable by the realization of a Markov chain of the system
microstates σ with transition frequencies fulfilling the condition in Eq. (41).

This condition constitutes the basis for the MC methods developed for particular
problems of statistical physics, which are solvable by considering appropriate
ensembles determining the densities ρ(σ) (Table 12.1).³⁾
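
The role of detailed balance can be checked numerically on a very small
system. The following Python sketch is an illustration only: the three-level
system, its energies, and the function names are assumptions, and the
transition probabilities are built with a Metropolis-type acceptance rule in
which every other state is proposed with equal probability. It verifies
Eq. (41) and then iterates the discrete-time analogue of the master equation
to show convergence to the canonical distribution.

```python
import math


def metropolis_matrix(energies, beta):
    """Transition probabilities per step for a small discrete system."""
    n = len(energies)
    p = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                # Propose j with probability 1/(n-1), accept with min(1, exp(-beta*dE)).
                d_e = energies[j] - energies[i]
                p[i][j] = min(1.0, math.exp(-beta * d_e)) / (n - 1)
        # The remaining probability is that of staying in state i.
        p[i][i] = 1.0 - sum(p[i])
    return p


def boltzmann(energies, beta):
    """Canonical equilibrium probabilities P_eq = rho_eq / Z."""
    weights = [math.exp(-beta * e) for e in energies]
    z = sum(weights)
    return [w / z for w in weights]


def evolve(p, dist, steps):
    """Apply the transition matrix repeatedly to a probability distribution."""
    n = len(dist)
    for _ in range(steps):
        dist = [sum(dist[i] * p[i][j] for i in range(n)) for j in range(n)]
    return dist
```

Starting from any initial distribution, for example all probability in the
first state, `evolve` approaches the output of `boltzmann` for the same
energies and temperature.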


12.3.4 
General Applicability 


The Monte Carlo method may be basically applied to diverse kinds of problems in 
statistical physics: 


12.3.4.1 Simulation and Characterization of System Properties in Thermodynamic 
Equilibrium 
The procedure starts from a system in some (arbitrary) initial state.
Subsequently, an evolution of the system is simulated as a Markov chain of
microscopic states σ_i with the transition frequencies W(σ_i → σ_j) obeying
detailed balance [Eq. (41)] corresponding to the particular conditions of the
equilibrium state in question. The algorithm applied must enable the evolution
of some macroscopic parameter of the system (for example its energy) to be
followed, so that it is possible to observe the approach to equilibrium (in
which the microscopic states are in dynamical equilibrium with the distribution
P_eq(σ)).⁴⁾ Once this stage is attained, the microscopic states σ_i of the
system appearing at particular moments in time may be randomly sampled and used
in the averaging procedure [Eq. (38)] (effectively, time averaging is done).

Among a number of tasks solvable in this way, the determination of thermo- 
dynamic potentials from which all fundamental properties of macroscopic systems 
may be derived is of basic importance. 


3) In order to accelerate the saturation of a Markov chain, various
modifications of the sampling procedure have been proposed: e.g., so-called
reweighting schemes (see, e.g., Binder and Heermann 1997; Bhattacharya and
Sethna 1998 for “multicanonical sampling”).
4) More accurately, the most slowly relaxing macroscopic parameter should be
monitored.

The procedure consists entirely or partially of importance sampling as
described in Section 12.3.2, which may be combined with integration of standard
thermodynamic equations. Let us focus attention on the evaluation of the
Helmholtz free energy F of a system, given by Eq. (42), where U denotes internal
energy, T stands for absolute temperature, and S denotes entropy.


$$F = U - T S \tag{42}$$


The internal energy U is conventionally evaluated according to Eq. (38) applied
to system energies E(σ_i) sampled from the generated Markov chain of microstates
σ_i. As explained earlier, the sampling is performed on that part of the Markov
chain which corresponds to thermal equilibrium with thermal fluctuations, i.e.,
on its final, quasi-constant part.

The entropy S may be evaluated either indirectly by means of other observables 
determined directly in MC (such as specific heat C, magnetization M, etc.), or by 
direct averaging, as proposed by Ma (1981). 

The first technique involves standard thermodynamic equations; for example,
the entropy may be found by performing an integration (Heermann 1986) to
obtain Eq. (43) (β = 1/k_B T), which, however, requires that (a) the reference
value of the entropy S(T′) is known (which is often possible at either very low
or very high temperature) and (b) U is evaluated along the whole path of
integration.


$$S(T) = S(T') + k_B \left[ \beta U(\beta) - \beta' U(\beta') \right] - k_B \int_{\beta'}^{\beta} U \, d\beta \tag{43}$$


Ma (1981) postulated a direct evaluation of S from Eq. (44), where P(σ_i) is the
probability of the occurrence of the microstate σ_i.

$$S = -k_B \sum_i P(\sigma_i) \times \ln[P(\sigma_i)] \tag{44}$$


In practice, the determination of P(σ_i) is highly nontrivial due to the low
frequency of occurrence of particular microstates with respect to their total
number.

When discussing Monte Carlo techniques for free energy determination, one
should mention a method proposed by Bennett (1976), which is commonly applied
and often called the “overlapping distribution method”. It was shown that,
given two ensembles (two systems or phases) defined in the same configurational
space and described by Hamiltonians H_0 and H_1, respectively, one can find the
ratio Z_1/Z_0 of their partition functions [Eq. (33)] by analyzing the
distributions of these two ensembles over the configurational space. The
distributions are generated by Monte Carlo simulation of a specific Markov chain
over the configurational space, each step of which means a switch between the
H_0 and H_1 phases. Evaluation of Z_1/Z_0 amounts to the direct evaluation of
the difference between the corresponding free energies F_1 and F_0.

12.3.4.2 Simulation of Relaxation Processes Toward Equilibrium

The master equation (39) can basically be regarded as describing relaxation and
migration processes (see, e.g., Kozubski 1997 for a concise review and
references). The Monte Carlo method therefore becomes a natural tool for
simulating such processes. Instead of sampling the microscopic states σ_i after
the saturation of a particular Markov chain, one now simulates and observes an
ensemble of independent parallel Markov chains, and averages the observable of
interest over all of them at particular consecutive moments in time, also before
saturation. In this way the time evolution (relaxation) ⟨A⟩(t) of the observable
is obtained. Since the condition of detailed balance is not sufficient to
determine the transition rates uniquely, the treatment of such problems is
sophisticated and involves a number of issues including, e.g., the relationship
between the computer and real-time scales (see Section 12.3.6). A choice of
transition rates following a well-founded microscopic model is therefore
necessary. In general, the relaxation path toward equilibrium can depend
sensitively on the particular physical model for the transition rates.


12.3.4.3 Simulation of Nonequilibrium Processes and Transport Phenomena 
Nonequilibrium processes constitute a wide group of phenomena, very often of 
fundamental interest: diverse growth processes are typical examples. Such pro- 
cesses, consisting of effective transitions between microscopic states, constitute a 
natural subject for studies by means of MC methods (see Landau and Binder 
2000 for an extended review). The nonequilibrium character of the phenomenon
means that detailed balance need no longer be obeyed by the transition
frequencies that model the particular microscopic reactions involved in the
process. MC simulation of crystal growth according to the Kossel model (Kossel
1927) is a common example. In this model three atomistic-scale processes — 
deposition, evaporation and diffusion — compete. A particular frequency (rate) is 
modeled and attributed to each process and its selective influence on the overall 
effect (e.g., crystal growth rate) may then be studied by means of MC. 

Transport phenomena may be studied by means of MC in both stationary and 
nonstationary states of the systems, simply by monitoring the process of trans- 
port during the simulated Markov chain. 

The most common example is MC simulation of various kinds of diffusion and 
the determination of particular parameters, such as the diffusion constant or cor- 
relation factor. A basic review of the related methodology and topics examined 
has been given by Murch (1984). 

The standard method (see, e.g., a recent study of B2-ordered FeAl by
Weinkamer et al. 1998) consists of monitoring the mean-square displacement
⟨R²_A(V)⟩(t) of a tracer atom (A)/vacancy (V) as a function of MC time. The
vacancy/tracer diffusion constant D_V(A) is then calculated by means of Eq. (45)
(Zhao et al. 1996).


$$D_{V(A)} = \lim_{t \to \infty} \frac{1}{6} \frac{\partial}{\partial t} \langle R^2_{V(A)} \rangle (t) \tag{45}$$

Similarly to the case of relaxations, the analysis makes no sense without a
proper definition of the time scale.

Monitoring of ⟨R²_A(V)⟩(t) also leads to the estimation of the correlation
factors f_A for atoms A (Einstein 1905) by Eq. (46), where n_A denotes the
number of jumps of length a performed by the A atom, as considered, e.g., by
Murch (1982).

$$f_A = \frac{\langle R^2_A(n_A) \rangle}{a^2 \times n_A} \tag{46}$$


The procedure may basically be implemented with various particular conditions 
(concentration/chemical potential gradient, external stress, etc.), which allows dif- 
ferent types of diffusion to be simulated. 
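
As a minimal sketch of this monitoring procedure, the following Python
fragment follows independent walkers on a simple cubic lattice and evaluates
the mean-square displacement and the correlation factor of Eq. (46). The
uncorrelated random walk (no vacancy mechanism, so f should be close to 1),
the function names, and the parameter values are assumptions made for
illustration; they are not a model from the text.

```python
import random

# Nearest-neighbor jump vectors on a simple cubic lattice, in units of a.
STEPS = [(1, 0, 0), (-1, 0, 0), (0, 1, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]


def mean_square_displacement(n_jumps, n_walkers, rng, a=1.0):
    """Average R^2 after n_jumps uncorrelated jumps of length a."""
    total = 0.0
    for _ in range(n_walkers):
        x = y = z = 0
        for _ in range(n_jumps):
            dx, dy, dz = rng.choice(STEPS)
            x += dx
            y += dy
            z += dz
        total += (x * x + y * y + z * z) * a * a
    return total / n_walkers


def correlation_factor(msd, n_jumps, a=1.0):
    """Eq. (46): f = <R^2(n)> / (a^2 * n)."""
    return msd / (a * a * n_jumps)
```

In a vacancy-mediated simulation the same bookkeeping would be applied to the
tracer atoms, and successive jumps would then be correlated, giving f < 1.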


12.3.5 
Limitations: Finite-Size Effects and Boundary Conditions 


Thermodynamic laws in statistical physics correspond to the thermodynamic
limit N → ∞, where N denotes the number of particles in the system. When
simulating a system numerically, however, one always operates with a finite
value of N. This generates so-called finite-size effects, resulting in a variety
of problems in the thermodynamic interpretation of simulation results. The
“simplest” task, the elimination of the parasitic influence of the sample limits
on the simulated effects, is usually accomplished by applying periodic boundary
conditions, in which opposite sample boundaries (e.g., the very first and the
very last crystallographic planes if a crystalline system is in question) are
treated as neighboring parts. The well-known two-dimensional analogue of this
idea is the transformation of a plane into a torus. Although they compensate for
the boundary (surface) effects, periodic boundary conditions cannot remove the
limitation caused by finite size in the consideration of any distance
dependencies. This applies, for example, to correlation lengths ξ (which cannot
exceed the sample size) and results in a characteristic blunting (rounding) of
the singularities marking continuous and discontinuous phase transitions. The
effect may be described by specific finite-size scaling rules derivable for
particular relationships (Binder and Heermann 1997). Examples are the effects
concerning the relaxation times τ and the transition temperatures T_t for
simulated discontinuous “order-disorder” transitions: both quantities scale with
the sample size L according to characteristic critical exponents.
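
Periodic boundary conditions are usually implemented with modular index
arithmetic. The short Python sketch below shows this for a two-dimensional
square lattice (the torus mentioned above); the function name and the lattice
geometry are assumptions made for illustration.

```python
def neighbors_periodic(i, j, size):
    """Four nearest neighbors of site (i, j) on a size x size periodic lattice."""
    return [
        ((i + 1) % size, j),
        ((i - 1) % size, j),
        (i, (j + 1) % size),
        (i, (j - 1) % size),
    ]
```

A site on the first row thus has a neighbor on the last row, so that no site
has missing neighbors, although distances larger than half the sample size
remain meaningless.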

Instead of diverging to infinity at the temperature of phase transition, the relax- 
ation time increases up to a maximum value as in Eq. (47). 


$$\tau_{\max} \propto L^{z} \tag{47}$$


In the case of the transition temperature, we observe, in turn, a shift from the
thermodynamic value described by Eq. (48).

$$T_t(L) - T_t(L \to \infty) \propto L^{-1/\nu} \tag{48}$$


Another unwanted effect of periodic boundary conditions can be the stabilization
(or suppression) of certain superstructures matching (or not matching) the
periodicity of the lattice at the boundaries.

Finally, it should also be mentioned that in the case of MC simulations on rigid
lattices (crystalline phases, lattice gas) the finite size of the simulated
sample limits the possible reduction of the concentration of system components.
The effect is important, e.g., when modeling vacancy-controlled processes in
alloys.


12.3.6 
Numerical Implementation of MC 


Most basically, a simulation algorithm aims at generating microscopic states σ_i
of a system (e.g., particular atomic or spin configurations) corresponding to a
particular macroscopic state (characterized by macroscopic parameters:
temperature, pressure, degree of order, etc.). Conceptually, this is achieved by
generating a Markov chain of microscopic states; the particular variants differ
in the choice of a formula for the transition frequency W(σ_i → σ_j) fulfilling
detailed balance.


12.3.6.1 Classical Realization of Markov Chains 

Let the system energy be equal to E_ini in the microstate σ_i and to E_fin in
the microstate σ_j. Then, after Metropolis et al. (1953), one writes Eq. (49),
where ΔE = E_fin − E_ini, k_B and T denote the Boltzmann constant and
temperature, respectively, and τ is a time-scale constant.

$$W(\sigma_i \to \sigma_j) = \min\left[\tau^{-1},\ \tau^{-1} \times \exp\left(-\frac{\Delta E}{k_B T}\right)\right] \tag{49}$$


Although the Metropolis transition frequencies (49) obviously fulfill detailed
balance (in a canonical ensemble) and the corresponding Markov chain definitely
converges to the equilibrium distribution of the microstates {σ_i}, in some
particular cases their use may be disadvantageous at high temperatures where,
because the transition probabilities approach the value of 1, a system that is
off equilibrium keeps oscillating between different microscopic states, which
makes the simulated process not perfectly ergodic (Landau and Binder 2000).

An alternative concept for the transition frequencies W(σ_i → σ_j) goes back to
Glauber (1963); Eq. (50) for W(σ_i → σ_j) is proposed.

Fulfillment of detailed balance follows directly from Eq. (50). It is also
immediately found that at high temperatures both W(σ_i → σ_j) and W(σ_j → σ_i)
tend to the level of 1/(2τ).
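
Eq. (50) itself is not reproduced in this copy of the text. As an assumption,
the form usually associated with Glauber dynamics is given here; it is
consistent with the properties quoted above, since the ratio of forward and
backward frequencies equals exp(−ΔE/k_B T) and both tend to 1/(2τ) when
ΔE/k_B T → 0:

$$W(\sigma_i \to \sigma_j) = \tau^{-1} \times \frac{\exp(-\Delta E / k_B T)}{1 + \exp(-\Delta E / k_B T)}, \qquad W(\sigma_i \to \sigma_j) + W(\sigma_j \to \sigma_i) = \tau^{-1}$$
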
The simulation algorithms involving the above expressions for W(σ_i → σ_j)
usually work in the following cycles.
• The system is in some microscopic state σ_i.
• Another microscopic state σ_j ≠ σ_i is chosen at random from
  the set {σ_i}.
• The transition σ_i → σ_j is executed or suppressed according to the
  probability τ × W(σ_i → σ_j).
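
The cycle above can be sketched in a few lines of Python. The example is
illustrative only: the ring of Ising spins with nearest-neighbor coupling j,
the single-spin-flip choice of the trial state, and all function names are
assumptions. The acceptance probabilities are τ × W, with W taken from the
Metropolis form, Eq. (49), or from the commonly used Glauber form, which is
assumed here to correspond to Eq. (50).

```python
import math
import random


def metropolis_probability(delta_e, kT):
    """tau * W for Metropolis frequencies, Eq. (49)."""
    return min(1.0, math.exp(-delta_e / kT))


def glauber_probability(delta_e, kT):
    """tau * W for Glauber frequencies (assumed form of Eq. (50))."""
    x = math.exp(-delta_e / kT)
    return x / (1.0 + x)


def ring_energy(spins, j=1.0):
    """Energy of a ring of spins with nearest-neighbor coupling j."""
    n = len(spins)
    return -j * sum(spins[k] * spins[(k + 1) % n] for k in range(n))


def mc_cycle(spins, kT, rng, probability=metropolis_probability, j=1.0):
    """One cycle: pick a trial state (single spin flip), accept or suppress it."""
    n = len(spins)
    k = rng.randrange(n)
    # Energy change on flipping spin k (spins[k - 1] wraps around for k = 0).
    delta_e = 2.0 * j * spins[k] * (spins[k - 1] + spins[(k + 1) % n])
    if rng.random() < probability(delta_e, kT):
        spins[k] = -spins[k]
        return delta_e
    return 0.0
```

Accumulating the returned energy changes allows the energy to be followed
during the run without recomputing it from the whole configuration.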


The above scheme yields a probabilistic rationale for Eq. (50). The probability
of the event “the system either transforms from σ_i to σ_j or remains in σ_i” is
in this particular case equal to 1. On the other hand, it must be equal to the
sum of the two corresponding probabilities. Equation (50) is clearly consistent
with these assumptions.

The time-scale constant τ is usually kept fixed over the whole Markov chain;
sometimes it is related to temperature.

The drawback of the procedure is that, especially at low temperatures, a number
of MC steps are lost (the transition is suppressed). The drawback becomes
especially troublesome when simulating phenomena where ΔE = E_fin − E_ini is
always positive (e.g., in the case of kinetic Monte Carlo simulations of
diffusion or relaxation processes involving activation barriers; see
Section 12.3.4).


12.3.6.2 “Residence Time” Algorithm 

The idea stemmed from Young and Elcock (1966) and Bortz et al. (1975), then 
was applied and developed in many papers on the basis of the method proposed 
by Gillespie (1976) for the simulation of the kinetics of coupled chemical reac- 
tions. The basic idea is that, instead of making a number of unsuccessful at- 
tempts to perform a transition, one computes the (average) time for the system 
to stay in its microstate i, then performs one of the possible transitions with a 
choice which respects the proper weights. 

The following steps are used.
• All possible transitions σ_i → σ_j between the microstates of
  the system are listed and numbered, and their corresponding
  Metropolis frequencies [Eq. (49)] W_k = W(σ_i → σ_j) =
  τ⁻¹ × exp{−[E(σ_j) − E(σ_i)]/k_B T} are determined. This
  appears feasible in many cases when particular systems and
  particular transition mechanisms are considered (see, e.g.,
  Bortz et al. 1975).
• Two random numbers R_1 and R_2 between 0 and 1 are
  generated.
• The transition k is chosen for which Eq. (51) holds.

$$\frac{\sum_{j=1}^{k-1} W_j}{\sum_j W_j} < R_1 \le \frac{\sum_{j=1}^{k} W_j}{\sum_j W_j} \tag{51}$$

• Time is incremented by Δt [Eq. (52)], where Δt has the
  obvious meaning of a “residence time” of the system in the
  initial microscopic state.

$$\Delta t = -\frac{\ln R_2}{\sum_j W_j} \tag{52}$$
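
The steps above can be sketched as a single Python function. This is a minimal
illustration under stated assumptions, not the implementation of any of the
cited papers: the list `rates` is assumed to hold the frequencies W_j of all
transitions available from the current microstate, and the function name is
hypothetical.

```python
import math
import random


def residence_time_step(rates, rng):
    """Choose one transition by Eq. (51) and a time increment by Eq. (52)."""
    total = sum(rates)
    r1 = rng.random()
    r2 = 1.0 - rng.random()  # lies in (0, 1], which avoids log(0)
    threshold = r1 * total
    cumulative = 0.0
    chosen = len(rates) - 1  # fallback against floating-point round-off
    for k, w in enumerate(rates):
        cumulative += w
        # A strict comparison ensures that zero-rate transitions are never chosen.
        if threshold < cumulative:
            chosen = k
            break
    dt = -math.log(r2) / total
    return chosen, dt
```

Every call executes a transition, so no steps are lost; the price is that the
full list of rates must be known, and updated, in every microstate.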


It was shown (Novotny 1995; Athenes et al. 1997) that the standard
Metropolis-type algorithms are equivalent to the “residence-time” one in the
sense that the average residence time ⟨Δt⟩ (52) equals the time resulting from
the accumulation of unsuccessful “Metropolis” transition attempts (see
Appendix).

The classical “residence-time” algorithm has been generalized towards reducing
the effect of back-and-forth transitions (Athenes et al. 1997), in particular in
application to vacancy diffusion.


12.3.6.3 The Problem of Time Scales 

The interpretation of the results of MC simulations in terms of natural physical
phenomena requires that the MC time, i.e., the sequence of simulation steps, is
related to real time. The solution of the problem depends on the particular
algorithm applied.

Metropolis-type algorithms
• In each MC step the number N of possible transitions σ_i → σ_j
  has to be determined.
• Each MC step is associated with the time increment Δt = τ/N.


As thoroughly analyzed by Athenes (1997), the realization of the above idea
requires that the value of N and its possible variation during the simulated
process are examined carefully.

If the algorithm is applied to different situations (e.g., different
temperatures), there is no unique translation from MC simulation time to real
time.

“Residence time” algorithm
As mentioned before, the time increment Δt is weighted by the sum of the
frequencies of the possible transitions [Eq. (52)].

Among a number of proposed justifications (see, e.g., Novotny 1995; Athenes
et al. 1997) the one proposed by Athenes (1997) seems to be the simplest.

Let the system be in the microstate σ_i at t = 0 and let P(t) be the probability
that it still remains in this microstate at t > 0 (t is thus the residence
time). If {W_j} denotes the frequencies of all possible transitions starting
from σ_i, the differential equation (53) holds, with Eq. (54) as the obvious
solution, which is the probability distribution for residence times with the
normalization factor C = Σ_j W_j.

$$\frac{dP(t)}{dt} = -\left(\sum_j W_j\right) \times P(t) \tag{53}$$

$$P(t) = C \times \exp\left[-\left(\sum_j W_j\right) \times t\right] \tag{54}$$


It is clear that the application of Eq. (52) is a numerical realization of
drawing the residence time Δt according to the probability distribution given by
Eq. (54).

From Eq. (54) we find Eq. (55), which gives the value most often assigned to the
residence time in practical applications.


$$\langle \Delta t \rangle = \frac{1}{\sum_j W_j} \tag{55}$$
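
As a brief worked check, sketched under the assumption that R_2 is uniformly
distributed between 0 and 1, Eq. (52) can be read as the inversion of the
survival probability exp[−(Σ_j W_j) t], and the mean of the distribution (54)
with C = Σ_j W_j reproduces Eq. (55):

$$R_2 = \exp\left[-\left(\sum_j W_j\right) \Delta t\right] \;\Rightarrow\; \Delta t = -\frac{\ln R_2}{\sum_j W_j}, \qquad \langle \Delta t \rangle = \int_0^{\infty} t \, C \, e^{-C t} \, dt = \frac{1}{C} = \frac{1}{\sum_j W_j}$$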


12.3.7 
Applications to Alloys 


12.3.7.1 General Assumptions 

We will focus our attention on the configurational thermodynamics of alloys: the 
search for equilibrium configurations and investigation of relaxation kinetics (fea- 
tures of the system evolution toward the equilibrium configuration). Most of the 
information will concern MC simulations of canonical ensembles. A recent ac- 
count of the achievements and trends has been given by Schweika (1998). 

Particular meanings will now be attributed to the general notions introduced in 
the preceding sections. 

The microstates σ_i correspond to atomic configurations, from which one can
easily deduce macroscopic parameters such as short- and long-range order
parameters, internal energy, and precipitate size and distribution.

The transitions between microstates consist of elementary steps of atomic
migration. When searching for equilibrium configurations only, direct exchanges
of atoms can be considered. In the case of kinetics in alloys, we mainly
consider atomic jumps to nearest-neighbor (NN) vacancies (Fig. 12.6), which have
been described in detail in Chapter 5. It is, however, worth noting that, as
vacancies are most often treated as additional alloy components, in both cases
one is dealing with actual exchanges between particles (atoms and vacancies).

Fig. 12.6 Atomic jump energetics.

The corresponding MC simulations are run by applying either of the algorithms
discussed in the preceding sections [Eqs. (49)-(52)]. As pioneering works, the
papers of Flinn and McManus (1961) for the Metropolis-type and of Young and
Elcock (1966) for the “residence time” algorithms are quoted. There are,
however, two alternative ways in which particular energies are substituted for
E_fin: in classical simulations E_fin is identified with the energy E_f
(Fig. 12.6), the final energy of the system after the atomic jump is executed.
More appropriate from the point of view of transition-state theory is the
alternative assumption, where the saddle-point energy E_M is substituted for
E_fin.⁵⁾

The second option is more suitable for simulating kinetics and, therefore, the
corresponding method is called kinetic Monte Carlo (KMC) (see, e.g., Landau and
Binder 2000). However, no strict proof for the latter has been given as yet; on
the contrary, when applied to specific problems of kinetics, both standard MC
and KMC yield analogous results (Oramus et al. 2001a,b).


12.3.7.2 Physical Model of an Alloy 

Realization of the general task requires that a particular model of an alloy is
considered. The vast majority of MC simulations of alloys have up to now been
done with the Ising model (see Chapter 10), although, owing to the considerable
growth of more or less rigorous ab-initio calculations in materials science
during recent years, progress beyond this approximation has been achieved.

The classical Ising model was proposed to describe a magnetic system as a
lattice of pairwise interacting spins, which were assumed to flip without
changing positions in the lattice. Translation of the problem of this
“classical” Ising system with temperature-dependent magnetization M and domain
structure into the problem of an alloy with, e.g., a temperature-dependent
degree of chemical long-range order η and a network of antiphase domains
immediately suggests the correspondence: spins-up ↔ A-atoms; spins-down ↔
B-atoms; M ↔ η, where, however, contrary to the “concentrations” of up- and
down-spins in the ferromagnet, atomic concentrations in an alloy must be
conserved. The bridge between the two topics is mostly due to Kawasaki (1966),
who proposed a formalism assuming the conservation of magnetization to consider
critical spin diffusion in a ferromagnet.

5) In the case of Metropolis-type algorithms [Eqs. (49), (50)], a transition
from the state i to the state f across the saddle point (SP) is realized by
dividing the process into two, i → SP and SP → f. It can easily be shown that
detailed balance then holds for the whole transition i → f.

Application of the Ising model to an alloy and its implementation with MC
simulation algorithms require that the energy of a configuration can be
described as the sum of pair interactions e_ij between the component atoms.

In particular, the energy increments/decrements ΔE attributed to the exchanges
of atomic positions (including vacancies) and controlling the values of the
exchange frequencies [Eqs. (49), (50)] should be known. As shown in Chapter 10,
ΔE in any case depends on the atomic pair interactions e_ij via their linear
combinations v_ij ∝ 2e_ij − e_ii − e_jj (i, j = A, B) (the proportionality
coefficient depends on the particular definitions of other parameters),
sometimes called “ordering energies” or just “pair-interaction energies,” which
is highly misleading. Therefore, for practical use in classical MC (not in KMC!)
it is enough to know only the v_ij parameters. While for both “i” and “j” being
atoms v_ij may be evaluated experimentally (e.g., using chemical short-range
order parameters deduced from diffuse scattering data; see Section 12.3.9.2 and
Chapter 13.1), in the case of vacancy-atom or vacancy-vacancy interactions
experimental access to the parameters is much more difficult. Very often pair
interactions with and between vacancies are neglected (put equal to zero), but
then, in addition to the v_ij parameters, so-called “asymmetry energies”
w_ij ∝ e_ii − e_jj parameterize ΔE. Evaluation of the latter most often consists
of speculations supported by theoretical calculations. For an example of such an
approach see Oramus et al. (2001a).
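
The statement that, for direct atom exchanges, ΔE depends on the pair
interactions only through the ordering energy can be illustrated with a small
Python sketch. Everything in it is an assumption made for illustration: a
one-dimensional ring of A and B atoms with nearest-neighbor interactions, a
proportionality coefficient of 1 in v = 2e_AB − e_AA − e_BB, arbitrary numerical
values, and hypothetical function names. At fixed composition the energy change
of an exchange then equals v/2 times the change in the number of A-B bonds.

```python
def bond_energy(a, b, e):
    """Pair interaction e_ij for species a and b (key order does not matter)."""
    return e[tuple(sorted((a, b)))]


def configuration_energy(config, e):
    """Sum of nearest-neighbor pair interactions on a ring."""
    n = len(config)
    return sum(bond_energy(config[k], config[(k + 1) % n], e) for k in range(n))


def count_unlike_bonds(config):
    """Number of A-B nearest-neighbor bonds on the ring."""
    n = len(config)
    return sum(config[k] != config[(k + 1) % n] for k in range(n))


def ordering_energy(e):
    """v = 2 e_AB - e_AA - e_BB (proportionality coefficient assumed to be 1)."""
    return 2.0 * e[("A", "B")] - e[("A", "A")] - e[("B", "B")]


def exchange_delta_e(config, p, q, e):
    """Energy change for exchanging the atoms on sites p and q."""
    swapped = list(config)
    swapped[p], swapped[q] = swapped[q], swapped[p]
    delta = configuration_energy(swapped, e) - configuration_energy(config, e)
    return delta, swapped
```

With vacancies included as a third species, the same bookkeeping applies, but
the energy changes then also involve the asymmetry energies mentioned above.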

Another source for the values of pair-interaction energies to be used in MC 
simulations are more precise simulation methods such as ab-initio or molecular 
statics/dynamics (see, e.g., Kim 1991). 

As long as a system is described by means of an Ising Hamiltonian, ab-initio 
calculations may be addressed in such a way that the appropriate cohesion and 
defect-formation energies expressed within the cluster variation method (CVM) 
in terms of pair interactions are assumed to be equal to the values resulting 
from the quantum electron theory. An example in this field may be found in 
Kozłowski et al. (2005). It should, however, be made clear that more interesting
results following from the combination of quasi- or strictly ab-initio calculations 
with MC simulation of alloys are obtained when going beyond the Ising model 
and by involving many-body potentials. This topic is discussed briefly in Section 
12.3.10. 

In the case of KMC, in addition to the pair interactions determining the
energies of atoms on lattice sites, the values of the saddle-point energies E_M
(Fig. 12.6) are needed. In many papers arbitrary fixed values of E_M are
assigned to particular atoms (neglecting the influence of local configurations),
or, as for example in Pitsch and Gahn (1993) and Athenes et al. (1996), E_M is
calculated by assuming that the pair-interaction energies e^M_ij bonding the
jumping atoms at the saddle point are proportional to the corresponding e_ij
parameters related to atoms occupying lattice sites. The reliability of
simulations with an atomic jump model involving the saddle-point energy E_M is
thus somewhat weakened by the difficulty in evaluating E_M.

Experimentally, saddle-point energies of atoms migrating in a crystal are deduc- 
ible, e.g., from the analysis of phonon spectra (Schober et al. 1992) or diffusion/ 
relaxation data, provided activation energies for vacancy formation are known. 
The problem is, however, most often solvable by resorting to ab-initio or quasi- 
ab-initio calculations of the alloy energetics and by running molecular statics for 
atoms drawn along the path of an elementary jump (see, e.g., Oramus et al. 
2001b; Schweiger et al. 2001). Saddle-point energies evaluated in this way were 
applied in KMC simulations of ordering kinetics in Ni3Al (Oramus et al. 2001b). 


12.3.8 
Practical Aspects 


In most of the related studies a crystal of an alloy with particular symmetry is si- 
mulated. The size of the sample depends on the computer power available; now- 
adays, the simulations usually involve a range of at least 10° atoms. A large sim- 
ulation cell minimizes finite size effect and, in particular, allows treatment of 
realistically low concentrations (especially that of vacancies). In the case of bulk 
properties being in question, periodic boundary conditions are applied in all x, 
y, and z directions (see the previous section and, e.g., Binder and Heermann 
1997). A study of surface or general boundary effects requires that periodic 
boundary conditions are partially or totally removed. 
The following ways exist to implement an MC algorithm with atomic jumps in
a crystalline material.

(a) Pair exchange: Metropolis-type simulation of atoms moving by direct
    exchange, applied when only equilibrium states are studied.
    • Random choice of two atoms of different chemical nature.
    • Execution or rejection of their exchange according to probabilities
      given by Eq. (49) or (50).
    • Increment of time by $\tau/N$, where N is the total number of atoms.

(b) Vacancy-atom exchange: Metropolis-type simulation of atoms moving with
    the vacancy mechanism.
    • Introduction of vacancies to the system. Most often just one vacancy is
      introduced by random removal of an atom. To maintain stoichiometry
      strictly, it can be necessary to remove more than one atom.
    • Random choice of an atom in the first coordination shell of the vacancy.
    • Execution or rejection of its exchange with the vacancy according to
      probabilities given by Eq. (49) or (50).
    • Repetition of the above steps for each vacancy in the system.
    • Increment of time by $\tau/Z$, where Z is the first-shell coordination
      number (for explanation see the Appendix).

(c) Residence time algorithm: As already mentioned, Metropolis-type
    algorithms appear inefficient in this case because numerous atomic jumps
    are rejected, and problems with vacancy traps appear. The present-day
    common solution is the “residence time” algorithm.
    • Metropolis transition frequencies $W_i$ ($i = 1, \ldots, Z$) [Eq. (49)]
      are calculated for all potential exchanges of a vacancy with atoms in
      the first coordination shell.
    • The exchange chosen randomly according to Eq. (45) is executed.
    • Time is incremented either by $\Delta t = -\ln R / \sum_i W_i$ (by
      generating a random fraction R each time), or by an average
      $\langle \Delta t \rangle = 1 / \sum_i W_i$. Note that the increments are
      by definition proportional to the reciprocal attempt frequency $\tau$.
      This frequency is usually taken as constant, but some authors have
      proposed taking into account its temperature dependence (Bozzolo et al.
      2002).
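One step of the residence time algorithm in (c) can be sketched as follows. This is only an illustrative sketch: Eqs. (45) and (49) are not reproduced in this part of the text, so the code assumes the common Metropolis form $W_i = \tau^{-1}\min[1, \exp(-\Delta E_i/k_BT)]$ and a selection probability proportional to $W_i$. The function names and the list `delta_es` of energy changes for the Z candidate jumps are hypothetical.

```python
import math
import random


def metropolis_frequency(delta_e, kT, tau=1.0):
    """Assumed Metropolis jump frequency: (1/tau) * min(1, exp(-dE/kT))."""
    if delta_e <= 0.0:
        return 1.0 / tau
    return math.exp(-delta_e / kT) / tau


def residence_time_step(delta_es, kT, tau=1.0, rng=random, stochastic_time=True):
    """Choose one vacancy-atom exchange among the candidates.

    delta_es: energy changes of the Z candidate exchanges (hypothetical input).
    Returns (index of the executed exchange, time increment).
    """
    freqs = [metropolis_frequency(de, kT, tau) for de in delta_es]
    total = sum(freqs)

    # Select exchange i with probability W_i / sum_j W_j (assumed reading of Eq. (45)).
    target = rng.random() * total
    cumulative = 0.0
    chosen = len(freqs) - 1
    for i, w in enumerate(freqs):
        cumulative += w
        if target < cumulative:
            chosen = i
            break

    if stochastic_time:
        r = 1.0 - rng.random()          # random fraction R in (0, 1], avoids log(0)
        dt = -math.log(r) / total       # dt = -ln R / sum_i W_i
    else:
        dt = 1.0 / total                # average increment <dt> = 1 / sum_i W_i
    return chosen, dt
```

No jump is ever rejected here: every call executes exactly one exchange, and unfavourable surroundings show up only as a longer time increment.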


In the Appendix, the question of “time scale” is addressed again, now with defi- 
nite reference to atomic jumps. A very consistent explanation of the problem was 
given by Athenes et al. (1997). 


12.3.9 
Review of Current Applications in Studies of Alloys 


Monte Carlo methods have nowadays become one of the most important and 
powerful tools in modeling of alloys. A review of databases shows that over 1200 
related papers have been published during the past decade (over 400 during the 
past three years) and the investigations cover all topics in the field. The classical 
interest in the search for ground-state configurations and, consequently, phase di- 
agram calculations (Landau and Binder 2000) has been extended to time evolu- 
tions including diffusion phenomena and the kinetics of diverse structural trans- 
formations. Most recent MC studies of alloys concern the hottest topics, such as 
the properties of nanostructured materials (see, e.g., Guofeng-Wang et al. 
2004a,b; Canzian et al. 2004; Kozłowski et al. 2005) and hydrogen storage in 
metals (see, e.g., Bhatia et al. 2004; Kamakoti and Sholl 2005). 

Below, some of the less routine MC approaches to alloy physics problems are
briefly surveyed.

12.3.9.1 Computation of Phase Diagrams using the Grand Canonical Ensemble

Calculation of phase diagrams for binary alloys is an important task in order
to determine the solubility limits, supersaturation, existence range of ordered
phases, etc., but also to test the validity of potential parameters. While in
principle it is possible to calculate phase diagrams using standard MC techniques
in the canonical ensemble, precision and computational speed are higher if one
uses instead a grand canonical ensemble (Binder et al. 1981). Practically
speaking, this means that instead of applying an evolution of microstates via A-B
exchanges or vacancy diffusion (both conserving total concentration), we transform
A atoms into B and vice versa, again according to MC rules, but now with an
additional term in the transition probability. This additional term, (56),
corresponds to an external applied field in the case of the magnetic Ising model.
Here, $\Delta\mu$ is the difference in chemical potential of species A and B, and
$\Delta N_B$ is the change of the number of B atoms during the MC step.

$$\Delta\mu \times \Delta N_B \tag{56}$$

Running the simulation with such an algorithm (with $\Delta\mu$ given) leads, after
equilibration, to an equilibrium concentration. Varying $\Delta\mu$ now yields a
curve $c_B(\Delta\mu)$ with jumps at phase boundaries, and, after repetition at
different temperatures, one can construct the complete phase diagram.

A complication of this procedure is due to metastability: often there is a
certain difference in the $c_B$ values depending on whether $\Delta\mu$ was
increased or decreased during the calculation. One solution is to take the average
of both values; another is to put “seeds” of the second phase into the simulation
box.
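The transmutation move described above can be illustrated with a toy two-dimensional square lattice. This is only a sketch under stated assumptions: the sign convention $\Delta\mu = \mu_B - \mu_A$, the Metropolis acceptance with argument $\Delta E - \Delta\mu\,\Delta N_B$, the nearest-neighbour pair-energy table `e_pair` and the function name are all choices made for this example, not taken from the text.

```python
import math
import random


def grand_canonical_run(L, e_pair, delta_mu, kT, sweeps, seed=0):
    """Toy A <-> B transmutation MC on an L x L periodic square lattice.

    Sites hold 0 (A) or 1 (B); e_pair[s1][s2] is an assumed nearest-neighbour
    pair energy. Returns the final B concentration c_B.
    """
    rng = random.Random(seed)
    lattice = [[rng.randint(0, 1) for _ in range(L)] for _ in range(L)]
    for _ in range(sweeps * L * L):
        x, y = rng.randrange(L), rng.randrange(L)
        old = lattice[x][y]
        new = 1 - old                                   # transform A -> B or B -> A
        neighbours = (lattice[(x + 1) % L][y], lattice[(x - 1) % L][y],
                      lattice[x][(y + 1) % L], lattice[x][(y - 1) % L])
        d_energy = sum(e_pair[new][n] - e_pair[old][n] for n in neighbours)
        d_nb = new - old                                # change in the number of B atoms
        d_total = d_energy - delta_mu * d_nb            # extra term of the form (56)
        if d_total <= 0.0 or rng.random() < math.exp(-d_total / kT):
            lattice[x][y] = new
    return sum(map(sum, lattice)) / (L * L)
```

Calling the function for a series of `delta_mu` values at fixed `kT` gives a sampled curve $c_B(\Delta\mu)$; scanning upward and downward from the previous final state (not done in this sketch, which always restarts from a random lattice) is what exposes the metastability mentioned above.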


12.3.9.2 Reverse and Inverse Monte Carlo Methods: from Experimental SRO 
Parameters to Atomic Interaction Energies 

While standard MC simulations are used for the generation of equilibrium states 
(configurations) or possibly for the study of kinetics (relaxation toward the equi- 
librium state) on the basis of the system energetics and external conditions (tem- 
perature, pressure), the method may also be used to determine atomistic energy 
parameters on the basis of the equilibrium state (for a review see, e.g., Schweika 
1998). 

The first step is the so-called reverse MC method, going back to Gehlen and
Cohen (1965), who fitted atomic configurations to correlation functions
(short-range order (SRO) parameters), so that the experimentally observed pattern
of diffuse scattered intensity is reproduced. The method is thus a specific
technique for numerical fitting rather than a simulation of a physical phenomenon.
The algorithm constructs a Markov chain of configuration changes $i \to f$ (due to
atomic displacements) according to the transition frequencies [Eq. (39)] given by
McGreevy and Pusztai (1988) as in Eq. (57), where $\chi^2$ is defined by Eq. (58),
$\alpha^{(\mathrm{th})}$ and $\alpha^{(\mathrm{exp})}$ are theoretical and
experimental values of SRO parameters, respectively, and $\sigma_\alpha$ denotes
the corresponding standard deviation.

As discussed by Schweika, in order to make the procedure converge to a definite
solution, one has to perform a number of runs starting from different initial
conditions.
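A toy version of such a fit is sketched below for a binary ring of atoms and a single nearest-neighbour SRO parameter. The acceptance rule used, accept if $\chi^2$ decreases and otherwise with probability $\exp(-\Delta\chi^2/2)$, is the form usually quoted for reverse MC and is an assumption here, since Eqs. (57) and (58) are not reproduced in this part of the text. The Warren-Cowley-type definition in `alpha1`, the use of atom swaps as moves and all names are likewise illustrative.

```python
import math
import random


def alpha1(chain):
    """Nearest-neighbour SRO parameter of a binary ring (0 = A, 1 = B).

    Assumed Warren-Cowley form: 1 - (fraction of unlike bonds) / (2 c_A c_B).
    """
    n = len(chain)
    c_b = sum(chain) / n
    unlike = sum(chain[i] != chain[(i + 1) % n] for i in range(n))
    return 1.0 - (unlike / n) / (2.0 * c_b * (1.0 - c_b))


def reverse_mc(chain, alpha_exp, sigma, steps, seed=0):
    """Fit the configuration to a target SRO value by reverse MC swaps."""
    rng = random.Random(seed)
    chain = list(chain)
    n = len(chain)
    chi2 = ((alpha1(chain) - alpha_exp) / sigma) ** 2
    for _ in range(steps):
        i, j = rng.randrange(n), rng.randrange(n)
        if chain[i] == chain[j]:
            continue                                    # swapping like atoms changes nothing
        chain[i], chain[j] = chain[j], chain[i]
        chi2_new = ((alpha1(chain) - alpha_exp) / sigma) ** 2
        # Assumed acceptance rule: always downhill, uphill with exp(-d_chi2 / 2).
        if chi2_new <= chi2 or rng.random() < math.exp(-(chi2_new - chi2) / 2.0):
            chi2 = chi2_new
        else:
            chain[i], chain[j] = chain[j], chain[i]     # reject: undo the swap
    return chain, chi2
```

No energy appears anywhere in the loop: the only quantity driving the chain is the misfit to the measured parameter, which is why the method is a fitting tool and why different starting configurations can end in different, equally well-fitting structures.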

The method is very efficient as a fitting tool and is also used by one of the pres- 
ent authors in the calculations of Laplace transformations of the MC-simulated 
“order—order” relaxation isotherms (Oramus et al. 2001a). Several successful ap- 
plications in the studies of atomic correlation in amorphous phases have recently 
been reported (see, e.g., Hanada et al. 2004). 

The inverse Monte Carlo method is an application of the above fitting procedure
to the determination of the values of pair-interaction energies from the
equilibrium set $\{\alpha_i\}$ of SRO parameters obtained from experiment. The
original idea is due to Gerold and Kern (1987). If the crystal configuration
(described by $\{\alpha_i\}$) corresponds to thermal equilibrium, the value of the
configurational energy U is dynamically constant: despite permanent fluctuations
due to atomic migration, its average value does not change.

An exchange of an A-B pair of atoms brings about the change of U expressed
in Eq. (59), where $\Delta N_i$ and $v_i$ denote the resulting change in the number
of A-A pairs and the effective pair-interaction energy (actually, the so-called
ordering energy $v \propto 2e_{AB} - e_{AA} - e_{BB}$, see Section 12.3.7.2) in the
ith coordination shell, respectively.

$$\Delta U = \sum_i \Delta N_i \, v_i \tag{59}$$

The probability of the A-B exchange is assumed to obey the Glauber formula [Eq.
(60)].

$$p = \frac{\exp(-\Delta U / k_B T)}{1 + \exp(-\Delta U / k_B T)} \tag{60}$$

As $\langle U \rangle = \mathrm{const}$ (i.e., $\langle \Delta U \rangle = 0$),
$\langle \Delta N_i \rangle$ should be equal to zero for each coordination shell;
therefore Eq. (61) applies.

$$\langle \Delta N_i \rangle = \sum_{\mathrm{exchanges}} p \times \Delta N_i = 0 \tag{61}$$

The following algorithm realizes the idea.

(1) A crystal configuration with the equilibrium $\{\alpha_i\}$ is generated.

(2) An A-B exchange is virtually executed in a randomly selected pair of atoms
    and the resulting values of the $\Delta N_i$ are calculated. Note: the actual
    configuration of the crystal does not change.

(3) Point (2) is repeated many times.

(4) The evaluated set of $\Delta N_i$ numbers is substituted into Eq. (60) and
    combined with Eq. (59) to form a set of equations for the pair-interaction
    energies $v_i$.
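Steps (1) to (4) can be sketched for a toy case: a binary ring with a single (nearest-neighbour) shell, so that Eq. (61) becomes one equation for one unknown v. Everything specific here is an assumption made for illustration: the energy is taken as $U = v\,N_{AA}$ in the sense of Eq. (59), the "experimental" configuration is replaced by one equilibrated with a known v, and the resulting equation is solved by bisection. All function names are hypothetical.

```python
import math
import random
from collections import Counter


def local_aa_bonds(chain, sites):
    """Count A-A nearest-neighbour bonds touching the given sites (A = 0)."""
    n = len(chain)
    bonds = set()
    for s in sites:
        bonds.add((s, (s + 1) % n))
        bonds.add(((s - 1) % n, s))
    return sum(1 for a, b in bonds if chain[a] == 0 and chain[b] == 0)


def delta_n_aa(chain, i, j):
    """Change in the number of A-A bonds for a virtual exchange of atoms i and j."""
    before = local_aa_bonds(chain, (i, j))
    chain[i], chain[j] = chain[j], chain[i]
    after = local_aa_bonds(chain, (i, j))
    chain[i], chain[j] = chain[j], chain[i]     # undo: the configuration is unchanged
    return after - before


def glauber(delta_u, kT):
    """Glauber probability, Eq. (60), written as 1 / (1 + exp(dU/kT))."""
    return 1.0 / (1.0 + math.exp(delta_u / kT))


def equilibrate(chain, v, kT, steps, rng):
    """Step (1) stand-in: equilibrate with a known v instead of using experiment."""
    n = len(chain)
    for _ in range(steps):
        i, j = rng.randrange(n), rng.randrange(n)
        if chain[i] != chain[j] and rng.random() < glauber(v * delta_n_aa(chain, i, j), kT):
            chain[i], chain[j] = chain[j], chain[i]


def sample_virtual_exchanges(chain, count, rng):
    """Steps (2)-(3): histogram of Delta N over many virtual A-B exchanges."""
    n = len(chain)
    histogram = Counter()
    done = 0
    while done < count:
        i, j = rng.randrange(n), rng.randrange(n)
        if chain[i] != chain[j]:
            histogram[delta_n_aa(chain, i, j)] += 1
            done += 1
    return histogram


def solve_v(histogram, kT, lo=-10.0, hi=10.0, iterations=60):
    """Step (4): solve sum p(v * dN) * dN = 0 for v by bisection."""
    def balance(v):
        return sum(m * d * glauber(v * d, kT) for d, m in histogram.items())
    for _ in range(iterations):
        mid = 0.5 * (lo + hi)
        if balance(mid) > 0.0:      # balance(v) decreases monotonically with v
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)
```

With several coordination shells the same balance condition holds for each shell and the single bisection is replaced by a solver for the coupled set of equations; the statistical scatter of the recovered energies shrinks as the configuration gets larger.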


In the original paper Gerold and Kern tested the method on several binary inter- 
metallic systems. Among recent applications, studies of Ag-Al (Yu et al. 2004) 
and Fe-Pd (Mehaddene 2005) systems may be found. 

The effective pair-interaction energy parameters $v_i$ may be evaluated in a more
straightforward way involving standard MC simulation, as proposed by Livet
(1987). Here, an equilibrium atomic configuration at temperature T (above the
“order-disorder” transition) is generated using the standard Metropolis
algorithm with some starting set $\{v_i^{(0)}\}$ and the values of $\{\alpha_i\}$
are calculated. The values of $v_i^{(0)}$ may be determined within the classical
Clapp-Moss model (Clapp and Moss 1966). An iterative procedure is then applied,
where subsequent corrections $\Delta v_i$ to the pair-interaction parameters are
calculated until agreement between the calculated and experimental SRO parameters
$\{\alpha_i\}$ is achieved.


12.3.10 
Going beyond the Ising Model and Rigid-Lattice Simulations 


Numerous papers published during past decades show that Monte Carlo methods 
of alloy modeling may easily operate in continuum space (i.e., beyond the rigid 
lattice approximation), the domain classically attributed to molecular dynamics. 
The corresponding studies concern liquid phases (see, e.g., Landau and Binder 
2000), amorphous systems or surface phenomena (Dereli et al. 1989; Bhatta- 
charya and Sethna 1998; Vauth et al. 2003; Rojas 2004). The amorphous distribu- 
tion of atoms is either approximated (lattice gas-like approach (Vauth et al. 2003)), 
or modeled in real continuum (Dereli et al. 1989). The results concerning the for- 
mation of amorphous atomic configurations simulated by means of molecular dy- 
namics and off-lattice MC were compared by Bhattacharya et al. (1998). 

On the other hand, off-lattice MC simulations may be run for crystalline
systems in order to model lattice strains and relaxations. Pioneering results in
this field date from the 1990s (see, e.g., Lee 1991; Fratzl and Penrose 1995). An
interesting study in this field has recently been performed on Al-Cu-Mg alloys by
Mason et al. (2004). The paper is recommended as a very good reference regarding
the combination of MC concepts with the methodology typical of the continuous
space treatment.

Another area of development of MC techniques in alloy physics is the
implementation of the method with rigorous (or partially rigorous) physical models
of interatomic interactions. This means going beyond the traditional Ising atomic
pair interactions and accounting for many-body potentials. Literature from the
field since the mid-1990s offers a number of interesting studies covering a wide
range of research from the application of EAM potentials (Kim 1991; Rojas 2004)
up to the involvement of rigorous ab-initio calculations (de Gironcoli and
Gianozzi 1991; Keshari and Ishikawa 1994; Jellinek et al. 1998; Dorfman et al.
2001; Bozzolo et al. 2002; Wang et al. 2004; Vogtenhuber et al. 2005). In the
latter case, however, rather small clusters of atoms have been simulated up to
now.


12.3.11 
Monte Carlo Simulations in View of other Techniques of Alloy Modeling 


The adequacy of the Monte Carlo technique as a tool for solving problems in 
equilibrium and nonequilibrium statistical thermodynamics has been definitely 
proven and verified, mostly due to the extensive research output of Kurt Binder. 
The method may, therefore, be considered as parallel or complementary to other 
modeling techniques, which allows for interesting comparative studies. 

Maugis et al. (2005) have recently studied precipitation in a model binary B2- 
ordered system by means of MC simulations, cluster dynamics, and a model 
based on classical laws of nucleation, growth, and coarsening. It was concluded 
that classical laws are in good agreement with simulations as long as the regular 
solution approximation remains adequate for the system. On the other hand, 
good agreement between MC and cluster dynamics results has been achieved 
when precisely describing the energetics of the smallest clusters. 

Another example of a comparative study is found in the previously mentioned 
investigation of “order-order” kinetics in binary intermetallics (Oramus et al. 
2001a, 2003). Here, the MC technique appeared the most efficient when simulat- 
ing entire relaxations, while the correctness of basic assumptions (e.g., the va- 
cancy mechanism of atomic migration) and the reliability of detailed findings 
(particular sequences of atomic jumps) were verified by molecular dynamics. 


12.4 
Phase Field Models 


12.4.1 
Introduction 


Microstructures in materials may be thought of as bulk features (grains,
domains, phases) that are separated by interfaces (grain boundaries, antiphase
boundaries, interphase interfaces). In this picture, microstructural evolution is
a process that comes about through the migration of these interfaces. Phase field
models are essentially models that allow us to study the energy and kinetics of
migration of interfaces. These models have proved to be very useful for
understanding microstructural evolution in a wide variety of settings, and phase
transformations in particular. Since these models are computationally intensive,
their increasing popularity is also driven by the availability of cheaper and more
powerful computers.

We begin this section with a tutorial introduction to the Cahn-Hilliard model 
(Cahn and Hilliard 1958) of a diffuse interface. The arguments that lead to the 
development of this model are firmly rooted in the physics of inhomogeneous al- 
loys, and allow us to identify the salient — but generic — features of other more 
complex phase field models. We then introduce the Cahn—Allen model (Allen 
and Cahn 1979) for an antiphase boundary (APB) between two ordered domains. 
These two models are prototypical in that, together, they cover many of the phe- 
nomena of interest in materials science, and generalizations to other more com- 
plicated phenomena (such as solidification and grain growth) are often based on 
these two basic models. We will end this section with a brief description of fea- 
tures that need to be incorporated into phase field models for studying interesting 
effects such as anisotropy in interfacial energy. 

The review by Chen (2002), and some parts of the reviews by Fratzl et al. 
(1999), Binder and Fratzl (2001), and Thornton et al. (2003) cover the major de- 
velopments in phase field models and their applications. 

For this section, we will confine our attention to condensed systems under iso- 
thermal conditions. From a thermodynamic point of view, then, microstructural 
evolution — by which we mean the evolution of the field variable c(r, t) — that the 
system undergoes must be such that it always decreases the system's free energy. 


12.4.2 
Cahn-Hilliard Model 


We start with a description of the model of Cahn and Hilliard (1958), culminating 
in the Cahn-Hilliard equation (Cahn 1961). We will present some of the key re- 
sults on interfacial energy and width. We will also discuss its numerical solution, 
and results from computer simulations. 

Following Cahn and Hilliard, we consider a compositionally inhomogeneous
binary A-B alloy with a local composition $c(\mathbf{r})$, and a (constant) density
$N_V$ of atoms (molecules) per unit volume.


12.4.2.1 Energetics 

Defining f as the free energy per atom, the total free energy of the system, F, is
expressed as an integral of f over the system volume in Eq. (62), where $N_V$ is
the number of atoms (molecules) per unit volume, assumed to be a constant.

$$F = N_V \int_V f \, dV \tag{62}$$

When the composition field $c(\mathbf{r})$ is non-uniform, the local, coarse-grained
free energy f per atom is dependent not only on the local composition c, but also
on the local spatial derivatives of the composition. Thus, we may expand f in a
Taylor series about $f_0(c)$, the free energy per atom of a solution of uniform
composition c [Eq. (63)], where $L_i$, $\kappa^{\alpha}_{ij}$ and
$\kappa^{\beta}_{ij}$ are the Taylor series coefficients, and are material property
tensors, and we have defined $p_i = \partial c/\partial x_i$ and
$q_{ij} = \partial^2 c/\partial x_i \partial x_j$.

$$f(c, p_i, q_{ij}, \ldots) = f_0(c) + L_i p_i + \tfrac{1}{2}\kappa^{\alpha}_{ij} p_i p_j + \kappa^{\beta}_{ij} q_{ij} + \cdots \tag{63}$$

We have adopted the Einstein convention of summation over repeated indices.
The Taylor series coefficients are formally defined by Eqs. (64-66).

$$L_i = \frac{\partial f}{\partial p_i} \tag{64}$$

$$\kappa^{\alpha}_{ij} = \frac{\partial^2 f}{\partial p_i \, \partial p_j} \tag{65}$$

$$\kappa^{\beta}_{ij} = \frac{\partial f}{\partial q_{ij}} \tag{66}$$

For centrosymmetric crystals and amorphous solids, $L_i = 0$; also, since
$\kappa^{\alpha}_{ij}$ and $\kappa^{\beta}_{ij}$ are symmetric second-rank tensors,
they are isotropic for cubic and amorphous materials; i.e.,
$\kappa^{\alpha}_{ij} = \kappa^{\alpha}\delta_{ij}$ and
$\kappa^{\beta}_{ij} = \kappa^{\beta}\delta_{ij}$, where $\delta_{ij}$ is the
isotropic second-rank unit tensor (Kronecker symbol). Assuming only small
composition gradients, and negligible higher spatial derivatives, we retain only
those terms with up to second-rank tensors in Eq. (63). We first apply the
divergence theorem to the term with $\kappa^{\beta}_{ij}$ in Eq. (67), where
$\mathbf{n}$ is the unit normal vector to the surface S.

$$\int_S \kappa^{\beta}_{ij} \, p_i \, n_j \, dS = \int_V \frac{\partial \kappa^{\beta}_{ij}}{\partial x_j} \, p_i \, dV + \int_V \kappa^{\beta}_{ij} \, q_{ij} \, dV \tag{67}$$

Since we are interested in the bulk behavior in an infinitely extended system, the
composition gradient $p_i$ may be assumed to be zero everywhere on the surface at
infinity. Applying the chain rule
$\partial \kappa^{\beta}_{ij}/\partial x_j = (\partial \kappa^{\beta}_{ij}/\partial c)(\partial c/\partial x_j)$,
and rearranging, we obtain Eq. (68).

$$\int_V \kappa^{\beta}_{ij} \, q_{ij} \, dV = -\int_V \frac{\partial \kappa^{\beta}_{ij}}{\partial c} \, p_i \, p_j \, dV \tag{68}$$

Using this equation in Eq. (63) yields Eq. (69) for the free energy of the system in
Eq. (62) (from now on, we neglect the higher-order terms in the Taylor series
expansion in Eq. (63)), where
$\kappa_{ij} = (\kappa^{\alpha}_{ij}/2) - (d\kappa^{\beta}_{ij}/dc)$.

$$F = N_V \int_V \left[ f_0(c) + \kappa_{ij} \, p_i \, p_j \right] dV \tag{69}$$

For cubic crystals and amorphous solids, $\kappa_{ij} = \kappa\delta_{ij}$ is an
isotropic tensor, and Eq. (69) reduces to Eq. (70).

$$F = N_V \int_V \left[ f_0(c) + \kappa (\nabla c)^2 \right] dV \tag{70}$$

In most phase field models, $\kappa$ is assumed to be a constant, which yields an
isotropic interfacial energy.


12.4.2.2 Interfacial Energy and Width 

In phase separating systems at low temperatures, the phase diagram exhibits a
miscibility gap (Fig. 12.7; compare also Chapter 7). The plot of the free energy
$f_0(c)$ versus composition for a temperature T below the critical temperature
$T_c$ exhibits a double well as shown in Fig. 12.8; any alloy whose composition
lies inside the miscibility gap at T would, on equilibration, have a planar
interface separating two phases of composition $c_\alpha$ and $c_\beta$. This
interface would also be compositionally diffuse (as shown in Fig. 12.9) and hence
have a finite width, within which the composition would change continuously from
$c_\alpha$ to $c_\beta$. The finite interface width can be thought of as determined
by a competition between two energy terms: if it is too wide, the system would
have a large region of intermediate composition with higher $f_0(c)$. On the other
hand, if the width is too narrow, the system would have a high energy due to the
gradient energy term in Eq. (69). Thus, the system chooses that composition profile
for which the functional F in Eq. (69) is minimized. This exercise yields the
governing equation for $c^*(x)$, the equilibrium composition profile, Eq. (71),
where $\Delta f(c) = f_0(c) - (1 - c)\mu_A^{e} - c\,\mu_B^{e}$, and $\mu_A^{e}$ and
$\mu_B^{e}$ are the chemical potentials of species A and B, respectively, evaluated
in a homogeneous alloy of composition $c_\alpha$ (or, equivalently, $c_\beta$).

$$\Delta f(c^*) = \kappa \left( \frac{\partial c^*}{\partial x} \right)^2 \tag{71}$$

Fig. 12.7 Schematic phase diagram of a binary A-B phase separating
system exhibiting a miscibility gap at low temperatures.

Fig. 12.8 Schematic depicting the dependence of bulk free energy
density $f_0$ on composition for the binary system depicted in Fig. 12.7 at
a low temperature such as $T_1$.

If we approximate the function $f_0(c)$ by the double-well potential in Eq. (72),
where $A_c$ sets the height of the barrier between the two energy wells in Fig.
12.8, it can be shown that the equilibrium composition profile is given by Eq.
(73), where $a = (c_\beta - c_\alpha)\sqrt{A_c/\kappa}$. From this profile, the
interfacial free energy can be shown to be
$\sigma = (1/3) N_V (c_\beta - c_\alpha)^3 \sqrt{\kappa A_c}$ and the interfacial
width $w = 4\sqrt{\kappa/A_c}/(c_\beta - c_\alpha) = 4/a$.

$$f_0(c) = A_c (c - c_\alpha)^2 (c_\beta - c)^2 \tag{72}$$

$$c = c_\alpha + (c_\beta - c_\alpha) \frac{e^{ax}}{1 + e^{ax}} \tag{73}$$

Fig. 12.9 Schematic of the equilibrium composition profile across a
compositionally diffuse interface between two equilibrium phases of
compositions $c_\alpha$ (to the left) and $c_\beta$ (to the right).
12.4.2.3 Dynamics 

The evolution of the (inhomogeneous) composition field $c(\mathbf{r}, t)$ is modeled
by writing $\mathbf{J}$, the net flux of B atoms, as proportional to the gradient in
chemical potential (to be defined below), as in Eq. (74), where M is a mobility
tensor of second rank.

$$\mathbf{J} = -M \nabla \mu \tag{74}$$

Using this form for the diffusional flux in the continuity equation, we obtain the
equation that governs the dynamics of the composition field, Eq. (75).

$$\frac{\partial c}{\partial t} = \nabla \cdot M \nabla \mu \tag{75}$$

The (generalized) chemical potential, $\mu$, is defined by Eq. (76).

$$\mu = \frac{\delta (F/N_V)}{\delta c} = \frac{\partial f_0}{\partial c} - 2\kappa \nabla^2 c \tag{76}$$

For cubic crystals and amorphous solids, the mobility tensor is isotropic:
$M_{ij} = M\delta_{ij}$. Assuming M to be independent of concentration, Eqs. (75)
and (76) yield Eq. (77).

$$\frac{\partial c}{\partial t} = M \nabla^2 \left[ \frac{\partial f_0}{\partial c} - 2\kappa \nabla^2 c \right] \tag{77}$$

In the literature, this equation (originally derived by Cahn 1961) is referred to as
the Cahn-Hilliard equation. It is a nonlinear partial differential equation, since it
contains terms such as $c^2, c^3, \ldots$, that enter through
$\partial f_0/\partial c$ in Eq. (77).

Starting with an initial microstructure, given by an initial condition
$c(\mathbf{r}, t = 0)$, solving Eq. (77) allows us to obtain $c(\mathbf{r}, t)$, and
hence the microstructure, at any later time; since the Cahn-Hilliard equation
cannot, in general, be solved analytically, microstructural evolution can be
studied only by solving Eq. (77) numerically.
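Although Eq. (77) has no general analytical solution, its linearized form does, and this gives a useful check for any numerical scheme. The following worked step is a standard linear stability sketch added for illustration; the symbols $c_0$ (average composition), $\delta c$, $R(k)$, $k_c$ and $k_m$ are introduced here and are not part of the derivation above.

```latex
% Small perturbation about a uniform composition c_0:
%   c(\mathbf{r},t) = c_0 + \delta c, \qquad
%   \delta c \propto \exp(i\,\mathbf{k}\cdot\mathbf{r} + R t).
% Linearizing \partial f_0/\partial c about c_0 in Eq. (77):
\frac{\partial\,\delta c}{\partial t}
  = M \nabla^2 \left[ f_0''(c_0)\,\delta c - 2\kappa \nabla^2 \delta c \right]
\quad\Longrightarrow\quad
R(k) = -M k^2 \left[ f_0''(c_0) + 2\kappa k^2 \right].
% For f_0''(c_0) < 0 all wave numbers below k_c grow:
k_c = \sqrt{\frac{-f_0''(c_0)}{2\kappa}}, \qquad
k_m = \frac{k_c}{\sqrt{2}}, \qquad
R(k_m) = \frac{M \left[ f_0''(c_0) \right]^2}{8\kappa}.
```

Here $k_m$ is the fastest-growing wave number; for $f_0''(c_0) > 0$ every perturbation decays, so the uniform alloy is linearly stable.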


12.4.3 
Numerical Implementation 


In computer simulations of microstructure, the amount of available memory
constrains us to study only a finite system; in order to avoid any spurious effects
that may arise from the surfaces of the finite system, it is customary to employ
periodic boundary conditions. The early computer simulations employed a simple
finite difference technique, with an explicit Euler technique for time-stepping.
For example, for a two-dimensional (2D) system (generalization to 3D is
straightforward), with an N x N square grid of spacing $\Delta x = \Delta y$,
discretization of Eq. (77) yields Eq. (78), where
$\mu^{t}_{p,q} = \mu(p\Delta x, q\Delta y, t)$ is given by Eq. (79).

$$\frac{c^{t+\Delta t}_{p,q} - c^{t}_{p,q}}{\Delta t} = M \, \frac{\mu^{t}_{p+1,q} + \mu^{t}_{p-1,q} + \mu^{t}_{p,q+1} + \mu^{t}_{p,q-1} - 4\mu^{t}_{p,q}}{(\Delta x)^2} \tag{78}$$

$$\mu^{t}_{p,q} = \left( \frac{\partial f_0}{\partial c} \right)^{t}_{p,q} - 2\kappa \, \frac{c^{t}_{p+1,q} + c^{t}_{p-1,q} + c^{t}_{p,q+1} + c^{t}_{p,q-1} - 4c^{t}_{p,q}}{(\Delta x)^2} \tag{79}$$

While such a discretization is simple to formulate, it has problems, the most
important being the limit on the size of time step $\Delta t$ that one can use (see
Zhu et al. (1999) for details).
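Equations (78) and (79) translate almost line by line into code. The sketch below uses plain Python lists and assumes the double-well form $f_0(c) = A c^2 (1 - c)^2$ (Eq. (72) with $c_\alpha = 0$, $c_\beta = 1$); the function names, parameter values and the discrete free energy used for monitoring are choices made for this example.

```python
import random


def laplacian(field, dx):
    """Five-point Laplacian on a periodic N x N grid."""
    n = len(field)
    return [[(field[(p + 1) % n][q] + field[(p - 1) % n][q]
              + field[p][(q + 1) % n] + field[p][(q - 1) % n]
              - 4.0 * field[p][q]) / dx ** 2
             for q in range(n)] for p in range(n)]


def chemical_potential(c, dx, kappa, A):
    """Eq. (79) with the assumed f0 = A c^2 (1 - c)^2, so df0/dc = 2A c (1-c)(1-2c)."""
    lap_c = laplacian(c, dx)
    n = len(c)
    return [[2.0 * A * c[p][q] * (1.0 - c[p][q]) * (1.0 - 2.0 * c[p][q])
             - 2.0 * kappa * lap_c[p][q]
             for q in range(n)] for p in range(n)]


def euler_step(c, dt, dx, M, kappa, A):
    """One explicit Euler update of Eq. (78)."""
    lap_mu = laplacian(chemical_potential(c, dx, kappa, A), dx)
    n = len(c)
    return [[c[p][q] + dt * M * lap_mu[p][q] for q in range(n)] for p in range(n)]


def free_energy(c, dx, kappa, A):
    """Discrete analogue of Eq. (70) with N_V = 1, using forward differences."""
    n = len(c)
    total = 0.0
    for p in range(n):
        for q in range(n):
            grad2 = ((c[(p + 1) % n][q] - c[p][q]) ** 2
                     + (c[p][(q + 1) % n] - c[p][q]) ** 2) / dx ** 2
            total += A * c[p][q] ** 2 * (1.0 - c[p][q]) ** 2 + kappa * grad2
    return total * dx ** 2


def noisy_initial_field(n, c0, amplitude, seed=0):
    """Uniform composition c0 plus small random noise at every grid point."""
    rng = random.Random(seed)
    return [[c0 + amplitude * (rng.random() - 0.5) for _ in range(n)] for _ in range(n)]
```

With $\Delta x = M = \kappa = 1$ the fourth-derivative term alone restricts the explicit step to roughly $\Delta t < (\Delta x)^4/(64 M \kappa) \approx 0.016$, which illustrates the time-step limit mentioned above.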

Most of the modern, large-scale studies are based on Fourier spectral tech- 
niques, one of which, referred to as a semi-implicit Fourier spectral method, de- 
veloped by Zhu et al. (1999), is described below. 

We start with the Fourier transform of the governing equation.

Since the composition field is described on N points on a regular grid of
spacing $\Delta x$, the Fourier transform is also described on a regular grid in
k-space, with a spacing of $2\pi/(N\Delta x)$. Using a tilde (~) on a real-space
variable to denote its Fourier transform, the Fourier-transformed governing
equation in k-space has the form of Eq. (80), where $\tilde{g}$ is the Fourier
transform of $g(c) = \partial f_0/\partial c$.

$$\frac{\partial \tilde{c}(\mathbf{k}, t)}{\partial t} = -M k^2 \tilde{g}(\mathbf{k}, t) - 2 M \kappa k^4 \tilde{c}(\mathbf{k}, t) \tag{80}$$

Discretizing this equation with a backward difference in time, we get Eq. (81),
which can be rearranged to yield Eq. (82).

$$\frac{\tilde{c}(\mathbf{k}, t + \Delta t) - \tilde{c}(\mathbf{k}, t)}{\Delta t} = -M k^2 \tilde{g}(\mathbf{k}, t + \Delta t) - 2 M \kappa k^4 \tilde{c}(\mathbf{k}, t + \Delta t) \tag{81}$$

$$\tilde{c}(\mathbf{k}, t + \Delta t) = \frac{\tilde{c}(\mathbf{k}, t) - M \Delta t \, k^2 \, \tilde{g}(\mathbf{k}, t + \Delta t)}{1 + 2 M \kappa k^4 \Delta t} \tag{82}$$

Note that the right-hand side of Eq. (82) has the term $\tilde{g}$, which needs to
be evaluated at time $t + \Delta t$; since the real-space function
$g(c) = \partial f_0/\partial c$ has nonlinear terms in c, it is difficult to
evaluate its Fourier transform at the end of the time step. Thus, we approximate it
using $\tilde{g}(\mathbf{k}, t + \Delta t) \approx \tilde{g}(\mathbf{k}, t)$ to get a
semi-implicit formulation. It is this formulation that most modern studies use; it
is applicable for bulk systems that are modeled using periodic boundary conditions.
We note that Zhu et al. (1999) have also shown how this method can be extended to
study systems in which the atomic mobility M is a position-dependent variable.
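A minimal NumPy sketch of the semi-implicit update of Eq. (82), with $\tilde{g}$ evaluated at time t, is given below. It is an illustration under assumptions, not the implementation of Zhu et al. (1999): the double well $f_0(c) = A c^2 (1 - c)^2$, the constant mobility, the parameter values and the function name are all chosen for this example.

```python
import numpy as np


def semi_implicit_step(c, dt, dx, M, kappa, A):
    """One semi-implicit Fourier spectral step for the 2D Cahn-Hilliard equation.

    c is a real N x N array on a periodic grid; the assumed bulk free energy is
    f0 = A c^2 (1 - c)^2, so that g(c) = df0/dc = 2 A c (1 - c)(1 - 2c).
    """
    n = c.shape[0]
    k1 = 2.0 * np.pi * np.fft.fftfreq(n, d=dx)      # k-grid with spacing 2*pi/(N*dx)
    kx, ky = np.meshgrid(k1, k1, indexing="ij")
    k2 = kx ** 2 + ky ** 2

    g = 2.0 * A * c * (1.0 - c) * (1.0 - 2.0 * c)   # nonlinear term, taken at time t
    c_hat = np.fft.fft2(c)
    g_hat = np.fft.fft2(g)

    # Eq. (82) with g~(k, t + dt) approximated by g~(k, t).
    c_hat_new = (c_hat - M * dt * k2 * g_hat) / (1.0 + 2.0 * M * kappa * dt * k2 ** 2)
    return np.fft.ifft2(c_hat_new).real
```

Since $k^2 = 0$ for the zero wave vector, that Fourier component is left unchanged by the update, so the average composition is conserved to rounding error. In this sketch the step can also be taken considerably larger than in the explicit finite-difference scheme.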


Since the governing equations are all partial differential equations, other tech- 
niques are also available; we note in particular the finite element method (Canuto 
et al. 1988) and the control volume method (Patankar 1980; Ferziger and Peric 
2002). 

For a given number of grid points N and a given total number of time steps, we
would like to use in our simulations the largest possible values of $\Delta x$ and
$\Delta t$. However, since the interface is compositionally diffuse, and has a
finite width, we have to ensure a sufficiently large number of grid points at the
interface to resolve the composition profile there. Typically, we use four to six
grid points to resolve the interface; i.e., $w = n\Delta x$, with $4 \le n \le 6$.

Thus, for a new phase field model, one has first to compute, as accurately as 
possible, the equilibrium profile of the field variable across a planar interface, 
and estimate its width w. In the more elaborate, large-scale simulations, one 
then ensures that w spans at least four grid points. 


12.4.4 
Application: Spinodal Decomposition 


Figure 12.10 shows the evolution of the composition field during spinodal
decomposition in a symmetric alloy ($c_0 = 0.5$) quenched to a low temperature at
which it finds itself well inside the miscibility gap in Fig. 12.7. Since the free
energy $f_0(c)$ exhibits negative curvature at this composition, the alloy is
unstable with respect to small composition fluctuations, which are introduced into
the simulations through a small random noise at every grid point.

Fig. 12.10 Microstructural evolution during spinodal decomposition of an initially
homogeneous alloy at high temperature which is quenched to a low temperature into
the miscibility gap in Fig. 12.7. The simulations are based on solving the
Cahn-Hilliard equation [Eq. (75)]. The dark and light shades represent A-rich and
B-rich phases, respectively. The average composition $c_0$ remains conserved during
the simulations. The figure on the left is for an earlier time than the right-hand
figure.

However, we must mention that the Cahn-Hilliard model, as developed here,
is incapable of studying early stages of phase separation in alloys outside the
spinodal. In these alloys (say, A-rich alloys), phase separation requires
nucleation of B-rich islands; since the process of nucleation increases the system
free energy, and the Cahn-Hilliard equation ensures a strictly decreasing F,
nucleation events cannot be studied. A possible way out of this problem is to use
thermally induced fluctuations (a noise term) in the equation (such an equation is
termed the Cahn-Hilliard-Cook equation (Cook 1970)) at every time step (see Binder
and Fratzl (2001) for further details on the formulation). Simulation of nucleation
through this technique, however, is time-consuming. Moreover, in many cases
one is interested in microstructural evolution during growth and, more
importantly, during coarsening (Ostwald ripening). In these situations, one simply
introduces viable nuclei of the B-rich phase into the system at the beginning of
the simulations.


12.4.5 
Cahn-Allen Model 


Some alloys (for example, Cu-Zn) exhibit a disorder-to-order transition; at high
temperatures, the alloy is in the form of a random solid solution which, at low
temperatures, becomes ordered. For example, a bcc alloy, with an A2 structure,
may undergo this transition to exhibit a B2 structure, in which atoms of species
A and B occupy, preferentially, the corner and body-center sites, respectively (or
vice versa). If the A2 structure is thought to be composed of a sum of two
interpenetrating simple cubic lattices (referred to as $\alpha$- and
$\beta$-sublattices), the B2 structure is obtained when the two sublattices are
differentially occupied by species B; in the parent A2 structure, species B
occupies the two sublattices randomly.

Clearly, an alloy in the B2 structure possesses long-range order, but is
disordered in the A2 structure at high temperatures. A parameter $\eta$ that
quantifies this long-range order may be defined in many ways. A popular choice for
$\eta$ for describing the B2 structure is given by Eq. (83), where $x_B^{\alpha}$
(or $x_B^{\beta}$) is the fraction of $\alpha$ (or $\beta$) sublattice sites
occupied by B atoms.

$$\eta = x_B^{\beta} - x_B^{\alpha} \tag{83}$$

From this definition, configurations with $(+\eta)$ and $(-\eta)$ are equivalent.
However, a region in the alloy that has B atoms on the $\beta$ sublattice (i.e.,
$\eta > 0$) and another that has them on the $\alpha$ sublattice (with $\eta < 0$)
would find themselves separated by an “antiphase boundary” (APB).

The order parameter as defined above is not in all cases just a scalar quantity; it
can be a multicomponent (vector) parameter for structures made of more than two
sublattices, such as the L1$_2$ structure. In contrast to the central quantity of
the Cahn-Hilliard theory, the concentration, the order parameter is a nonconserved
quantity, i.e., its integral value over the whole sample is not constant during
the temporal evolution.

Fig. 12.11 Schematic of the dependence of the bulk free energy density
$f_0(\eta)$ on the order parameter $\eta$ in the Cahn-Allen model.


The model of Allen and Cahn (1979) describes the energy and dynamics of such an
APB, which may be seen as a region of finite width across which the order
parameter varies continuously (from $+\eta$ to $-\eta$). The arguments used for
developing the Cahn-Allen model, therefore, are the same as those for the
Cahn-Hilliard model.

Following the same steps as for the Cahn-Hilliard model, the total free energy
may be written as Eq. (84), where $f_0(\eta)$ is the (bulk) free energy density of a
homogeneous alloy with uniform $\eta$; $f_0(\eta)$ is shown schematically in Fig. 12.11.


$$F = N_V \int_V \left[ f_0(\eta) + \kappa_\eta (\nabla \eta)^2 \right] dV \qquad (84)$$


12.4.5.1 Kinetics 

Since the order parameter is not conserved, Cahn and Allen postulated Eq. (85)
for the dynamics, where L is a (positive) relaxation coefficient. This postulate
implies that, locally, the order parameter evolves in response to the local
driving force $\delta F/\delta \eta$.


$$\frac{\partial \eta}{\partial t} = -L\,\frac{\delta (F/N_V)}{\delta \eta} \qquad (85)$$


Substituting Eq. (84) into this equation, we get the Cahn—Allen equation (which, 
in physics literature, goes by the name “time-dependent Ginzburg-Landau 
(TDGL) equation”) [Eq. (86)]. 


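$$\frac{\partial \eta}{\partial t} = -L \left[ \frac{\partial f_0}{\partial \eta} - 2 \kappa_\eta \nabla^2 \eta \right] \qquad (86)$$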


Since this equation is similar in form to (but simpler than) the Cahn-Hilliard
equation, the derivation of the semi-implicit Fourier spectral procedure for
numerically solving this equation is not presented here. Figure 12.12 shows the
evolution of ordered domains during the ordering of an initially disordered alloy,
simulated by solving the Cahn-Allen equation (86).

Fig. 12.12 Microstructural evolution in an initially disordered ($\eta = 0$)
alloy quenched to a low temperature (below the disorder-to-order, or
critical, temperature $T_c$). The dark and light shades represent regions
with $\eta < 0$ and $\eta > 0$, respectively. The simulation is based on solving
the Cahn-Allen equation (86). In contrast to Fig. 12.10, the average
value of $\eta$ in these figures is not conserved during the simulation.
The left-hand figure is for an earlier time than the right-hand figure.
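
For readers who want to experiment, a minimal Python sketch of one possible
semi-implicit Fourier spectral scheme for the Cahn-Allen equation,
$\partial\eta/\partial t = -L[\partial f_0/\partial\eta - 2\kappa_\eta\nabla^2\eta]$,
is given here. It is not the authors' code. The double-well form
$f_0 = \eta^4/4 - \eta^2/2$, the parameter values, the grid size, and all function
names are assumptions chosen for illustration. The nonlinear term is treated
explicitly and the gradient term implicitly, which in Fourier space gives
$\hat\eta^{\,new} = (\hat\eta - L\,\Delta t\,\widehat{f_0'}) / (1 + 2 L \kappa_\eta \Delta t\, k^2)$.

```python
import numpy as np


def allen_cahn_step(eta, k2, L=1.0, kappa=1.0, dt=0.1):
    """Advance eta by one semi-implicit Fourier spectral time step."""
    # Assumed double-well bulk free energy f0 = eta**4/4 - eta**2/2,
    # so df0/deta = eta**3 - eta (minima at eta = +1 and eta = -1).
    dfdeta = eta**3 - eta
    eta_hat = np.fft.fft2(eta)
    dfdeta_hat = np.fft.fft2(dfdeta)
    # Explicit nonlinear term, implicit gradient term:
    # (eta_new - eta)/dt = -L*dfdeta_hat - 2*L*kappa*k2*eta_new
    eta_hat_new = (eta_hat - L * dt * dfdeta_hat) / (1.0 + 2.0 * L * kappa * dt * k2)
    return np.real(np.fft.ifft2(eta_hat_new))


def wavenumbers_squared(n, dx=1.0):
    """Return |k|^2 on an n x n periodic grid with spacing dx."""
    k = 2.0 * np.pi * np.fft.fftfreq(n, d=dx)
    kx, ky = np.meshgrid(k, k, indexing="ij")
    return kx**2 + ky**2


def simulate(n=64, steps=500, seed=0, noise=0.01, **params):
    """Evolve a nearly disordered state (eta close to 0) for a number of steps."""
    rng = np.random.default_rng(seed)
    eta = noise * rng.standard_normal((n, n))  # small fluctuations about eta = 0
    k2 = wavenumbers_squared(n)
    for _ in range(steps):
        eta = allen_cahn_step(eta, k2, **params)
    return eta
```

Calling `simulate()` and plotting the sign of the returned array at several
values of `steps` should give domain patterns of the kind described for
Fig. 12.12. Because the order parameter is not conserved, the spatial average
of the field is free to drift during the run.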


12.4.6 
Generalized Phase Field Models 


12.4.6.1 Key Features of Phase Field Models 

Our discussion of the Cahn-Hilliard and Cahn-Allen models now allows us to 
list the key features of almost all phase field models used for studying microstruc- 
tural evolution in materials. 

First, they use a continuum mathematical description in which one or more 
position-dependent field variables are used for specifying a system's configuration 
(microstructure). Each configuration is associated with an appropriate energy; 
this energy is written as a functional of the field variables. In particular, this func- 
tional has two contributions: a bulk free energy density, and a gradient energy 
density. The dynamics of the field variables are governed by a set of partial differ- 
ential equations: the Cahn-Hilliard equation for conserved fields, and the Cahn- 
Allen equation for nonconserved fields. 

As with the other simulation techniques in this chapter, phase field models (a) 
provide a mathematical description of an instantaneous configuration of the sys- 
tem, (b) describe an appropriate energy (in the case of isothermal, isobaric 
systems, it would be the Gibbs free energy) of a configuration, and (c) describe 
a method by which the system evolves from one configuration to the next. 




In this section, let us consider such generalized phase field models and how 
they have been applied in studies of interesting phenomena. The most important 
step is in the choice of an appropriate set of field variables to describe a micro- 
structure. This choice then drives the remaining steps. 


12.4.6.2 Precipitation of an Ordered Phase 

A simple generalization is a model for studying precipitation of an ordered phase.
Since the matrix and precipitate phases have different compositions, $c(\mathbf{r},t)$ is a
relevant field variable; further, the precipitate is also an ordered phase, so
$\eta(\mathbf{r},t)$ is also a relevant field variable, yielding Eq. (87), with the bulk free
energy density $f(c,\eta)$ obtained from (bulk) thermodynamics of the alloy system. It
must yield the two-phase equilibrium between the matrix (characterized by
$(c,\eta) = (c_m, 0)$) and the precipitate (with $(c,\eta) = (c_p, \eta^*)$) phases.
Microstructural evolution is governed by a set of two coupled equations; one is the
Cahn-Hilliard equation for composition, and the other a Cahn-Allen equation for the
order parameter.


$$F = N_V \int_V \left[ f(c,\eta) + \kappa_c (\nabla c)^2 + \kappa_\eta (\nabla \eta)^2 \right] dV \qquad (87)$$


It is now possible to generalize this model further to take into account more
than one order parameter. Additional order parameters are needed when one considers
ordered phases that have several (more than two) variants. For example, in
Ni-Al-based superalloys, the matrix is disordered, with an fcc structure, and the
precipitate is ordered, with the L1$_2$ structure. Considering the fcc lattice as a
combination of four interpenetrating simple cubic (sub)lattices, the L1$_2$ structure is
obtained when one of them is preferentially occupied by B atoms, and the other
three by A atoms; in the perfectly ordered state, the alloy composition would be
A$_3$B. An (imperfectly) ordered alloy with the L1$_2$ structure may be described using
three independent concentration waves, whose amplitudes can be associated with three
order parameters $(\eta_1, \eta_2, \eta_3)$ (Wang et al. 1998).

The four variants of the ordered structure that can be formed from the same
disordered fcc lattice are then characterized by $(\eta,\eta,-\eta)$, $(\eta,-\eta,\eta)$,
$(-\eta,\eta,\eta)$, and $(-\eta,-\eta,-\eta)$. With $f(c,\eta_1,\eta_2,\eta_3)$ as the bulk
free energy density, the total free energy would be given by Eq. (88).

$$F = N_V \int_V \left[ f(c,\eta_1,\eta_2,\eta_3) + \kappa_c (\nabla c)^2 + \sum_{p=1}^{3} \kappa_\eta (\nabla \eta_p)^2 \right] dV \qquad (88)$$

The expression for the bulk free energy density $f(c,\eta_1,\eta_2,\eta_3)$ must obey certain
symmetry properties; see Wang et al. (1998) for details. The evolution equations
now consist of a Cahn-Hilliard equation for composition, and a set of three
Cahn-Allen equations, one for each order parameter $\eta_p$.

Using such a model, Wang et al. (1998) have studied precipitation of $\gamma'$
precipitates from the $\gamma$ matrix in Ni-Al alloys. Similar models have been used by
Wen et al. (1999) in their study of precipitation of an orthorhombic phase from a
hexagonal phase in Ti-Al-Nb alloys.


12.4.6.3 Grain Growth in Polycrystals 
A polycrystalline material has grains separated by grain boundaries. At
sufficiently high temperatures, the grain boundaries migrate due to their curvature,
leading to grain coarsening (often referred to as grain growth). In order to study
this process, Fan and Chen (1997) used a model in which each grain (say, the pth
grain) is associated with an “orientational order parameter” $\eta_p$ such that
$\eta_p = 1$ in the region occupied by the pth grain, and $\eta_p$ is zero everywhere
outside it. In this model, a grain boundary between the pth and qth grains is a region
in which $\eta_p$ and $\eta_q$ exhibit sharp gradients. Further, for a polycrystal with
n grains, this model requires the use of n orientational order parameters. In effect,
this model can be thought of as the continuum equivalent of the n-state Potts model
(Potts 1952).

Since the grain boundary can migrate, thus transforming regions from one 
grain to another, the order parameter is clearly not conserved; Fan and Chen de- 
rived a set of Cahn—Allen equations to describe the dynamics of the order param- 
eters. 

To see how this model works, here is the free energy functional [Eq. (89)]. 


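$$F = N_V \int_V \left[ f(\eta_1,\eta_2,\ldots,\eta_n) + \sum_{p=1}^{n} \kappa_p (\nabla \eta_p)^2 \right] dV \qquad (89)$$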


The bulk free energy is formulated in such a way that it has n degenerate minima
(wells of equal depth) at locations (1,0,0,...,0), (0,1,0,...,0), ..., (0,0,0,...,1).
For a system with only two grains, $\eta_1$ and $\eta_2$ are sufficient to describe the
microstructure; for such a system $f(\eta_1,\eta_2)$, illustrated in Fig. 12.13, would have
two absolute minima at $(\eta_1,\eta_2)$ values of (1,0) and (0,1); further, since these two
configurations (which represent two different grains) are equivalent to each other,
the free energy values at these two minima would also be equal.

Fig. 12.13 Schematic of the dependence of the bulk free energy density
on two order parameters (which represent two arbitrarily different grain
orientations) in the Fan-Chen model. The free energy surface has two
absolute minima at $(\eta_1,\eta_2) = (1,0)$ and (0,1). The free energy values at
the two minima are the same.

Fig. 12.14 Microstructural evolution during spinodal decomposition in
a polycrystalline alloy. The simulation is based on a phase field model
that combines the Cahn-Hilliard model for a phase separating alloy
with the Fan-Chen model for a polycrystal. This particular system is
characterized by large values of atomic mobility M and grain boundary
mobility L. The scaled simulation time is shown above each
microstructure (from Ramanarayan 2004).
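
The text does not give an explicit expression for the bulk free energy density.
As a hedged illustration, the Python sketch below uses a polynomial of a kind
often adopted for multi-order-parameter grain growth models,
$f = \sum_p (-\tfrac{a}{2}\eta_p^2 + \tfrac{b}{4}\eta_p^4) + g \sum_{p \neq q} \eta_p^2 \eta_q^2$.
The coefficients `a`, `b`, `g` and the function names are assumptions for this
example. With this particular choice the wells at $\eta_p = -1$ are also minima,
so it should be read as one possible realization of degenerate wells, not as the
form used by the authors.

```python
import numpy as np


def bulk_free_energy(eta, a=1.0, b=1.0, g=1.0):
    """Assumed polynomial f(eta_1, ..., eta_n) with wells of equal depth."""
    sq = np.asarray(eta, dtype=float) ** 2
    single = np.sum(-0.5 * a * sq + 0.25 * b * sq**2)
    # Sum over ordered pairs p != q of eta_p^2 * eta_q^2.
    cross = g * (np.sum(sq) ** 2 - np.sum(sq**2))
    return single + cross


def bulk_free_energy_gradient(eta, a=1.0, b=1.0, g=1.0):
    """Gradient of the assumed f with respect to each eta_p."""
    eta = np.asarray(eta, dtype=float)
    sq = eta**2
    return -a * eta + b * eta**3 + 4.0 * g * eta * (np.sum(sq) - sq)


# Two-grain example: the two wells have equal depth and are stationary points,
# while a point in between (as found inside a grain boundary) has higher energy.
f_grain_1 = bulk_free_energy([1.0, 0.0])
f_grain_2 = bulk_free_energy([0.0, 1.0])
f_boundary = bulk_free_energy([0.5, 0.5])
grad_at_well = bulk_free_energy_gradient([1.0, 0.0])
```

The energy excess of intermediate states such as (0.5, 0.5) over the wells is
what gives a grain boundary a positive energy in this kind of model.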

The Fan—Chen model was combined with the Cahn-Hilliard model in a recent 
study (Ramanarayan 2004) of grain boundary effects on spinodal decomposition. 
A sample result from this work is shown in Fig. 12.14. In this example, migrating 
grain boundaries act as “transformation fronts” at which spinodal decomposition 
takes place (due to fast diffusion at the grain boundary); the resulting microstruc- 
ture exhibits alternating lamellae of (incompletely transformed) A-rich and B-rich 
regions lying normal to the (migrating) grain boundary. This example shows 
clearly that phase field models can be used in simulations of phenomena that 
could lead to highly complex microstructures. 

12.4.6.4 Solidification

As a final example, let us consider a phase field model which is in wide use in 
solidification studies. For simplicity, we consider solidification in a pure metal. 
The microstructure is described in terms of a phase field variable $\eta$ that takes a
value of zero in the liquid and unity in the solid; the solid-liquid interface is a
region where $\eta$ exhibits a large (but continuous) gradient. This continuum
description has several advantages over the sharp interface models that were in
wide use earlier: with a phase field model, there is no need to track the location 
of the interface at every time step. Thus, it is easy to study complex shapes and 
topological changes (such as a merger of two dendritic arms). 

Solidification of a region from the liquid state to the solid state is signified by a
rise in the value of $\eta$ in and near the interface region, accompanied by an
evolution of latent heat which has to be transported away. Thus, one uses a coupled set
of equations governing the evolution of two field variables: the phase field $\eta$ and
the temperature field T; see Langer (1986) and Kobayashi (1993) for details.

Since the phase field variable $\eta$ has been introduced into the model largely for
mathematical convenience, early studies were devoted to establishing that this 
model does lead to results that are consistent with those from classical sharp- 
interface theories. They established, for example, that it reproduces the Gibbs- 
Thomson correction to the melting point due to a curved interface. We refer the 
reader to a recent review by Karma (2001) and to the text by Davis (2001) for fur- 
ther details. 


12.4.7 
Other Topics 


The popularity of phase field techniques is due to the ease with which many im- 
portant effects can be incorporated into the formalism. A recent review by Chen 
(2002) discusses a wide variety of phenomena where these models have been 
used to study microstructural evolution. In addition to the ones we have covered, 
these phenomena include ferroelectric transformations, martensitic transforma- 
tions, phase transformations in thin films, dislocation dynamics, and crack prop- 
agation. In this section, we restrict ourselves to a brief discussion of develop- 
ments in incorporating into phase field models (a) an interfacial energy 
anisotropy and (b) elastic stress effects. 


12.4.7.1 Anisotropy in Interfacial Energy 

In many crystalline systems, the interfacial energy $\sigma$ is anisotropic. Since the
gradient energy coefficient tensor $\kappa_{ij}$ is of second rank, systems with lower
symmetries (tetragonal, orthorhombic, etc.) can be studied in a limited way using
anisotropic versions of $\kappa_{ij}$; however, for cubic systems, this tensor is isotropic.
Thus, systems with anisotropic $\sigma$ (particularly those with cubic anisotropy) require
other strategies. The most popular strategy (due to Kobayashi 1993) is to use a scalar
(or isotropic) value of $\kappa$ which depends on the interface orientation (identified with
the direction of the local gradient in order parameter). A second method is to
extend the original Cahn-Hilliard formalism to include higher-order terms in the
Taylor series expansion of the local free energy f in Eq. (63). This idea, alluded
to by Langer (1986) and Taylor and Cahn (1998), was pursued by Abinandanan
and Haider (2001), who worked out all the fourth-order terms necessary for
describing cubic anisotropy in $\sigma$, and used one of them to study the finite size
dependence of particle shapes. A third possibility is to work with
multiple-order-parameter models, such as those required for describing the L1$_2$
structure (Braun et al. 1998).


12.4.7.2 Elastic Strain Energy 

In many solid systems with multiple phases (such as the superalloys which have
a Ni-rich fcc solid solution as the matrix and the L1$_2$-ordered intermetallic
solution Ni$_3$Al as the precipitate phase), the interface is coherent. In such systems, a
lattice parameter mismatch between the two phases introduces elastic strains
everywhere, which have been shown to have a strong influence on microstructural
evolution. For example, in the Ni-Al system, particles of the precipitate phase
change their shape from spheres to cuboids and, still later, to plates. These
particles also become aligned along the $\langle 100 \rangle$ directions of the Ni-rich
matrix (Khachaturyan 1983; Johnson 1999).

These elastic stress effects have been incorporated into the phase field models
by using the simplest approximation, in which the total free energy of the system
$F_t$ is a simple sum of the elastic strain energy of the system $F_{el}$ and the chemical
free energy $F_{ch}$ [Eq. (90)].

$$F_t = F_{ch} + F_{el} \qquad (90)$$

The chemical part is given in the usual form of a functional, such as Eq. (70). The
rest of the formalism (for example, the definition of chemical potential in Eq.
(76)) remains the same, except that F in these equations is replaced by $F_t$. The
elastic stress effects work in the following way in such models: the order parameter
determines the lattice parameter (or, effectively, the misfit strain), which, in
turn, results in a stress field. Thus, even though $F_{el}$ is introduced as an additive
term in the above equation, it has an implicit dependence on the order parameter
distribution and hence the microstructure; it is through this coupling that it is
able to exert an influence back on the microstructure and its evolution.

Diffusional equilibration (relaxation) of the composition field is far slower than
the elastic (mechanical) equilibration of the stress field. Thus, for a given
configuration (at the beginning of the time step), we may compute the elastic stress
field, and use it in solving the evolution equations for the order parameter fields
to obtain the configuration at the end of the time step. This implies that incorporating
elastic stress effects into the phase field formalism requires, first, a description of
how the misfit strain depends on the order parameter(s) and, second, an additional
module that computes the elastic stress field for a given distribution of
misfit strains. For the second part, Fourier transform formalisms exist (Khachaturyan
1983; Mura 1982), making it easy to include this module in the usual
(Fourier transform-based) solution of the evolution equations.


12.5 
Outlook 


Further progress in the development of computational techniques, but of course 
also of computer technology, will make it possible to study larger systems on 
longer time scales. Nevertheless, no drastic progress can be expected here; instead,
it is necessary to combine the respective methods into multiscale integrated
codes. The somewhat futuristic expectation is to generate effective potentials from 
ab-initio codes, derive short-term (and local) development using molecular statics 
or dynamics methods, and finally use Monte Carlo, phase field, or finite element 
codes to compute the large-scale, long-term evolution of an alloy (see, e.g., Yip 
2005; Lu 2005). On the other hand, reduced dimensions as in nanotechnology 
and in thin-film devices get into length scales which can be realistically modeled 
with current techniques. 


Appendix 


Let us assume that a Metropolis-type MC algorithm is used. In such a case the
probability $p_i$ of the particular ith atom from the first coordination shell of the
vacancy executing the jump is given by Eq. (A1), where the meaning of $w_i$ (atomic
jump frequency) is analogous to $W(a_i \to a_j)$ in Eqs. (49) and (50).

$$p_i = \frac{1}{Z} \times \tau \times w_i \qquad (A1)$$

The factor 1/Z accounts for the initial random choice among the Z nearest
neighbors of the vacancy. The parameter $\tau$ has in fact the meaning of a reciprocal
jump-attempt frequency discussed in Chapter 5.

The corresponding probability $\bar{p}_i$ for the event that the ith atom is selected, but
the jump is rejected, is given by Eq. (A2).

$$\bar{p}_i = \frac{1}{Z} \times (1 - \tau \times w_i) = \frac{1}{Z} - p_i \qquad (A2)$$

The probability $p_i$, referred to as the “a posteriori” probability (Athenes et al.
1997), defines the effective time scale constant $\tau' = \tau/Z$ of the simulated process.
Accordingly, after each Metropolis-type MC step time has to be incremented by $\Delta t$
[Eq. (A3)].

$$\Delta t = \tau/Z \qquad (A3)$$
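
The consistency of this time increment can be checked numerically. In the Python
sketch below, one of Z neighbors is chosen at random in each MC step, its jump is
accepted with probability $\tau w_i$, and the clock is advanced by $\tau/Z$ per
step. The measured jump rate of each atom should then approach its frequency
$w_i$. The frequencies, the choice $\tau = 1/\max_i w_i$, the number of steps, and
the simplification that the $w_i$ stay fixed (the configuration is not updated
after a jump) are all assumptions made for this illustration.

```python
import numpy as np


def measured_jump_rates(w, n_steps=1_000_000, seed=0):
    """Estimate per-atom jump rates from Metropolis-type attempts."""
    w = np.asarray(w, dtype=float)
    z = w.size
    tau = 1.0 / w.max()  # assumed choice so that tau * w_i <= 1 for all i
    rng = np.random.default_rng(seed)
    chosen = rng.integers(0, z, size=n_steps)          # each neighbor with prob. 1/Z
    accepted = rng.random(n_steps) < tau * w[chosen]   # accept with prob. tau * w_i
    counts = np.bincount(chosen[accepted], minlength=z)
    total_time = n_steps * tau / z                     # clock advances tau/Z per step
    return counts / total_time


# Hypothetical jump frequencies for Z = 8 neighbors (arbitrary units).
frequencies = np.array([1.0, 0.5, 0.25, 0.1, 1.0, 0.5, 0.25, 0.1])
rates = measured_jump_rates(frequencies)
```

Within statistical scatter, `rates` reproduces `frequencies`, which is the sense
in which $\tau/Z$ is the appropriate time increment per MC step.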
This becomes clearer if one calculates the probability for the event that the
ith atom jumps to the vacancy after n subsequent unsuccessful attempts preceding
this jump. The probability of the event is given by Eq. (A4), and the total time
increment to be attributed to the event by Eq. (A5).

$$P_n = \bar{p}_i^{\,n} \times p_i \qquad (A4)$$

$$\Delta t_{n+1} = \frac{(n+1) \times \tau}{Z} \qquad (A5)$$

It is obvious that $\Delta t_{n+1}$ has the meaning of the residence time for the ith
atom on the initial lattice site.


The average residence time for this atom is, therefore, given by Eq. (A6), which
is perfectly consistent with the considerations presented in Section 12.3.6.3
[Eqs. (54) and (55)].


oo 


Atn 1 = 


NI a 

13
High-Resolution Experimental Methods 


13.1 
High-Resolution Scattering Methods and Time-Resolved Diffraction 
Bogdan Sepiol and Karl F. Ludwig 


13.1.1 
Introduction: Theoretical Concepts, X-Ray, and Neutron Scattering Methods 


Detailed information about the microstructure is without doubt the most impor- 
tant and fundamental information necessary to start further analysis of any alloy. 
Generally, one can recognize a few levels for microstructure analysis: (1) the way 
the atoms of an alloy are arranged on a theoretical mean structure; (2) all the 
kinds of displacements from the mean structure; and (3) the dynamical behavior 
of constituent atoms including their vibrational properties and the motion of 
atoms between lattice sites. Unfortunately, investigation of any of these levels for 
a specific alloy is an extremely time-consuming and difficult task. Up to now, only 
very few alloys have been explored in all or in most facets. Usually, only selected 
features of the investigated system are studied, and analogies to similar known 
structures are used to make conclusions about those properties which are not ac- 
cessible to direct exploration. A deep knowledge of some selected model systems 
therefore appears even more important. 

A detailed characterization of the structure of the system on an atomic scale is
the basis for a thorough understanding of diffusion in the solid state, where it is
necessary to be familiar with both the collective (phonons) and single-particle
dynamics (usually called diffusion). In most cases this requires the combination of
different experimental techniques. X-ray and thermal neutron scattering are standard
techniques for the investigation of the structural properties of alloys.
Information about the mean structure is contained in the Bragg peaks and for this
kind of study electron scattering is also a valuable technique. Displacements of
atoms from the mean structure are, however, better detected quantitatively by
X-ray and thermal neutron scattering. These displacements result in a scattering
intensity between Bragg peaks and are caused by static atomic displacements and
by short-range order phenomena. The corresponding approaches are generally
called diffuse scattering methods (for reviews see, e.g., Schweika 1998; Schonfeld
1999). Because of the low cross-section for scattering, the extensive development
of X-ray diffuse scattering methods has relied on the rapid growth in available
X-ray brilliance at successive generations of synchrotron sources (Fig. 13.1).

Fig. 13.1 Development of the brilliance of X-ray sources, i.e., the
number of photons emitted in unit time by a unit source area in a unit
solid angle in a relative bandwidth of $10^{-3}$ for a given energy.

The dynamical properties of hard and soft condensed matter are of considerable
importance from a fundamental point of view, but are in addition essential
for the functionality of nanoscale devices. The wavelengths of X-rays and neutrons
in principle allow them to probe dynamics at much shorter length scales
than is possible with visible light scattering (see Fig. 13.2), though X-ray
techniques in this area have only become feasible with the advent of synchrotron
radiation sources. The role of dynamical properties becomes increasingly relevant
with decreasing size of the structural units, and even novel dynamical phenomena
are expected in nanostructures. The performance of nanoscale devices will
be strongly influenced by finite-size effects and the dimensionality of the system.
The microscopic origin of these effects is still far from being understood. A
thorough understanding opens the way to tailor the development of future functional
nanoscale systems. Since the properties of low-dimensional structures are
significantly different from those of corresponding bulk materials, new methods have to
be developed for the experimental characterization and the theoretical modeling.
An efficient way to achieve this is to use the brilliant X-rays from modern
synchrotron radiation sources or to use neutron scattering methods to study the
dynamical properties, usually under ultrahigh vacuum conditions. Nuclear resonant
scattering of synchrotron radiation and neutron scattering are powerful methods
for studying vibrational properties, diffusion and growth, and magnetic processes
especially due to their high spatial and temporal resolutions. A distinct advantage
of the technique of nuclear resonant scattering is that it is isotope-specific.
Compared to other methods, the signal is essentially free from contributions from
surrounding materials. Moreover, probe layers can be selectively deposited to study
the magnetic and dynamical properties with atomic resolution.

Fig. 13.2 Frequency/energy and spatial/wavenumber domains of
inelastic scattering probes of dynamics in materials.

In the mid-1990s, only very sparse information on diffusion and dynamics in 
pure metals and in intermetallic phases was available. Diffusion is a necessary 
precondition for most microstructural processes such as, for example, the nucle- 
ation of new phases, coarsening, and recrystallization, with a wide use in current 
technology, e.g., for surface hardening, changing deformation behavior by nucle- 
ation, diffusion doping, or sintering. Intermetallics further attracted attention 
as suitable materials for high-temperature applications (Sauthof 1995; Liu et al. 
2002). Knowledge of the diffusion behavior of intermetallics is, therefore, of fun- 
damental importance for basic materials science as well as for their use in tech- 
nological applications. 




13.1.2
Magnetic Scattering 


Neutron- and X-ray scattering techniques continue to play a central role in eluci- 
dating the magnetic structures of alloys, whether in bulk, thin films, or nano- 
structures. The spin of neutrons makes them a sensitive tool for the study of local 
magnetic moments in a material. Since the original work of Shull and collabora- 
tors beginning in the late 1940s (Shull and Smart 1949), elastic neutron scatter- 
ing has most often been the technique of choice for determining magnetic struc- 
ture and has successfully elucidated even complex ordering behavior, such as 
helices, helifans, cycloids, and multi-wavelength structures. Inelastic neutron 
scattering also has a long history of success in examining magnetic fluctuations 
and excitations. In the more recent decades, neutron capabilities have continued 
to evolve with the widespread application of polarized neutron scattering. 

In general, the interaction of the neutron spin with magnetic moments in ma- 
terials does not distinguish between contributions to the local magnetic density 
from electron orbital and from spin angular momentum. Nor does it distinguish 
between contributions of different chemical species. Moreover, some materials 
have sufficiently high neutron absorption cross-sections to make magnetic
scattering studies difficult. The nonresonant elastic magnetic X-ray scattering
amplitude for linearly polarized photons is below that of elastic charge scattering by a
factor of $E_\gamma/m_e c^2$. However, the high brilliance of synchrotron X-ray sources has
allowed magnetic X-ray scattering to develop in the past 20 years into a powerful 
alternative to neutron scattering, one that is able to overcome some of the limita- 
tions of magnetic neutron scattering. Making use of the natural polarization of 
synchrotron radiation, magnetic X-ray scattering can often separate the spin and 
orbital contributions to the magnetic moment by a polarization analysis of the dif- 
fracted beam. Moreover, the ability to tune the photon energy at these sources to 
utilize resonant enhancement effects now routinely provides opportunities to ex- 
amine chemically specific magnetic structure. Thus, magnetic X-ray scattering has 
developed into a strong complementary tool to magnetic neutron scattering. The 
literature on magnetic neutron- and X-ray scattering studies of alloys is so large 
that we do not attempt to survey it here. Instead, we discuss the current technical 
capabilities of each approach and some recent examples of their use. 


13.1.2.1 Magnetic Neutron Scattering 

There are a number of good review sources discussing the capabilities of magnetic
neutron scattering (Bacon 1975; Lovesey 1986; Sköld and Price 1987; Williams
1988; Balcar and Lovesey 1989; Squires 1997; Shirane et al. 2002). The differential
cross-section for elastic scattering of neutrons can be written as Eq. (1), where
the sum is over atoms l, and $s_i$ and $s_f$ are the initial and final neutron spin directions.

$$\frac{d\sigma}{d\Omega} \propto \left| \sum_l e^{i\mathbf{q}\cdot\mathbf{r}_l}\, U_l^{s_i s_f} \right|^2 \qquad (1)$$



If, for simplicity, we neglect scattering from the nuclear magnetic moment,
Eq. (2) can be written for $U_l^{s_i s_f}$, where $b_l$ is the nuclear scattering amplitude,
$p = \gamma g e^2/2mc^2$ the magnetic scattering amplitude, $\gamma$ the neutron gyromagnetic
ratio, $\boldsymbol{\mu}_{\perp,l}$ the component of the local magnetic moment that is perpendicular to
$\mathbf{q}$, and $\boldsymbol{\sigma}$ the Pauli spin matrix of the neutrons.


$$U_l^{s_i s_f} = b_l - p\,\boldsymbol{\mu}_{\perp,l}\cdot\boldsymbol{\sigma} \qquad (2)$$


We can therefore write the scattering cross-section as a sum of nuclear and
magnetic unit cell form factors as in Eq. (3), where the sum is over all unit cells and
$F_N$ and $F_M$ are nuclear and magnetic scattering structure factors for the unit cell.


2 
do 

We first discuss unpolarized magnetic neutron scattering. In this case the neutron
spins are randomly oriented so that the average of the cross-term between
nuclear and magnetic scattering vanishes. The nuclear and magnetic scattering
intensities then add incoherently. If the neutron beam is unpolarized and the
sample has a uniaxial magnetic moment orientation $\mathbf{m}$, it is useful to write the
elastic magnetic scattering cross-section as the square of the Fourier transform
of the component of the magnetic form factor that is perpendicular to $\mathbf{q}$,
$\boldsymbol{\mu}_\perp(\mathbf{q})$ [Eq. (4)].

$$\frac{d\sigma}{d\Omega} \propto |\boldsymbol{\mu}_\perp(\mathbf{q})|^2 = \left[ 1 - (\hat{\mathbf{q}}\cdot\hat{\mathbf{m}})^2 \right] |\mu(\mathbf{q})|^2 \qquad (4)$$


This term must be separated from the nuclear scattering if both terms contribute 
at the reciprocal space point of interest. With different degrees of success this can 
be done by varying the direction or amplitude of an applied magnetic field, or by 
increasing the sample temperature above the Curie or Néel temperature. 
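
Because only the moment component perpendicular to $\mathbf{q}$ contributes, a
uniaxial moment along $\mathbf{m}$ enters the magnetic intensity through the
purely geometric factor $1 - (\hat{\mathbf{q}}\cdot\hat{\mathbf{m}})^2$. The
short Python sketch below evaluates this factor for a few scattering geometries.
The function name and the example vectors are illustrative assumptions.

```python
import numpy as np


def magnetic_orientation_factor(q, m):
    """Return 1 - (q_hat . m_hat)^2 for scattering vector q and moment direction m."""
    q_hat = np.asarray(q, dtype=float) / np.linalg.norm(q)
    m_hat = np.asarray(m, dtype=float) / np.linalg.norm(m)
    return 1.0 - np.dot(q_hat, m_hat) ** 2


# Moment parallel to q: no magnetic intensity at this reflection.
parallel = magnetic_orientation_factor([1, 0, 0], [1, 0, 0])

# Moment perpendicular to q: full magnetic intensity.
perpendicular = magnetic_orientation_factor([1, 0, 0], [0, 0, 2])

# Moment at 45 degrees to q: half of the full magnetic intensity.
oblique = magnetic_orientation_factor([1, 1, 0], [1, 0, 0])
```

This dependence is what makes it possible to change the magnetic contribution
at a given reflection by rotating the moments with an applied field.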

By measuring a set of magnetic intensities at Bragg peaks in a single crystal or 
a polycrystalline sample, a set of magnetic form factor amplitudes can be deter- 
mined and the magnetic structure can often be solved in real space. Results of 
relatively early magnetic structure determinations were compiled by Oles et al. 
(1976) and by Cox (1972). While polarized magnetic neutron scattering has grown 
considerably in popularity relative to unpolarized studies, the high flux of unpo- 
larized neutrons available continues to make them attractive for experiments with 
low scattering cross-sections. For example, diffuse scattering can be examined to 
learn about short-range magnetic order in amorphous or paramagnetic materials 
in the wide-angle scattering regime (see, for instance, the study of amorphous 
Gd-Si alloys by Chumakov et al. 2005) or about magnetic inhomogeneities on 
larger scales in the small-angle scattering regime (see, for instance, the study of 
ferromagnetic amorphous Fe-Zr by Calderon et al. 2005). 

Inelastic and quasielastic unpolarized magnetic neutron scattering are also
powerful methods for the study of time-dependent magnetic phenomena. Using
a triple-axis geometry, they essentially measure the dynamic structure factor
[Eq. (5)].

$$S(\mathbf{q},\omega) = \sum_l \int_{-\infty}^{\infty} dt\, e^{i(\mathbf{q}\cdot\mathbf{r}_l - \omega t)} \left\langle \mu_{\perp,0}(0)\, \mu_{\perp,l}(t) \right\rangle \qquad (5)$$


The energy resolution $\Delta E$ of neutron spectrometers is usually greater than 10 neV
(corresponding to time scales shorter than $10^{-7}$ s) and more typically 0.1-0.5
meV. Quasielastic scattering refers to the examination of magnetic fluctuations
occurring over a broad range of energy scales. Although it has traditionally dealt
with relatively small energy scales, comparable to the resolution of the
spectrometer, magnetic fluctuations with broad spectra can give considerable weight even
at energies of several milli-electronvolts. As an example, Gaulin et al. (2002) have
used quasielastic unpolarized magnetic neutron scattering to examine the energy
scale and temperature dependence of incommensurate spin fluctuations in the
heavy fermion superconductor UNi$_2$Al$_3$. Another interesting recent example is a
study of the nature of magnetic waves in Zn-Mg-Tb quasicrystal alloys by Sato
et al. (2006).
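
The correspondence between energy resolution and accessible time scale quoted
above can be estimated from $t \approx \hbar/\Delta E$. The following Python
sketch performs this order-of-magnitude conversion. The use of $\hbar$ rather
than $h$ and the function name are assumptions made for this illustration, so
the numbers should be read as estimates only.

```python
HBAR_EV_S = 6.582119569e-16  # reduced Planck constant in eV s


def time_scale_from_resolution(delta_e_ev):
    """Estimate the longest accessible time scale, t ~ hbar / delta_E, in seconds."""
    return HBAR_EV_S / delta_e_ev


# An energy resolution of 10 neV corresponds to roughly 7e-8 s.
t_high_resolution = time_scale_from_resolution(10e-9)

# A more typical resolution of 0.1 meV corresponds to roughly 7e-12 s.
t_typical = time_scale_from_resolution(0.1e-3)
```

The first value lies just below $10^{-7}$ s, in line with the statement that
a resolution of 10 neV gives access to time scales shorter than that.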

Inelastic scattering from excitations with definite energies is usually distinct 
from quasielastic scattering. Inelastic unpolarized magnetic neutron scattering 
has been perhaps the greatest source of information about the dynamics of spin 
waves in materials, both zero-point fluctuations and thermal excitations. A nice 
recent example is the work of Szuszkiewicz et al. (2006), measuring spin waves 
in MnTe and modeling their dispersion relations to obtain exchange parameters 
in the material. 

One of the difficulties that can be encountered in interpreting the observed 
elastic neutron scattering is separating the magnetic scattering from the nuclear 
scattering. The use of polarized neutrons can resolve the situation, however, and 
polarized magnetic neutron scattering experiments have grown rapidly in num- 
ber during the past 25 years (for a comprehensive discussion see Williams 1988). 
Polarized neutron beams can be created by making use of interference between 
magnetic and nuclear Bragg scattering from ferromagnetic crystals (such as the 
Heusler alloy Cu2MnAl), by reflection from magnetized neutron mirrors (“supermirrors”
in which the critical angles of reflection are different for different spin
directions) or by using selective filters such as ¹⁴⁹Sm or polarized ³He nuclei.

The term U` in the neutron scattering cross-section changes sign when the 
neutron spin is reversed. Thus, one method of isolating the magnetic scattering 
component of the signal is to reverse the neutron spin using a spin-flipping coil
(a “flipper”) that changes the spin direction by an angle of π. A polarizing monochromator
crystal first produces a beam of polarized neutrons and a magnetic field at the 
sample orientates the sample magnetic moment. A flipper coil between the 
monochromator and the sample can then be selectively turned on to use the Larmor
precession of spins in a magnetic field to reverse the spin directions. The
intensity of a Bragg peak is measured with the flipper alternately turned off and on.
The ratio of these intensities in a centrosymmetric crystal is given by Eq. (6),
where the last equality assumes that $F_M \ll F_N$.

$$R = \frac{I_{\mathrm{on}}}{I_{\mathrm{off}}} = \left(\frac{F_N + F_M}{F_N - F_M}\right)^2 \approx 1 + 4\,\frac{F_M}{F_N} \qquad (6)$$

This greatly enhances the relative value of magnetic signal over the $(F_M/F_N)^2$
that is expected in general for unpolarized neutron scattering. The technique has
been widely used for examining magnetization densities using Bragg peaks in 
single crystals, for studying short-range magnetic order in amorphous materials 
and binary alloys, and for the small-angle scattering examination of magnetic in- 
homogeneities on nanometer length scales (see, for example, the separation of 
chemical and magnetic structure due to precipitation in an amorphous Ni-P al- 
loy by Tatchev et al. 2005). The study of larger (micron)-scale magnetic inhomoge- 
neities can be performed with polarized super ultra-small angle neutron scatter- 
ing (SUSANS), using multiple-bounce channel cut crystals to limit the beam 
divergence and to measure precisely the scattering angle (Wagh et al. 2005). 
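
The gain from the flipping-ratio method can be made concrete with a small numerical sketch. It assumes the standard centrosymmetric form of the flipping ratio, R = ((F_N + F_M)/(F_N − F_M))², with real nuclear and magnetic structure factors; the function names and the sample values are illustrative, not taken from any measurement.

```python
def flipping_ratio(f_n: float, f_m: float) -> float:
    """Exact flipping ratio for real structure factors F_N (nuclear), F_M (magnetic)."""
    if f_n == f_m:
        raise ValueError("F_N and F_M must differ (denominator vanishes)")
    return ((f_n + f_m) / (f_n - f_m)) ** 2


def flipping_ratio_linearized(f_n: float, f_m: float) -> float:
    """First-order approximation, valid only when F_M << F_N."""
    return 1.0 + 4.0 * f_m / f_n


def unpolarized_magnetic_fraction(f_n: float, f_m: float) -> float:
    """Relative magnetic intensity (F_M / F_N)^2 seen with unpolarized neutrons."""
    return (f_m / f_n) ** 2


if __name__ == "__main__":
    f_n, f_m = 1.0, 0.01  # hypothetical values: magnetic amplitude is 1% of nuclear
    r_exact = flipping_ratio(f_n, f_m)
    print(f"exact R        = {r_exact:.5f}")
    print(f"linearized R   = {flipping_ratio_linearized(f_n, f_m):.5f}")
    print(f"unpolarized    = {unpolarized_magnetic_fraction(f_n, f_m):.1e}")
    print(f"signal gain    ~ {(r_exact - 1.0) / unpolarized_magnetic_fraction(f_n, f_m):.0f}x")
```

For a magnetic amplitude that is 1% of the nuclear amplitude, the polarized measurement changes the intensity by about 4%, whereas the unpolarized magnetic contribution is only of order 10⁻⁴.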

More detailed information can be obtained from polarized magnetic neutron 
scattering by using a triple-axis spectrometer with a spin flipper and polarizer in 
the scattered beam as well as in the incident beam. The term $U_j^{ss'}$ in the
neutron-scattering cross-section gives the probability that a neutron will change its spin
direction on scattering. In analyzing this probability, it is convenient to designate
directions relative to the neutron polarization in a right-handed cartesian coordinate
system (ξ, η, ζ), with the initial polarization in the ξ direction (Moon et al.
1969). Then the terms for spin-flip and non-spin-flip scattering become those in
Eqs. (7).

$$U_{\text{no-flip}} = b \mp p\,\mu_{\perp\xi}$$

$$U_{\text{flip}} = -p\,(\mu_{\perp\eta} \pm i\,\mu_{\perp\zeta}) \qquad (7)$$


We see that the nuclear scattering factor b does not contribute to the spin-flip 
scattering. In the case where q is parallel to the direction of neutron polarization, 
all magnetic scattering is in the spin-flip channel, since $\mu_{\perp\xi} = 0$. If the
monochromator and analyzer crystals are set so that both reflect the same spin orientation,
then setting both flippers either on or off lets the scattered beam component pass 
that has not flipped. Alternatively, setting one flipper off and the other on lets that 
component of the scattered beam pass that has its spin flipped. By separately 
measuring spin-flip and non-spin-flip scattering with the neutron polarization al- 
ternately parallel and perpendicular to q, the individual components of magnetic 
moment can be examined. When measuring the magnetic form factor at Bragg 
peaks, this technique largely avoids the difficulties that occur in unpolarized mag- 
netic neutron scattering when separation of the nuclear and magnetic scattering 
is attempted, even in cases of relatively weak magnetic scattering (see, for example,
the examination of ferrimagnetism in Fe-B metallic glasses by Cowlam et al.
2005). 
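
The separation of spin-flip and non-spin-flip channels described above can be illustrated numerically. The sketch below assumes the form given by Moon et al. (1969) for an incident spin-up neutron, U_no-flip = b − p μ⊥ξ and U_flip = −p(μ⊥η + i μ⊥ζ), and assumes that μ⊥ denotes the part of the moment perpendicular to the scattering vector q. The function names and numerical values are hypothetical.

```python
import math


def perpendicular_moment(moment, q):
    """Return the component of `moment` perpendicular to the scattering vector q.

    Both arguments are (xi, eta, zeta) triples; xi is the polarization direction.
    """
    q_norm = math.sqrt(sum(c * c for c in q))
    if q_norm == 0:
        raise ValueError("scattering vector must be nonzero")
    q_hat = [c / q_norm for c in q]
    along = sum(m * c for m, c in zip(moment, q_hat))
    return [m - along * c for m, c in zip(moment, q_hat)]


def spin_channels(b, p, moment, q):
    """Return (U_no_flip, U_flip) for an incident spin-up neutron (assumed sign convention)."""
    mu_xi, mu_eta, mu_zeta = perpendicular_moment(moment, q)
    u_no_flip = b - p * mu_xi
    u_flip = -p * complex(mu_eta, mu_zeta)
    return u_no_flip, u_flip


if __name__ == "__main__":
    b, p = 1.0, 0.5              # hypothetical nuclear and magnetic scattering strengths
    moment = (0.3, 0.4, 0.5)     # hypothetical moment components along (xi, eta, zeta)
    for label, q in [("q parallel to P", (1, 0, 0)), ("q perpendicular to P", (0, 1, 0))]:
        nsf, sf = spin_channels(b, p, moment, q)
        print(f"{label:>22}: no-flip = {nsf:.3f}, flip = {sf:.3f}")
```

With q parallel to the polarization, the non-spin-flip amplitude reduces to the nuclear term b alone, which is the behavior stated in the text.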

Because of increasing interest in thin-film magnetic structures, in particular 
in giant magnetoresistance (GMR) materials, polarized neutron reflection (PNR) 
has grown significantly more popular since the mid-1990s. When either incident 
polarization is used alone or both incident and transmitted polarization are em- 
ployed, fits of the reflection curves give considerable information about mag- 
netic layer orientation, thickness, and magnetic interface roughness. For exam- 
ple, Singh et al. (2005) have used PNR to examine the magnetic moment of Ni 
in Ni/Cu multilayer structures. In addition to investigating the average vertical 
magnetic structure with the specular reflectivity, the diffuse scattering at low re- 
flected angles gives information about the lateral magnetic structure, such as that 
due to magnetic domains. A review of some of the recent investigations using 
this approach is given by Felcher and te Velthuis (2001). 

Polarized neutron beams can also be used for neutron depolarization (ND) 
studies (for an early review, see Endoh et al. 1992). This technique utilizes the de- 
polarization of an initially polarized beam of neutrons when it transits through a 
ferromagnetic sample. In its most sophisticated form, the entire 3 × 3 matrix is
measured relating incident polarization directions to transmitted ones and is in- 
terpreted in terms of domain size and shape. As an example, Sakarya et al. (2005) 
used the technique to examine domains in the ferromagnetic superconductor 
UGe2, part of an unusual class of materials in which ferromagnetism and
superconductivity apparently coexist.

A very different use of neutron spin is made in neutron spin echo (NSE) spec- 
troscopy: see Mezei (2003). In NSE the spins of polarized neutrons precess in a 
magnetic field both before and after scattering. For a given field strength and 
length, the amount of precession for each neutron depends on its velocity. A π
spin flipper is placed just before the sample so that the second precession exactly 
compensates for the first precession, if the exit velocity is equal to the incident 
velocity. If the scattering is inelastic so that the outgoing neutron velocity is 
slightly changed, then the resulting beam is slightly depolarized. The depolariza- 
tion measurement determines the change in neutron velocity on scattering. A key 
attribute of the technique is that only the change in velocity of each neutron is 
important, so that the overall distribution of energies in the incident neutron 
beam can be much wider than the energy resolution of the experiment. The 
NSE technique gives access to time scales much longer than those of traditional
quasielastic scattering, on the order of 100 ns. It has provided unique information about
slow relaxation in spin glasses (see, for example, Pappas et al. 2003). 
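
The spin echo condition can be sketched with the Larmor precession angle φ = γ B L / v accumulated by a neutron of velocity v in a field B over a path length L. The example below assumes two identical precession regions and an ideal π flipper, so that the net phase is simply the difference of the two precession angles; the field, length, and velocity values are hypothetical.

```python
# Magnitude of the neutron gyromagnetic ratio in rad s^-1 T^-1 (CODATA 2018, rounded).
GAMMA_N = 1.83247171e8


def precession_angle(field_t: float, path_m: float, velocity_m_s: float) -> float:
    """Larmor precession angle (rad) accumulated over one precession region."""
    if velocity_m_s <= 0:
        raise ValueError("velocity must be positive")
    return GAMMA_N * field_t * path_m / velocity_m_s


def echo_phase(field_t: float, path_m: float, v_in: float, v_out: float) -> float:
    """Net phase after two identical arms separated by an ideal pi flipper.

    The flipper reverses the sense of the accumulated angle, so the second arm
    subtracts from the first; the result is zero for elastic scattering.
    """
    return precession_angle(field_t, path_m, v_in) - precession_angle(field_t, path_m, v_out)


if __name__ == "__main__":
    field, length, v0 = 0.1, 2.0, 1000.0  # hypothetical: 0.1 T, 2 m, 1000 m/s
    print(f"single-arm angle: {precession_angle(field, length, v0):.0f} rad")
    print(f"elastic echo phase: {echo_phase(field, length, v0, v0):.3f} rad")
    print(f"phase for a 1e-5 relative velocity change: "
          f"{echo_phase(field, length, v0, v0 * (1 + 1e-5)):.3f} rad")
```

The single-arm angle is tens of thousands of radians, yet a relative velocity change of only 10⁻⁵ produces a measurable net phase. This is why the technique is sensitive to very small energy transfers even when the incident beam has a broad velocity distribution.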

A variation of NSE is spin echo small-angle neutron scattering (SESANS), 
which uses the spin echo principle to perform elastic small-angle scattering. It 
utilizes the decrease in polarization resulting from the change in neutron preces- 
sion along the altered path of the scattered beam through the second magnetic 
field. Thus, just as NSE has an effective energy resolution narrower than the energy
width of the incident neutron beam, SESANS has an effective angular resolution
smaller than the divergence of the neutron beam. Length scales up to a
micron are accessible. Although the technique was originally developed for non- 
magnetic applications, it also possesses the capability to examine long length 
scale magnetic inhomogeneities (Grigoriev et al. 2006). In this case, the magnetic 
scattering can itself act as a flipper, allowing the separation of magnetic and nu- 
clear scattering. 


13.1.2.2 Magnetic X-Ray Scattering 

Discussions about the possibilities for magnetic X-ray scattering from materials
began with Platzman and Tzoar (1970), followed by the first elastic scattering
studies by de Bergevin and Brunel (1972), the first magnetic Compton scattering
experiments by Sakai and Ono (1976), and the first resonant scattering studies by
Namikawa et al. (1985). Due to the need for brighter
sources, the field began to develop rapidly only with the growing availability of 
synchrotron X-ray sources in the early 1980s. Since then the field has prospered 
as new sources and approaches continue to be developed. A number of excellent 
reviews on magnetic X-ray scattering are now available (Lovesey and Collins 1996;
Cooper and Stirling 1999; Laundy 1999; Altarelli 2001; Lebech 2002; Beaurepaire 
et al. 2006). 

As discussed in more detail below, magnetic X-ray scattering offers a number 
of complementary capabilities to magnetic neutron scattering. Using polarization 
analysis, the orbital and spin contributions to the form factor of the magnetic 
moment can be separated. The magnetic contributions of individual chemical 
species can be investigated with resonance techniques. The high collimation of 
synchrotron X-ray sources permits the precise study of magnetic periods in com- 
plex structures and of long-wavelength critical fluctuations near phase transitions. 
Meanwhile the high brilliance of these sources also allows the study of magnetism
at surfaces, in thin films, and in nanoparticles. Further, the brilliance of
synchrotron sources facilitates the development of microdiffraction tools. In addition,
there are important elements such as Gd and Sm which have high thermal
neutron absorption cross-sections, complicating their study by neutron scattering. 
It should be noted that “elastic” X-ray scattering integrates over a much wider 
range of energies than do most neutron scattering measurements. It therefore 
usually probes time scales of 10°1° s while neutron scattering usually examines 
magnetic structure on time scales approximately three orders of magnitude lon- 
ger. In most cases, this is not believed to cause any observable differences but it 
should be kept in mind. Finally, magnetic Compton scattering measures the spin 
momentum density, a quantity of fundamental interest which is not accessible to 
neutron scattering. 

The cross-section for nonresonant elastic X-ray scattering from a material can 
be written as Eq. (8) (Blume 1985), where the sums are over electron positions 
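$\mathbf{r}_j$, $\mathbf{p}$ is the electron momentum, and $\mathbf{a}_1$, $\mathbf{a}_2$, and $\mathbf{a}_3$ are polarization-dependent vectors
given by Eqs. (9), in which $\hat{\mathbf{k}}$, $\hat{\mathbf{k}}'$ are unit vectors in the directions of the incident
and scattered photons respectively, and $\hat{\boldsymbol{\varepsilon}}$, $\hat{\boldsymbol{\varepsilon}}'$ are their respective polarization
vectors.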

The first term in Eq. (8) describes the Thomson charge scattering while the
others describe “magnetic” scattering from the electron spin (second term) and 
orbital angular momentum (third term). The magnetic scattering amplitudes are 
lower than that of the charge scattering by a factor $E/m_e c^2$ (photon energy over electron rest energy) of approximately 10⁻²
at typical X-ray energies. They are also imaginary, so that they do not interfere 
with the real part of the charge scattering but only with the typically smaller 
imaginary part (not shown in Eq. (9)), if linearly polarized X-rays are used. 
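
The size of the magnetic scattering amplitude relative to the charge amplitude can be estimated from the ratio of the photon energy to the electron rest energy. The sketch below evaluates only this energy ratio; it deliberately ignores the further reduction caused by the small number of unpaired electrons and by form-factor and polarization factors, so the printed values are upper-level estimates under that assumption.

```python
# Electron rest energy m_e c^2 in keV (CODATA 2018, rounded).
ELECTRON_REST_ENERGY_KEV = 510.99895


def magnetic_amplitude_scale(photon_energy_kev: float) -> float:
    """Ratio E / (m_e c^2) that scales magnetic relative to charge amplitudes."""
    if photon_energy_kev <= 0:
        raise ValueError("photon energy must be positive")
    return photon_energy_kev / ELECTRON_REST_ENERGY_KEV


if __name__ == "__main__":
    for energy_kev in (5.0, 10.0, 30.0):
        scale = magnetic_amplitude_scale(energy_kev)
        # The intensity ratio for a pure magnetic reflection scales as the square.
        print(f"{energy_kev:5.1f} keV: amplitude ratio ~ {scale:.3f}, "
              f"intensity ratio ~ {scale ** 2:.1e}")
```

At 10 keV the amplitude ratio is about 0.02, so pure magnetic intensities are suppressed by several orders of magnitude relative to charge scattering even before the number of magnetic electrons is taken into account.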

When using linearly polarized X-rays the different polarization dependences of 
the second and third terms allow the determination of the individual contribu- 
tions of spin and orbital motion to the total magnetic moment (Blume and Gibbs 
1988). In particular, the matrix elements linking the polarization components
perpendicular to the plane of scattering (σ-polarization) with the components in the
plane of scattering (π-polarization) have nonzero terms in both diagonal and
nondiagonal elements (Eqs. (10), where the scattering vector q is in the z-direction
and the x direction is perpendicular to the y-z plane of scattering).


$$M_{\sigma\sigma} = S_x \sin 2\theta$$

$$M_{\sigma\pi} = 2\sin^2\theta\,[(S_y + L_y)\cos\theta + S_z \sin\theta] \qquad (10)$$

$$M_{\pi\pi} = \sin 2\theta\,[2 L_x \sin^2\theta + S_x]$$
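
The way the polarization channels distinguish spin from orbital contributions can be seen by evaluating the matrix elements of Eqs. (10) directly. The sketch below assumes the reading M_σσ = S_x sin 2θ, M_σπ = 2 sin²θ [(S_y + L_y) cos θ + S_z sin θ], and M_ππ = sin 2θ [2 L_x sin²θ + S_x], with x perpendicular to the scattering plane and q along z; the function name and test vectors are illustrative.

```python
import math


def polarization_matrix_elements(theta_rad, spin, orbital):
    """Evaluate the nonresonant magnetic matrix elements of Eqs. (10).

    `spin` and `orbital` are (x, y, z) components of S and L, with x
    perpendicular to the scattering plane and z along the scattering vector.
    """
    s_x, s_y, s_z = spin
    l_x, l_y, _l_z = orbital  # L_z does not enter these three elements
    sin_t, cos_t = math.sin(theta_rad), math.cos(theta_rad)
    sin_2t = math.sin(2.0 * theta_rad)
    return {
        "sigma_sigma": s_x * sin_2t,
        "sigma_pi": 2.0 * sin_t ** 2 * ((s_y + l_y) * cos_t + s_z * sin_t),
        "pi_pi": sin_2t * (2.0 * l_x * sin_t ** 2 + s_x),
    }


if __name__ == "__main__":
    theta = math.radians(45.0)
    print("pure spin along x:   ", polarization_matrix_elements(theta, (1, 0, 0), (0, 0, 0)))
    print("pure orbital along x:", polarization_matrix_elements(theta, (0, 0, 0), (1, 0, 0)))
```

A pure spin moment along x appears in both the σσ and ππ channels, whereas a pure orbital moment along x appears only in ππ. Comparing the channels therefore separates the two contributions.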


Analysis of the polarization of the scattered beam is usually performed using an 
analyzer crystal diffracting at 90°. Since the scattering is nonresonant, the energy 
of the X-ray beam can typically be chosen to ensure this condition. 
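
Choosing the beam energy so that the analyzer diffracts at 90° follows from Bragg's law. The sketch below assumes a first-order reflection and uses a hypothetical analyzer d-spacing of 1.0 Å; it also evaluates the cos²(2θ) factor that governs how much of the in-plane polarization component is passed by the analyzer, which vanishes at a scattering angle of 90°.

```python
import math

# h*c in keV*angstrom (rounded CODATA-based value).
HC_KEV_ANGSTROM = 12.398419843


def energy_for_scattering_angle(d_spacing_angstrom: float, two_theta_deg: float = 90.0) -> float:
    """Photon energy (keV) at which a first-order Bragg reflection occurs at 2*theta."""
    if d_spacing_angstrom <= 0:
        raise ValueError("d-spacing must be positive")
    theta = math.radians(two_theta_deg) / 2.0
    wavelength = 2.0 * d_spacing_angstrom * math.sin(theta)  # Bragg's law, n = 1
    return HC_KEV_ANGSTROM / wavelength


def in_plane_leakage(two_theta_deg: float) -> float:
    """Relative intensity cos^2(2*theta) passed for in-plane polarization."""
    return math.cos(math.radians(two_theta_deg)) ** 2


if __name__ == "__main__":
    d_spacing = 1.0  # hypothetical analyzer d-spacing in angstrom
    print(f"energy for 2theta = 90 deg: {energy_for_scattering_angle(d_spacing):.3f} keV")
    for angle in (80.0, 85.0, 90.0):
        print(f"2theta = {angle:4.1f} deg: in-plane leakage = {in_plane_leakage(angle):.4f}")
```

Because the leakage grows quadratically with the deviation from 90°, a small mismatch between the available analyzer reflections and the chosen energy is usually tolerable, but the cleanest polarization analysis is obtained when the energy is tuned to the exact condition.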

In cases where the symmetry of the magnetic moment is the same as the 
charge symmetry (as in a ferromagnet) and when linearly polarized X-rays are 
used, it is possible, but often difficult, to separate the nonresonant magnetic scat- 
tering signal from the much larger charge scattering by reversing the direction of 
an applied magnetic field or by analyzing changes in polarization. However, non- 
resonant magnetic X-ray scattering has been most often used in cases where 
there are specific Bragg reflections primarily due to the magnetic structure. In 
the early work of Gibbs et al. (1988), this approach was used to identify orbital 
and spin contributions in Ho, which has a large magnetic moment (10.6 μ_B) and
a spiral antiferromagnetic structure. Despite the small cross-section for non- 
resonant magnetic X-ray scattering, the increasing availability of high-brilliance 
beams from third-generation synchrotron sources has broadened its range of use.
In one interesting application, microfocused nonresonant magnetic scattering 
was used to image spin density wave domains in Cr on the micron scale (Evans 
et al. 2002). In another case, nonresonant magnetic X-ray scattering has been 
used to investigate the differences between bulk and near-surface ordering in 
NdCu2 alloys (Schneidewind et al. 2001).

The use of relatively high-energy X-rays (40-150 keV) offers a particular set of 
possibilities for nonresonant magnetic X-ray scattering studies. The relatively 
small penetration depths (on the order of microns) of typical hard 5-30 keV 
X-rays for most magnetic materials means that, in contrast to magnetic neutron 
scattering, only the near-surface regions of bulk samples are examined. While 
this can be an advantage for those studies interested in the near-surface region, 
it also means that the measurements may be sensitive to surface preparation 
and quality, a disadvantage if the primary subject of interest is the bulk alloy be- 
havior. Using high-energy X-rays, experiments can be performed in transmission 
geometry, helping to ensure that the scattering originates from a bulk phenome- 
non. In addition, in this limit, the contribution of orbital scattering to the mag- 
netic scattering cross-section becomes negligible and thus only the spin compo- 
nent perpendicular to the scattering plane is measured. By comparing magnetic 
neutron and high-energy X-ray magnetic scattering, the orbital and spin structure 
factors can be separated without using polarization measurements, as has been 
shown by Strempfer et al. (2000) in the case of Cr. 

An alternative to using the small signals produced by the nonresonant magnet- 
ic scattering with linear polarization is to use circularly polarized X-rays. These 
can be obtained using a synchrotron beam slightly outside the orbital plane, with 
a phase plate or an appropriate insertion device such as an elliptical multipole 
wiggler. Circularly polarized X-rays produce interference between the real part of 
the charge scattering and the magnetic scattering, significantly enhancing the 
contribution of the magnetic scattering. Although the magnetic scattering then 
occurs at the same place in reciprocal space as the charge scattering, it is possible
to moderate the influence of charge scattering. This can be done, for instance,
by diffracting near the horizontal direction at a synchrotron to take advantage of 
the suppression of charge scattering by polarization, by examining changes in the 
observed scattering under reversal of photon helicity, or by reversal of an applied 
magnetic field. By utilizing a white beam and an energy-dispersive detector, this 
approach has allowed the very efficient measurement of the magnetic form factor 
in ferromagnets such as Fe (Laundy et al. 1991; Collins et al. 1992; Ito and 
Hirano 1997), HoFe2 (Collins et al. 1993), and Ni (Laundy et al. 1998). Other
experimenters have used a monochromated beam with circular polarization to 
determine the temperature dependences of the spin and orbital moments in a 
ferromagnetic Sm-Gd-Al alloy (Taylor et al. 2002). An alternative approach to 
mixing the charge and magnetic scattering to increase the magnetic scattering 
intensity would be to examine noncentrosymmetric structures, since the struc- 
ture factor there is complex. However, this approach has apparently not been 
attempted yet. 
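
The benefit of interference between charge and magnetic scattering can be illustrated with a deliberately schematic model. The sketch below assumes real amplitudes f_c (charge) and f_m (magnetic) and sets all geometric and polarization factors to unity: with circular polarization of helicity h = ±1 the amplitudes add as f_c + h f_m, while with linear polarization the magnetic amplitude is in quadrature and contributes only f_m² to the intensity. These simplifications are assumptions made for illustration, not the full cross-section.

```python
def intensity_circular(f_c: float, f_m: float, helicity: int) -> float:
    """Schematic intensity with circular polarization: amplitudes interfere."""
    if helicity not in (+1, -1):
        raise ValueError("helicity must be +1 or -1")
    return (f_c + helicity * f_m) ** 2


def intensity_linear(f_c: float, f_m: float) -> float:
    """Schematic intensity with linear polarization: no interference term."""
    return f_c ** 2 + f_m ** 2


def helicity_asymmetry(f_c: float, f_m: float) -> float:
    """Asymmetry (I+ - I-) / (I+ + I-) under reversal of photon helicity."""
    i_plus = intensity_circular(f_c, f_m, +1)
    i_minus = intensity_circular(f_c, f_m, -1)
    return (i_plus - i_minus) / (i_plus + i_minus)


if __name__ == "__main__":
    f_c, f_m = 1.0, 0.01  # hypothetical amplitudes
    print(f"helicity asymmetry:          {helicity_asymmetry(f_c, f_m):.4f}")
    print(f"magnetic fraction (linear):  {f_m ** 2 / intensity_linear(f_c, f_m):.1e}")
```

In this toy model the helicity asymmetry is first order in f_m/f_c (about 2% here), while the magnetic contribution without interference is second order (about 10⁻⁴). This is the motivation for reversing the helicity or the applied field.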

In addition to the terms shown in Eq. (8), there enters a term A · p from perturbation
theory which gives rise to photoelectric absorption in its first order. Here A
is the vector potential of the electromagnetic field. Absorption spectroscopy,
particularly magnetic circular and linear dichroism spectroscopy, has become a very
powerful tool for measuring the spin-dependent density of states (see the review
by Wende 2004). However, we confine ourselves here to a discussion of scattering
techniques. The second-order A · p term contribution to the elastic scattering
amplitude [expression (11)]


(11) 
becomes dominant in resonance conditions when the incident photon energy is
comparable to the difference in energy between the ground state and a virtual excited
state. Here the state of the electron system is |i⟩ and the sum is over virtual
excited states |o⟩. The cases of interest are virtual excitations between a core and
an unfilled valence shell. Although the resonant elastic scattering term above does 
not explicitly include a dependence on magnetic moment, it is sensitive to mag- 
netic order through differences in cross-section that depend on the electron spin 
states. In the simplest cases, this can arise from the exchange splitting of the va- 
lence states probed. Resonant magnetic scattering can often be observed in cases 
where its observation by nonresonant techniques would have been difficult or 
impossible. In addition, using photon energies near the absorption edges of 
different elements, the magnetic contribution of those elements can be probed in- 
dividually, a unique capability. Moreover, relatively small samples can typically be 
studied using resonance enhancement. This has facilitated the study of, for instance,
samples under high pressure. As an example, Kernavanois et al. (2005)
have examined the resonant magnetic scattering from Ce(CoₓFe₁₋ₓ)₂ at pressures
up to nearly 10 kbar. 

Resonant scattering is large near absorption edges and is particularly sensitive 
to magnetic order in the partially unfilled 3d shell states in the 3d transition metals,
4f states in lanthanides and 5f states in actinides. In the electric dipole
approximation the change in electron orbital momentum is ±1 while the higher-order
electric quadrupole term gives a change of ±2. Thus the strongest resonant
contributions (coming from the electric dipole term) would be expected to occur
near 3d transition metal L2,3 edges (2p→3d resonance), lanthanide M4,5 edges
(3d→4f resonance), and actinide N4,5 edges (4d→5f resonance). However, most of
these edges are in the soft X-ray regime. The short absorption length of soft
X-rays in air necessitates the use of specialized vacuum spectrometers for such
experiments. Resonant diffraction experiments with hard X-rays must instead utilize
the transition metal K-edge (accessing primarily the 1s→4p resonance),
lanthanide L-edges (accessing primarily the 2p→5d resonance) and actinide M-edges
(accessing primarily the 3d→5f resonance). In the case of the transition metals
and the lanthanides, hybridization of the probed valence states with the polarized 
3d or 4f states is believed to play the dominant role in creating a magnetic resonance
effect.
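
The assignment of edges to dipole or quadrupole resonances follows from the change in orbital quantum number between the core level and the intermediate state. The sketch below applies the simplified rule used in the text (a change of 1 for electric dipole, a change of 2 for electric quadrupole); it is a bookkeeping aid with hypothetical function names, not a calculation of transition strengths.

```python
# Orbital quantum number for each subshell letter.
ORBITAL_L = {"s": 0, "p": 1, "d": 2, "f": 3}


def transition_type(core_level: str, final_level: str) -> str:
    """Classify a transition such as '2p' -> '3d' by its change in orbital momentum."""
    delta_l = abs(ORBITAL_L[final_level[-1]] - ORBITAL_L[core_level[-1]])
    if delta_l == 1:
        return "electric dipole"
    if delta_l == 2:
        return "electric quadrupole"
    return "not covered by the simplified rule"


if __name__ == "__main__":
    transitions = [
        ("3d metal L2,3 edge", "2p", "3d"),
        ("lanthanide M4,5 edge", "3d", "4f"),
        ("3d metal K edge", "1s", "4p"),
        ("3d metal K edge (to 3d)", "1s", "3d"),
        ("lanthanide L edge", "2p", "5d"),
        ("lanthanide L edge (to 4f)", "2p", "4f"),
    ]
    for label, core, final in transitions:
        print(f"{label:>26}: {core}->{final} is {transition_type(core, final)}")
```

The output makes the point of the preceding paragraph explicit: at hard X-ray edges the dipole transition reaches 4p or 5d states rather than the magnetic 3d or 4f states, which are reached directly only through the weaker quadrupole channel.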

Resonant magnetic scattering experiments near K-edges are not strongly sensi- 
tive to the partially empty valence 3d states and hence resonance effects there are 
usually relatively modest, particularly if no polarization analysis is used, as seen 
in the early study of ferromagnetic Ni by Namikawa et al. (1985). Although relative
signal rates are not large, polarization analysis can show significant enhancements
of σ→π scattering near K-edges. This has been utilized in the study of the
spin density wave state in Cr (Mannix et al. 2001a), though its widest use has been in the
study of magnetic, charge, and orbital ordering in transition metal oxides (see 
the review by Hill 2001). Interestingly, large resonances have been observed at 
the K-edges of nonmagnetic elements in U antiferromagnets (Mannix et al. 
2001b). Though a detailed understanding is still controversial, the phenomenon 
opens new possibilities for K-edge resonant experiments. 

Diffraction near lanthanide L- and actinide M-edges often offers the possibility 
of larger resonant enhancements in the hard X-ray regime. In the first observa- 
tions of resonant scattering by Gibbs et al. (1988), a strong resonance was seen at 
a Ho L-edge. The exact nature of the enhancement appears to be complex (van 
Veenendaal et al. 1997). Even larger resonant effects are typically found near the 
M4,5 edges of actinides where a 3d electron can make a virtual transition into
a partially unoccupied polarized 5f shell. The nature of the resonance scattering 
complicates quantitative calculation of the magnetic moments themselves. How- 
ever, hard X-ray resonant scattering at L- and M-edges has been widely used to 
examine specific chemical aspects of magnetic ordering, including the nature of 
magnetic phase transitions in bulk alloys (Pengra et al. 1994; Vigliante et al. 1998; 
Hupfeld et al. 2000; Lidstrom et al. 2000; Goff et al. 2001; Stunault et al. 2001, 
2002, 2004; Okuyama et al. 2004; Good et al. 2005; Kuzishita et al. 2005), the mi- 
crodomain structure and anisotropy in thin films (Beutier et al. 2004), complex 
behavior in multilayers (Nelson 1999; Jaouen 2002; Marrows et al. 2005), magnet- 
ic order in heavy-fermion thin films (Jourdan et al. 2005), and magnetism at sur- 
faces (Ferrer et al. 1997). 

As discussed above, the strongest magnetic resonances are in the soft X-ray re- 
gime, at the relevant transition metal L-edges, the lanthanide M-edges, and the 
actinide N-edges. The huge enhancements in this regime (by a factor of up to 
10⁷ at actinide N-edges) allow the magnetism of surfaces and extremely thin
films to be studied. Taking advantage of these opportunities requires working 
completely in a vacuum environment to avoid X-ray absorption in the air; this 
considerably complicates experiments. Because of the low scattering wavenum- 
bers accessible, resonant soft X-ray magnetic scattering studies have often fo- 
cused on magnetic structure on nanometer length scales via scattering or on 
surface/interface roughness via reflectivity. As with hard X-ray magnetic scatter- 
ing, polarization analysis of the scattered photons can help separate charge and 
magnetic scattering. Also, circularly polarized X-rays can be used to produce in- 
terference between the charge and magnetic scattering. The reversal of helicity 
can then be used to help isolate the interference term between charge and magnetic
scattering. However, for the case of reflectivity at least, clearly interpreting
the observed scattering in terms of magnetic and atomic structure can be tricky 
(Osgood et al. 1999; Kortright and Kim 2000). Recent studies include that of 
Dudzik et al. (2000) examining striped magnetic domains using the Fe L3-edge 
in FePd. There, varying the helicity of the circularly polarized X-rays allowed the 
investigators to determine the structure of closure domains. A number of experi- 
ments have meanwhile been examining Co/Pt multilayers whose structure and 
magnetic interactions can be tuned over a significant range (Kortright et al. 
2001; Hellwig et al. 2003; Pierce et al. 2003). Other investigations have studied 
stripe domains in Co/Pt multilayers using a transmission geometry (Kortright 
et al. 2001), the magnetism of coupled layers in spin-valve and giant magnetore- 
sistance materials (Hase et al. 2000; Mirone et al. 2000), magnetic heterogeneity 
in a recording media alloy (Kortright et al. 2003), magnetic correlations in dense 
nanoparticle assemblies (Kortright et al. 2005), separation of chemical and mag- 
netic interfacial roughness in CoFe thin films (Freeland et al. 1999; Kelly et al. 
2002), thickness dependence of the magnetic behavior of ultrathin films 
(Weschke et al. 2005), and the magnetization of buried interfaces in Co films 
(MacKay et al. 1996). 

The enhanced scattering from soft X-ray resonance conditions has been used 
to observe coherent magnetic scattering. Yakhou et al. (2001) were able to follow 
domain wall fluctuations in UAs and Chesnel et al. (2002) could examine trans- 
verse strip domain morphology in FePd wires. As discussed further in the section 
on coherent scattering, Pierce et al. (2003) were able to use coherent magnetic 
scattering to determine the extent of microscopic return point memory in Pt/Co 
multilayers. Even the imaging of magnetic domains in a Co/Pt multilayer using 
lensless holography has been demonstrated by Eisebitt et al. (2004). 

Inelastic magnetic X-ray scattering is another method being applied. While ex- 
tensive use has been made of X-ray emission spectroscopy to examine magnetic 
structure, we will confine ourselves here to true inelastic scattering processes; 
most of these experiments use magnetic Compton scattering. Compton scatter- 
ing uses energy transfers that are large compared to valence electron binding 
energies so that electron motion during the interaction is negligible - this is es- 
sentially the impulse approximation. Therefore magnetic Compton scattering is 
sensitive only to the spin of the electrons and experiments using circularly polar- 
ized X-rays can measure the projection of the magnetic Compton momentum 
profile in a given direction [Eq. (12)]. 


$$J_{\mathrm{mag}}(p_z) = \iint \left[ n_\uparrow(\mathbf{p}) - n_\downarrow(\mathbf{p}) \right] dp_x\, dp_y \qquad (12)$$


This is a unique capability, as the magnetic momentum density, which is not ac- 
cessible to neutron scattering, can often be compared directly with theoretical pre- 
dictions. The cross-section for spin magnetic Compton scattering is given by Eq. 
(13), where $P_c$ is the degree of circular polarization.

The subscript refers to the incident photon and the subscript “s” to the scattered
photon. This contribution must be separated from the charge Compton
scattering by reversing the direction of polarization or by reversing the direction 
of an applied field. Typically, the contribution of the magnetic term to the overall 
Compton scattering cross-section is 1% or less. Therefore, high photon statistics 
are required. In order to maximize the signal and momentum resolution, experiments
are typically performed at relatively high X-ray energies (>40 keV) and in a
backscattering geometry.
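
The need for high photon statistics can be quantified with a simple counting estimate. The sketch below assumes pure Poisson statistics and a signal of the form I± = N(1 ± f), where f is the fractional magnetic contribution and the sign follows the helicity or field direction. The difference is then D = 2fN with variance 2N, so the relative uncertainty of D is 1/(f√(2N)). Background and systematic errors are ignored, and the function name is illustrative.

```python
def counts_per_setting(magnetic_fraction: float, relative_precision: float) -> float:
    """Counts N needed in each helicity (or field) setting.

    Solves 1 / (f * sqrt(2 N)) = r for N, where f is the fractional magnetic
    contribution and r is the desired relative precision of the difference signal.
    """
    if magnetic_fraction <= 0 or relative_precision <= 0:
        raise ValueError("both arguments must be positive")
    return 1.0 / (2.0 * magnetic_fraction ** 2 * relative_precision ** 2)


if __name__ == "__main__":
    for fraction in (0.01, 0.005):
        for precision in (0.10, 0.01):
            n_counts = counts_per_setting(fraction, precision)
            print(f"f = {fraction:.3f}, precision = {precision:.0%}: "
                  f"N ~ {n_counts:.1e} counts per setting")
```

For a 1% magnetic contribution, measuring the difference signal to 1% precision requires on the order of 5 × 10⁷ counts in each setting, and this must be achieved in every momentum channel of the profile.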

Sakai and Ono (1976) performed the first experiments in the field (using a ra- 
dioactive source) and Sakai (1996) has reviewed the development of the method. 
As an example of more recent work utilizing synchrotron radiation, Cooper et al. 
(1993) have measured the spin-resolved Compton profile of HoFe2. The differing
lineshapes expected for the Ho and Fe momentum distributions were used to an- 
alyze the significantly different behaviors of the spin moments of the atoms as a 
function of temperature. In another study, Lawson et al. (1997) were able to take 
advantage of the insensitivity of magnetic Compton scattering to orbital moments 
to measure the spin moment on uranium. The spin moment is antiparallel to the 
orbital moment in this case, resulting in only a small net moment, which compli- 
cated previous studies by other techniques. In transition metals, a particular ad- 
vantage of magnetic Compton scattering compared to elastic scattering is its 
sensitivity to delocalized valence electron spin moments. It has been used, for in- 
stance, in studies of Ni which are able to challenge current theoretical under- 
standing (Dixon et al. 1998). In a ferromagnetic Gd-Y alloy, Duffy et al. (2000) 
have examined induced spin polarization, while others have studied the boron 
contribution to the spin moment in amorphous Fe-B alloys (Taylor et al. 2001), 
the spin momentum densities in Fe (Montano et al. 2000), in Pd-Co alloys 
(Taylor et al. 2002a), in invar (Fe3Pt) (Srajer et al. 1999; Taylor et al. 2002b), in a 
Ce-Fe-Ru alloy (Ahuja et al. 2002), in the Ni2MnSn Heusler alloy (Deb et al.
2001), in Fe3Si and Fe3Al alloys (Zukowski et al. 2000), and in CeRh3B2 (Sakurai
et al. 2003), spin transitions in a Ce-Fe-Ru alloy (Sharma et al. 2005), and the 
metamagnetic transition in UCoAl (Tsutsui et al. 2005).


13.1.3 
Spectroscopy 


In this section, spectroscopic methods are discussed which provide some of the 
most complete dynamic structural information available. Section 13.1.3.1 is de- 
voted to coherent X-ray scattering; Section 13.1.3.2 deals with studies of phonon- 
excitation spectra by X-ray scattering. Both methods have been employed to an 
increasing extent in recent years. Section 13.1.3.3 is about methods which have 
been present for significantly longer in the scientific community, though they 
are no less interesting: quasielastic scattering of X-rays applied to diffusion
investigations in various ordered and disordered materials.


13.1.3.1 Coherent Time-Resolved X-Ray Scattering 

The continuing increase in brilliance of synchrotron X-ray sources has allowed 
the development of “coherent” X-ray scattering as a technique for examining dy- 
namics and kinetics in metallic alloys. Traditional X-ray scattering uses photon 
beams that have effective coherence lengths significantly smaller than the size of 
the beam itself. That is, the X-ray wave vector k varies among different incident 
photons so that diffraction from points on the sample that are separated beyond 
some coherence length (longitudinal and transverse) smaller than the beam size 
does not interfere at the detector, but rather adds incoherently. The availability of 
high-brilliance hard X-ray beams from third-generation synchrotron sources has 
opened the possibility of making small X-ray beams (typically of order 10 μm in
diameter) that exhibit significant coherence throughout the scattering volume. 
Here we call the use of such beams in scattering experiments “coherent” X-ray 
scattering, though traditional X-ray diffraction experiments also rely on a more 
limited coherence to create interference between scattering from objects that are 
not too far apart spatially (i.e., within the limits set by the traditional “resolution 
function”). The use of coherent X-ray beams has also been termed X-ray intensity 
fluctuation spectroscopy (XIFS) or X-ray photon correlation spectroscopy (XPCS). 
While currently available hard X-ray beams do not exhibit the level of coherence 
available from optical lasers, they nonetheless enable some of the techniques tra- 
ditionally used in the optical regime to be used now in the hard X-ray regime. In 
particular, the use of “coherent” X-rays offers the potential to examine dynamics, 
kinetics and microscopic reversibility on length scales of $10^{-1}$ to $10^{2}$ nm in metallic
alloys. 

It has now been just over a decade since the first coherent hard X-ray scattering 
experiments were performed (Sutton et al. 1991) and there have been continued 
technical and conceptual advances in the technique. However, the limitations on 
coherent hard X-ray flux available have restricted the application of coherent X-ray 
scattering to a limited number of situations in which the coherent scattered 
intensity is sufficiently high for its evolution to be monitored accurately on the 
time scales of interest. Thus, while there have been some coherent X-ray studies 
of metallic alloys, most experiments using the technique have so far focused on 
macromolecular materials that have high effective scattering power (a large num- 
ber of atoms within each macromolecule and low absorption) and relatively long 
time constants. 

On a typical undulator beamline at a third-generation synchrotron source, the 
effective size of the electron beam serving as the X-ray source is on the order of 
1000 µm × 20 µm (H × V) FWHM. The flux and coherence available for a scattering
experiment depend on the source and beamline optics. Typical fluxes through
a 10 µm pinhole located $R \approx 50$ m from the source are on the order of
$10^{10}$ photons s$^{-1}$ using a crystal monochromator. This geometry gives a
transverse coherence length in each direction of $\xi_t \approx \lambda R/(2s)$,
where $s$ is the appropriate source size and $\lambda$ is the X-ray wavelength.
Typical transverse coherence areas are then on the order of 5 µm × 500 µm
(H × V) FWHM and coherent fractions are 20-30% in beams of 10 µm diameter.

The longitudinal coherence length $\xi_l \approx \lambda^2/(2\Delta\lambda)$, where
$\Delta\lambda$ is the beam's spread in wavelength, is also an important parameter.
For a typical crystal monochromator, the wavelength spread $\Delta\lambda/\lambda$ is
about $10^{-5}$ to $10^{-4}$, so that $\xi_l$ is a few microns.
Thus, if photon pathlengths for scattering processes from different atoms differ 
by more than a few microns, the scattering contributions will add incoherently 
at the detector. This is not usually a problem for small-angle scattering experi- 
ments, because pathlength differences are small. In that case, the energy resolu- 
tion can actually be relaxed significantly from the crystal monochromator value, 
for example by using a multilayer monochromator. For wide-angle scattering in 
a symmetric reflection geometry at incident angle $\theta$, however, the pathlength
difference will typically be as large as $2 l_{abs} \sin^2\theta$, where $l_{abs}$ is
the X-ray absorption length in the material. Thus for typical wide-angle
scattering, $l_{abs}$ should be less than about 10 µm.
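
As a rough numerical check of the two coherence-length estimates above, the following Python sketch evaluates them for illustrative inputs. The wavelength of 0.1 nm, the 30° incidence angle, and all function and variable names are assumptions made for this example; the source sizes, source distance, and bandwidth are the typical values quoted in the text.

```python
import math

def transverse_coherence_length(wavelength, distance, source_size):
    """Transverse coherence length ~ lambda * R / (2 s); all lengths in metres."""
    return wavelength * distance / (2.0 * source_size)

def longitudinal_coherence_length(wavelength, relative_bandwidth):
    """Longitudinal coherence length ~ lambda^2 / (2 d_lambda) = lambda / (2 d_lambda/lambda)."""
    return wavelength / (2.0 * relative_bandwidth)

def reflection_path_difference(absorption_length, theta):
    """Largest pathlength difference 2 * l_abs * sin^2(theta); theta in radians."""
    return 2.0 * absorption_length * math.sin(theta) ** 2

WAVELENGTH = 1.0e-10   # m, assumed (roughly 12 keV photons)
DISTANCE = 50.0        # m, source-to-pinhole distance quoted in the text

xi_h = transverse_coherence_length(WAVELENGTH, DISTANCE, 1000e-6)  # horizontal
xi_v = transverse_coherence_length(WAVELENGTH, DISTANCE, 20e-6)    # vertical
xi_l = longitudinal_coherence_length(WAVELENGTH, 1.0e-4)
path = reflection_path_difference(10e-6, math.radians(30.0))

print(f"transverse (H, V): {xi_h:.2e} m, {xi_v:.2e} m")
print(f"longitudinal: {xi_l:.2e} m, pathlength difference: {path:.2e} m")
```

Comparing `path` with `xi_l` for a given material and angle shows directly whether the wide-angle condition discussed above is met.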

In order to work in the near field of the pinhole of diameter $d$ defining the
beam, the sample must be placed a distance $L_1$ from the pinhole with
$L_1 \ll d^2/\lambda$. In order to measure the scattering as a function of momentum
transfer $\hbar q$, the scattered photons must be detected in the far field, so
that the distance $L_2$ between sample and detector must satisfy
$L_2 \gg h^2/\lambda$, where $h$ is a projected beam size on the sample. Coherent
scattering from a material without long-range order gives a stochastic speckle
pattern with typical speckle sizes at the detector of about 30 µm.
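
The near-field and far-field conditions above can be made concrete with a short Python sketch. The wavelength (0.1 nm), the 3 m detector distance, and the function names are assumptions for illustration. The speckle-size formula `wavelength * L2 / h` is a commonly used estimate that does not appear in the text, so it should be read as an assumption as well.

```python
def fresnel_distance(aperture_size, wavelength):
    """Characteristic distance a^2 / lambda separating near field from far field (m)."""
    return aperture_size ** 2 / wavelength

def speckle_size(wavelength, detector_distance, beam_size):
    """Common estimate lambda * L2 / h of the speckle size at the detector (assumed)."""
    return wavelength * detector_distance / beam_size

WAVELENGTH = 1.0e-10   # m, assumed
PINHOLE = 10e-6        # m, pinhole diameter d quoted in the text
BEAM_SIZE = 10e-6      # m, projected beam size h (assumed equal to d)

l1_limit = fresnel_distance(PINHOLE, WAVELENGTH)    # L1 must be much smaller
l2_limit = fresnel_distance(BEAM_SIZE, WAVELENGTH)  # L2 must be much larger
speckle = speckle_size(WAVELENGTH, 3.0, BEAM_SIZE)  # 3 m detector distance, assumed

print(f"L1 << {l1_limit:.2f} m, L2 >> {l2_limit:.2f} m, speckle ~ {speckle:.1e} m")
```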

To collect scattering signals efficiently with minimal background, direct- 
illumination deep-depletion layer CCD X-ray area detectors are in many ways 
ideal if the time scales of interest are sufficiently slow. When count rates are low 
and it is desirable to do so, data can be analyzed in a photon counting mode us- 
ing a droplet algorithm (Livet et al. 2000) to identify individual photons and even 
to energy-discriminate among scattered photons with a typical resolution of about 
190 eV. In this mode, correlations are usually calculated post facto. Alternatively, 
at the fastest time scales, fast avalanche photodiode point detectors have been 
used in conjunction with digital autocorrelators to give 50 ns temporal resolution 
(Sikharulidze et al. 2002). 


13.1.3.1.1 Homodyne X-Ray Studies of Equilibrium Fluctuation Dynamics 

Since the speckle pattern in a coherent scattering experiment is a function of the 
position of each scatterer in the sample, it changes with time according to Eq. 
(14) if the scatterers are moving. 


$$I(q,t) = \left|\sum_i f_i\, e^{i\mathbf{q}\cdot\mathbf{r}_i(t)}\right|^2 = \rho(q,t)\,\rho^*(q,t) \tag{14}$$








Here $f_i$ is the scattering factor for the $i$th atom, $\mathbf{r}_i(t)$ is its position at time $t$, and
$\rho(q,t)$ is (in the case of hard X-rays) the Fourier transform of the electron density.




This changing intensity is the basis of the widely used dynamic light scattering 
(DLS) technique, performed with optical lasers. In homodyne optical DLS, the dy- 
namics of macromolecules or clusters on the micron size scale is examined by 
measuring the decay of the intensity autocorrelation function of the speckle pat- 
tern. This gives the second-order correlation function Eq. (15). 


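$$g_2(q,t) = \frac{\langle I(q,t')\,I(q,t'+t)\rangle_{t'}}{\langle I(q,t')\rangle_{t'}^{2}} = \frac{\langle \rho(q,t')\,\rho^*(q,t')\,\rho(q,t'+t)\,\rho^*(q,t'+t)\rangle_{t'}}{\langle \rho(q,t')\,\rho^*(q,t')\rangle_{t'}^{2}} \tag{15}$$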


While $g_2(q,t)$ is directly accessible from the intensity autocorrelation
function, theory usually calculates the first-order correlation function
$g_1(q,t) = \langle \rho(q,t')\,\rho^*(q,t'+t)\rangle_{t'} / \langle I(q,t')\rangle_{t'}$.
If fluctuations are Gaussian, however, $g_2(q,t)$ is related to $g_1(q,t)$ by Eq. (16).


$$g_2(q,t) = 1 + |g_1(q,t)|^2 \tag{16}$$


For simple translational diffusion, the correlation functions exhibit an
exponential decay $\exp(-q^2 D t)$, where $D$ is the diffusion constant. More
generally, the decay of the normalized autocorrelation functions reveals how
fluctuations grow and decay on length scales $2\pi/q$.
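
To illustrate how the intensity autocorrelation function is obtained in practice and how it relates to the first-order function for Gaussian fluctuations, here is a minimal Python sketch on synthetic data. The amplitude generator (a discrete first-order autoregressive process), the value chosen for the product of $q^2 D$ and the time step, and all function names are assumptions for illustration; they are not the analysis used in the cited experiments.

```python
import numpy as np

def intensity_autocorrelation(intensity, max_lag):
    """Return g2 at lags 0..max_lag-1: <I(t') I(t'+t)> / <I(t')>^2."""
    intensity = np.asarray(intensity, dtype=float)
    mean_squared = intensity.mean() ** 2
    g2 = np.empty(max_lag)
    for lag in range(max_lag):
        product = intensity[: intensity.size - lag] * intensity[lag:]
        g2[lag] = product.mean() / mean_squared
    return g2

def simulate_gaussian_amplitude(n_steps, decay_per_step, rng):
    """Synthetic complex Gaussian amplitude with g1(k) = exp(-decay_per_step * k)."""
    a = np.exp(-decay_per_step)
    noise = (rng.standard_normal(n_steps)
             + 1j * rng.standard_normal(n_steps)) / np.sqrt(2.0)
    rho = np.empty(n_steps, dtype=complex)
    rho[0] = noise[0]
    for k in range(1, n_steps):
        # Each step keeps a fraction a of the old amplitude and adds fresh noise
        rho[k] = a * rho[k - 1] + np.sqrt(1.0 - a * a) * noise[k]
    return rho

rng = np.random.default_rng(0)
q2_d_dt = 0.05   # assumed value of q^2 * D * (time step)
rho = simulate_gaussian_amplitude(200_000, q2_d_dt, rng)
g2 = intensity_autocorrelation(np.abs(rho) ** 2, max_lag=200)

# Expected curve for simple diffusion combined with the Gaussian relation
expected = 1.0 + np.exp(-2.0 * q2_d_dt * np.arange(200))
print("largest deviation from 1 + |g1|^2:", np.max(np.abs(g2 - expected)))
```

The measured curve starts near 2 and decays to 1 at twice the rate of the amplitude correlation, which is why the decay constant extracted from an intensity measurement must be halved to obtain $q^2 D$.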

The advent of coherent X-ray scattering allows DLS to be extended to the X-ray 
regime so that dynamics in opaque materials can, at least in theory, be examined 
down to atomic length scales. As discussed above, practical limitations restrict the 
technique’s applicability to cases in which the clusters of scattering atoms have a 
large scattering cross-section and the time scales are not too fast. This is
considered in a quantitative manner in an estimate [Eq. (17)] due to G.B. Stephenson
(2003) of the typical count rate $N^{sp}$ that can be expected per speckle; $\sigma_a$ is the
atomic cross-section for elastic scattering, $t$ is the effective sample thickness, $A$ is
the cross-section area of the incident X-ray beam with $N_{inc}$ coherent incident
photons, and the normalized incoherent structure factor is $S(q)$.


$$N^{sp} = N_{inc}\,\frac{\lambda^2 \sigma_a S(q)}{2\pi A t} \tag{17}$$


In many cases, it is the contrast between regions that causes the observed
scattering; then $\sigma_a$ must be replaced by an appropriately weighted difference in
scattering cross-sections between the regions. For typical coherent scattering
experiments using samples containing mostly medium-Z elements (e.g. Cu), we can
estimate that $N^{sp} \approx 1 \times 10^{-8}\, S(q)\, N_{inc}$. A typical incident flux is about
$10^{10}$ photons s$^{-1}$. If we take as a minimum requirement that $N^{sp} \approx 10^{-2}$ per
correlation time in order to perform correlation spectroscopy, then we would need a
normalized structure factor $S(q)$ on the order of 10 in order to measure correlation times
on the order of 1 s. Of course, this is a very approximate calculation, but it can
clearly be seen that for studies of subsecond dynamics, sizeable clusters of atoms
scattering collectively are required.

Consistent with this conclusion, X-ray fluctuation spectroscopy has been
applied to examine the dynamics of relatively large groups of atoms, such as
colloidal nanoparticles, latex spheres in solution, and block copolymer micelles. There
have been fewer coherent X-ray studies of equilibrium dynamics near phase tran- 
sitions in metallic alloys. In a relatively early experiment, Brauer et al. (1995) 
examined the critical fluctuations near the B2-D03 order-disorder transition in
Fe3Al. More recently, Sutton’s group at McGill has revisited the issue, bringing 
significant improvements in beam intensity and stability since the earlier work 
(Sutton 2005). These studies have been able to measure carefully the fluctuation 
dynamics and compare it to theoretical expectations for the first time in an alloy. 
The experiments rely on the relatively large scattering very near the critical point 
due to the large fluctuation length scales (over 500 nm in these investigations), 
and on the relatively slow dynamics in the alloy at this point (tens of seconds). 
Francoual et al. (2003) have also examined the dynamics of phason fluctuations 
in an icosahedral AlPdMn alloy, again comparing the fluctuation dynamics with
theory for the first time. 

In summary, measurements of equilibrium fluctuation dynamics with coherent 
X-ray scattering have developed considerably in the past decade, but intensity con- 
siderations remain quite limiting. Potentially this limitation can be relaxed using 
heterodyne correlation spectroscopy, as discussed next. 


13.1.3.1.2 Heterodyne X-Ray Studies of Equilibrium Fluctuation Dynamics 

The first heterodyne hard X-ray studies of fluctuation dynamics have been re- 
ported recently (Gutt et al. 2003; Livet et al. 2006; Madsen et al. 2004). Hetero- 
dyne studies use a scattered reference beam to interfere with the scattered sample 
beam. If we again denote the scattering amplitudes from the reference and
sample objects as $\rho_r$ and $\rho_s$ respectively, then the measured intensity is given by
Eq. (18).


$$I = |\rho_r + \rho_s|^2 = I_r + 2\,\mathrm{Re}(\rho_r \rho_s^*) + I_s \tag{18}$$


The autocorrelation of the total intensity is then obtained from Eq. (19), where $I_r$
and $I_s$ now refer to averaged quantities.





$$\langle I(q,t')\,I(q,t'+t)\rangle = (I_r + I_s)^2 + 2 I_r I_s\,\mathrm{Re}[g_1(q,t)] + I_s^2\, g_2(q,t) \tag{19}$$


If $I_r \gg I_s$, then the second term will be much larger than the third term, i.e., the
heterodyne signal will be much larger than the available homodyne signal. There
are other advantages to the heterodyne approach as well. The heterodyne
technique is less sensitive to large, slow-moving contaminants (e.g., dust) and it yields
$g_1(q,t)$ more directly.
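
The gain from heterodyning can be illustrated by comparing the second and third terms of Eq. (19) directly. The sketch below is only a numerical illustration: the function name is hypothetical, and the second-order function is taken from the Gaussian relation of Eq. (16), which is an assumption about the sample fluctuations.

```python
def heterodyne_to_homodyne_ratio(i_ref, i_sample, g1):
    """Ratio of the 2*I_r*I_s*Re[g1] term to the I_s^2*g2 term of Eq. (19).

    g2 is taken from the Gaussian relation g2 = 1 + |g1|^2 of Eq. (16).
    """
    g2 = 1.0 + abs(g1) ** 2
    heterodyne_term = 2.0 * i_ref * i_sample * complex(g1).real
    homodyne_term = i_sample ** 2 * g2
    return heterodyne_term / homodyne_term

# Reference-to-sample ratios from 1:1 up to 100:1 at zero delay (g1 = 1)
for ratio in (1.0, 3.0, 100.0):
    print(ratio, heterodyne_to_homodyne_ratio(ratio, 1.0, 1.0))
```

At zero delay the heterodyne term exceeds the homodyne term by the intensity ratio itself, so the benefit only becomes large when the reference is much stronger than the sample signal.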

There are, however, important constraints on the implementation of the heter- 
odyne approach. Heterodyning requires that the scattering from the reference 
and sample beams interferes at the detector. This is possible only if pathlengths
for photons scattered by the reference and sample differ by less than the longitu- 
dinal coherence length (typically on the order of a micron for a hard X-ray beam, 
as discussed above). Moreover, if the reference is not tightly coupled mechanically 
to the sample, relative vibrational motions can cause severe problems. 

An additional consideration is that the random noise on the reference signal $I_r$
sets a limit to the increased sensitivity practically possible. In optical
heterodyning, ratios $I_r/I_s$ at least as high as $10^4$ have been used (Earnshaw 1997), though
smaller values are more common. For a fully coherent beam (not realized in
current sources), Eq. (18) suggests that, for a given coherence time, the shot noise on
the reference signal at each pixel will be smaller than the fluctuation-caused
changes to the second term if $N_s > N_r^{1/2}$, where the $N$ represent average photon
numbers from the two scatterers. However, in making the autocorrelation of
Eq. (19), many coherence times will typically be averaged over. If we average
over $N_c$ coherence times, we might expect the shot noise to be sufficiently small
if $N_c N_s > (N_c N_r)^{1/2}$, i.e., $N_c > N_r/N_s^2$. This conjecture, however, remains to
be verified.

In recent hard X-ray heterodyne spectroscopy studies of capillary waves on liq- 
uid surfaces by Gutt et al. (2003), ratios of reference to sample signal of 1-3:1 are 
cited. The reference signal in this work is believed to be the component of the in- 
cident beam that is totally reflected from the surface, not a separate scattering ob- 
ject. In the work of Livet, Sutton, and collaborators (Livet et al. 2006), scattering 
from an aerosol served as the reference for the study of coherent scattering from 
a filled polymer. 

Effective heterodyning has also been used in “lensless” soft X-ray holography 
(McNulty et al. 1992; Howells et al. 2001; Eisebitt et al. 2003). Here several
approaches have been investigated to create a reference beam. These include the
use of a small auxiliary hole in a mask to give a pinhole diffraction reference sig- 
nal and the use of small-angle scattering reference signals from a static aerogel. 

Thus hard X-ray heterodyne fluctuation spectroscopy, which offers the potential 
of higher sensitivity than homodyne techniques, has now been demonstrated but 
its development remains in its infancy. 


13.1.3.1.3 Studies of Critical Fluctuations with Microbeams 

Mocuta et al. (2005) have developed a very different approach to the study of fluc- 
tuations near a critical point. They focused an X-ray beam to a micron length 
scale in the critical region of Fe3Al and observed the temporal variation in scat- 
tered intensity as individual fluctuations appeared and disappeared. They were 
able to compare their results carefully with theory and to identify the crossover 
from noncritical to critical dynamics behavior. 


13.1.3.1.4 Coherent X-Ray Studies of the Kinetics of Nonequilibrium Systems 
Coherent X-ray scattering can also provide unique information about the kinetics 
of phase transitions, though theoretical interpretation is a key issue. In the case 
of a material evolving toward equilibrium, the structure is no longer stationary
and the autocorrelation function defined above in Eq. (15) is no longer meaning- 
ful. An alternative approach is to define the two-time correlation function, Eq. 
(20), where the average represents an ensemble average often estimated from 
the X-ray data by smearing out the speckle pattern. 





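$$C(q,t_1,t_2) = \frac{\big\langle [I(q,t_1) - \langle I(q,t_1)\rangle]\,[I(q,t_2) - \langle I(q,t_2)\rangle] \big\rangle}{\langle I(q,t_1)\rangle\,\langle I(q,t_2)\rangle} \tag{20}$$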


Of course, the two-time correlation function is again an experimental quantity 
which is not accessible from traditional “incoherent” scattering experiments. 
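
A two-time correlation function of this kind can be estimated from a sequence of speckle frames as sketched below in Python. The sketch assumes that all pixels passed in lie at equivalent wave vectors, so that the pixel average at each time stands in for the ensemble average obtained by "smearing out" the speckle pattern; this choice, the array layout, and the function name are assumptions for illustration and not the procedure of any specific cited study.

```python
import numpy as np

def two_time_correlation(frames):
    """Estimate C[t1, t2] from speckle frames of shape (n_times, n_pixels).

    The pixel average at each time is used as the ensemble average <I(q, t)>,
    which assumes all pixels lie at equivalent q.
    """
    frames = np.asarray(frames, dtype=float)
    mean_intensity = frames.mean(axis=1, keepdims=True)   # <I(q, t)> per frame
    fluctuation = frames / mean_intensity - 1.0           # (I - <I>) / <I>
    # Average the product of fluctuations over pixels for every pair (t1, t2)
    return fluctuation @ fluctuation.T / frames.shape[1]

rng = np.random.default_rng(0)
static_pattern = rng.exponential(1.0, size=20_000)
frozen = np.tile(static_pattern, (4, 1))             # speckle that never changes
uncorrelated = rng.exponential(1.0, size=(3, 20_000))  # fully decorrelated frames

print(two_time_correlation(frozen).round(2))
print(two_time_correlation(uncorrelated).round(2))
```

A frozen speckle pattern gives a constant matrix, while fully decorrelated frames leave only the diagonal; a coarsening alloy would fall between these limits, with the width of the ridge around the diagonal giving the correlation decay time.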

In the case of late stage coarsening in ordering and phase separating alloys, 
Langevin theory and simulations (Brown et al. 1997, 1999) have predicted how 
$C(q,t_1,t_2)$ decays as a function of wave vector $q$ and difference in observation
times $\Delta t = t_2 - t_1$. They predict that the speckle pattern from coarsening alloys
is quite long-lived, with correlation decay times comparable to the mean time
$t_m = (t_1 + t_2)/2$ since the ordering began. They also predict a scaling relationship
between decay time t and mean ordering time tm with scaling variable x = q°t. 

The general theory and simulation predictions of the behavior of $C(q,t_1,t_2)$
have been borne out in coherent small-angle X-ray scattering (SAXS) studies 
of phase separation (Malik et al. 1998; Livet et al. 2001) and wide-angle X-ray
scattering (WAXS) studies of ordering (Fluerasu et al. 2005; Ludwig et al. 2005).
While the experiments confirmed some of the theoretical predictions about coars- 
ening kinetics, there were also features of the coherent X-ray scattering that were 
unexpected. For example, Fig. 13.3(a) shows the normalized two-time correlation 
functions at three different mean times $t_m$ during the domain coarsening of a
Cu0.79Pd0.21 long-period superlattice (LPS) alloy (Ludwig et al. 2005). The width
of the peaks gives the correlation decay time, which is linear in mean ordering
time, as predicted by theory (Fig. 13.3b). However, the dimensionless ratio
between them differs from that expected. Langevin theory, Langevin simulations
(Brown et al. 1999), and Ising model simulations (Ludwig et al. 2005) on a variety 
of ordering models all give ratios of 1.3-1.4:1 between correlation decay time and 
mean time of coarsening. Experiments on Cu-Pd give ratios of about 0.5:1 and 
independent experiments on Cu-Au alloys give slightly higher values (Fluerasu 
et al. 2005), but still significantly lower than 1.3:1. The cause and significance of 
the discrepancy are still unclear. 

In addition, the studies on Cu-Pd and Cu-Au alloys (Fluerasu et al. 2005; 
Ludwig et al. 2005) find small superlattice peak shifts during ordering that would 
correspond to average lattice changes of approximately 0.1-0.2%. At least in the 
Cu-Pd case, the superlattice peaks shift while the individual speckles making up 
the peak do not. Since a uniform change in lattice parameter during ordering 
would shift the speckles, we conclude that the peak shift is due to some inhomo- 
geneous process — perhaps the elimination of strain at antiphase domain bound- 
aries as suggested by Fluerasu et al. (2005). 

Fig. 13.3 Behavior of two-time correlation functions during ordering
of Cu0.79Pd0.21 alloy: a) normalized two-time correlation function for
mean ordering times $t_m$ of (from bottom to top) 5676 s, 17 286 s, and
30 440 s; b) scaling of reduced two-time correlation function decay time
t with reduced mean time. The two are proportional, as predicted by
theory.


An alternative approach to using the two-time correlation function of Eq. (20) 
for examining late-stage coarsening kinetics with coherent X-ray scattering is 
to use a detrended fluctuation analysis instead (Stadler et al. 2003). This allows
one to examine the speckle intensity fluctuations about a varying mean. Stadler 
et al. (2003, 2004) used this approach to distinguish between competing coarsen- 
ing mechanisms in Al-Ag and Al-Zn alloys. 
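
For readers unfamiliar with fluctuation analysis about a varying mean, the following Python sketch shows the generic first-order detrended fluctuation analysis algorithm applied to a single intensity time series. It is a textbook form of the method, not the specific implementation of Stadler et al.; the window sizes, the linear detrending order, and the function names are assumptions.

```python
import numpy as np

def dfa_fluctuation(signal, window_sizes):
    """RMS fluctuation F(n) about a local linear trend for each window size n."""
    signal = np.asarray(signal, dtype=float)
    profile = np.cumsum(signal - signal.mean())      # integrated series
    fluctuations = []
    for n in window_sizes:
        n_windows = profile.size // n
        segments = profile[: n_windows * n].reshape(n_windows, n)
        x = np.arange(n)
        # Fit a straight line to every window at once (one window per column)
        coeffs = np.polyfit(x, segments.T, 1)
        trends = np.outer(x, coeffs[0]) + coeffs[1]
        residuals = segments.T - trends
        fluctuations.append(np.sqrt(np.mean(residuals ** 2)))
    return np.array(fluctuations)

def dfa_exponent(signal, window_sizes):
    """Slope of log F(n) versus log n."""
    f = dfa_fluctuation(signal, window_sizes)
    slope, _ = np.polyfit(np.log(window_sizes), np.log(f), 1)
    return slope

rng = np.random.default_rng(1)
white_noise = rng.standard_normal(16_384)
alpha = dfa_exponent(white_noise, [16, 32, 64, 128, 256])
print("exponent for uncorrelated noise:", round(alpha, 2))
```

Uncorrelated noise gives an exponent close to 0.5; persistent correlations in the speckle intensity would raise it, which is the kind of signature such an analysis looks for.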

In summary, the coherent scattering study of phase transition kinetics is still
young. It is clear that much more information is available from coherent studies 
than from traditional X-ray scattering. However, theoretical developments are also 
needed to interpret the extra information obtained from coherent scattering in 
more general cases. 


13.1.3.1.5 Coherent X-Ray Studies of Microscopic Reversibility 
In first-order phase transitions, the development and growth of the product phase 
inside the parent phase is often heterogeneous, with nucleation and growth path- 
ways determined by local defects and stresses. This raises the important question 
of to what extent such processes are microscopically reversible. Since the coher- 
ent scattering speckle pattern is sensitive to the microscopic structure of the ma- 
terial, variations in the speckle pattern allow differences between microscopic 
states to be measured quantitatively. The first such measurement was
performed by Pierce et al. (2003) using coherent magnetic soft X-ray scattering to
examine the magnetic domain evolution around the hysteresis loop of Co-Pt
multilayer films. For a given magnetic field the magnetic domain structure and hence
the speckle pattern are largely static. As the magnetic field is varied around either
a minor hysteresis loop or the full, major hysteresis loop, the speckle pattern
is monitored. The microscopic correlation between the domain structure of the
sample at two fields $B_1$ and $B_2$ is calculated with the speckle cross-correlation
function, basically a sum over the detector pixels of the product of suitably
normalized intensities $I(q,B_1)\,I(q,B_2)$. At low fields, the microscopic memory
loss grows exponentially with field. For smooth films, without strong pinning 
sites, saturation completely erased the microscopic memory while for rough 
films significant microscopic memory remained after going around the hysteresis 
loop. 
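
A speckle cross-correlation of this type can be sketched in a few lines of Python. The normalization used here (mean subtraction followed by division by the two autocorrelations, i.e. a Pearson-type coefficient) is one common choice and is an assumption; the text only specifies "suitably normalized intensities". The function name is hypothetical.

```python
import numpy as np

def speckle_cross_correlation(pattern_a, pattern_b):
    """Normalized pixel-wise correlation between two speckle patterns.

    Returns 1 for identical patterns and about 0 for unrelated ones.
    """
    a = np.asarray(pattern_a, dtype=float).ravel()
    b = np.asarray(pattern_b, dtype=float).ravel()
    a = a - a.mean()
    b = b - b.mean()
    return float(np.sum(a * b) / np.sqrt(np.sum(a * a) * np.sum(b * b)))

rng = np.random.default_rng(0)
pattern_1 = rng.exponential(1.0, size=(64, 64))   # synthetic speckle at field B1
pattern_2 = rng.exponential(1.0, size=(64, 64))   # unrelated pattern at field B2

print(speckle_cross_correlation(pattern_1, pattern_1))
print(speckle_cross_correlation(pattern_1, pattern_2))
```

A value near 1 after cycling the field would indicate that the domain structure returned to the same microscopic state, and a value near 0 that the memory was lost.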

Though the technique has only recently been demonstrated, the use of coher- 
ent X-ray scattering has clearly shown its potential to follow microscopic revers- 
ibility and memory quantitatively. 


13.1.3.2 Phonon Excitations 
The study of the collective motion of atoms in condensed matter, better known as
the measurement of phonon or vibrational dynamics, is traditionally the domain of
neutron spectroscopy. Neutrons are particularly suitable as probing particles, due to:
• a large penetration depth, because of the sufficiently low
neutron-nucleus scattering cross-section
• the energy of neutrons being comparable to the energies of
collective excitations such as phonons
• neutron momentum making it possible to probe the whole
dispersion scheme out to several Å$^{-1}$, in contrast to inelastic
light scattering techniques such as Brillouin and Raman
scattering, which can only determine acoustic and optic
modes at very small momentum transfers.

Whereas there are a number of good reviews discussing phonon investigations
with neutron spectroscopy (Krivoglaz 1969; Squires 1997; Lovesey 1986; Bée
1988; Scherm and Fåk 2006), here only new types of spectroscopy that have been
developing rapidly during the last few years will be discussed.

It was pointed out in textbooks (e.g., Ashcroft and Mermin 1988) a long time 
ago that X-rays can in principle be utilized to determine the phonon spectra. 
However, it was stressed that this would be an enormously difficult experimental
challenge because of the extremely high energy resolution an X-ray instrument
needs in order to resolve excitations resulting from the collective atom motions. Since
that time, however, enormous technical improvements have enabled this kind of 
measurement. Moreover, measurements of vibrational density of states (VDOS) 
spectra are nowadays possible with two powerful and partially complementary 
high-resolution techniques, namely by inelastic X-ray scattering and by nuclear 
inelastic scattering. Both techniques will be described in the following sections. 


13.1.3.2.1 Inelastic X-Ray Scattering 
Resolving vibrational excitations in condensed matter, which lie in the meV range,
with X-ray photons of wavelength about 1 Å (equivalent to a photon energy of about
12 keV) requires a relative energy resolution of at least $\Delta E/E = 10^{-7}$. The use
of photons is, however, sometimes significantly advantageous compared to the use of
neutrons. This can be particularly important in studies of disordered media: it is
not possible to study acoustic excitations propagating with the speed of sound $v_s$
using a probe particle with a speed $v$ smaller than $v_s$. This limitation is not
relevant in neutron spectroscopy studies of crystalline samples because acoustic
excitations can be studied in high-order Brillouin zones. The situation is very
different in studies of phonons in disordered systems such as liquids, glasses, and
gases. In crystals, the mesoscopic space-time domain corresponding to a
momentum-energy region of 0.01-10 nm$^{-1}$ and 0.1-20 meV is traditionally studied by
inelastic neutron scattering (INS) (Lovesey 1986; Squires 1997). The neutron
technique has been used successfully to investigate the dynamics in disordered
systems, at momentum transfers typically larger than 10 nm$^{-1}$ (Suck et al. 1992;
Buchenau et al. 1996). On the mesoscopic scale, however, neutrons cannot easily
probe the acoustic branch when the speed of sound $v_s$ in the disordered material
exceeds the neutron speed. Considering that the typical values of $v_s$ in glasses
and liquids are either comparable to or considerably larger than 1500 m s$^{-1}$, this
explains why, in disordered systems, a comprehensive experimental picture of the
high-frequency collective dynamics is still missing$^{1)}$ (Sette et al. 1998).
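
The orders of magnitude in this argument are easy to check numerically. The Python sketch below converts a wavelength to a photon energy, evaluates the relative resolution needed for a given energy transfer, and computes the speed of a neutron of given kinetic energy for comparison with a sound velocity. The physical constants are standard values; the 1 meV energy transfer, the 25 meV neutron energy, and the function names are assumptions chosen for illustration.

```python
import math

HC_KEV_ANGSTROM = 12.398419843        # h*c in keV * Angstrom
NEUTRON_MASS = 1.67492749804e-27      # kg
JOULE_PER_EV = 1.602176634e-19

def photon_energy_kev(wavelength_angstrom):
    """Photon energy E = h*c / lambda in keV."""
    return HC_KEV_ANGSTROM / wavelength_angstrom

def relative_resolution(energy_transfer_mev, photon_kev):
    """Ratio dE/E for an energy transfer in meV and a photon energy in keV."""
    return (energy_transfer_mev * 1e-3) / (photon_kev * 1e3)

def neutron_speed(energy_mev):
    """Speed in m/s of a neutron with kinetic energy given in meV."""
    energy_joule = energy_mev * 1e-3 * JOULE_PER_EV
    return math.sqrt(2.0 * energy_joule / NEUTRON_MASS)

energy = photon_energy_kev(1.0)                 # about 12.4 keV at 1 Angstrom
resolution = relative_resolution(1.0, energy)   # 1 meV transfer, assumed
speed = neutron_speed(25.0)                     # 25 meV neutron, assumed

print(f"E = {energy:.2f} keV, dE/E = {resolution:.1e}, neutron speed = {speed:.0f} m/s")
```

Following the kinematic argument in the text, a sound velocity above the returned neutron speed would put the acoustic branch out of reach at small momentum transfer for that incident energy.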

To a certain extent another advantage of the inelastic X-ray scattering (IXS)
technique is even more important. This is the fact that very small beam sizes of
the order of a few tens of microns can be obtained currently at third-generation
synchrotron sources, allowing studies of systems available only in small
quantities down to only a few thousand µm$^3$. Another consequence of the very narrow
beam is the possibility of investigating samples in extreme conditions, especially
at very high pressures up to 100 GPa and at high temperatures.

1) Only in fluids of heavy atoms and low-density gases is the speed of sound $v_s$
lower than about 1000 m s$^{-1}$, and only in these media can the acoustic dynamics
be investigated effectively via neutron spectroscopy.

These differences with respect to inelastic neutron scattering motivated the 
development of the very high-resolution inelastic X-ray scattering technique. The 
pioneering experiments performed in 1986 (Burkel et al. 1986; Pattison et al. 
1986; Suortti et al. 1986) triggered rapid evolution of the IXS technique. To date 
there are two instruments operational at the ESRF, one at APS and one at
SPring-8, and several more under construction.

The optical layout of the instrument is based on the triple-axis principle; it 
consists of the high-energy resolution monochromator, the sample goniometer, 
and the crystal analyzer. Details of the instrumentation can be found in Burkel 
(2000), Ruocco and Sette (2003), and Sette and Krisch (2006). Due to the backscat- 
tering geometry the beamline is fairly long in order to acquire a sufficient beam 
offset between the incident photon beam from the X-ray source and the focused, 
high energy-resolution beam at the sample position. 

The general applicability of the method for the study of elementary solids can 
be estimated easily (Bosak and Krisch 2005). The overall scattered intensity is
proportional to $n s f^2(\langle Q\rangle) \exp(-s/\lambda)$, where $n$ is the concentration of scatterers, $s$ is
the sample thickness, $f(\langle Q\rangle)$ is the atomic form factor, and $\lambda$ is the X-ray
attenuation length, $\lambda = 1/\mu$. The optimum signal is attained if $s = \lambda$; on the other hand $s$
must not exceed the focal depth of field of the spectrometer. For typical VDOS 
setups the depth of field is limited to about 3 mm. 
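
The thickness optimum follows from maximizing the factor $s\,\exp(-s/\lambda)$, whose derivative vanishes at $s = \lambda$. The Python sketch below evaluates this factor and applies the depth-of-field cap mentioned above; the function names and the example attenuation lengths are assumptions for illustration.

```python
import math

def relative_signal(thickness, attenuation_length):
    """Thickness-dependent factor s * exp(-s / lambda) of the scattered intensity."""
    return thickness * math.exp(-thickness / attenuation_length)

def best_thickness(attenuation_length, depth_of_field=3.0e-3):
    """Thickness (m) maximizing the signal, capped by the spectrometer depth of field.

    The signal rises monotonically up to s = lambda, so the cap is simply
    the smaller of the two lengths.
    """
    return min(attenuation_length, depth_of_field)

for att in (1.0e-4, 1.0e-2):   # assumed attenuation lengths of 0.1 mm and 10 mm
    s_opt = best_thickness(att)
    print(f"lambda = {att:.0e} m -> use s = {s_opt:.0e} m, "
          f"signal factor = {relative_signal(s_opt, att):.2e}")
```

For weakly absorbing materials the depth of field, not absorption, sets the usable thickness, which is why the two limits are stated together in the text.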

As a consequence, the X-VDOS can be determined with an appropriate
accuracy for essentially all elemental solids. One of the potential applications of this
novel technique is the VDOS determination of samples subjected to high
pressures. The most commonly used high-pressure device is the diamond anvil cell
(DAC). For elements heavier than scandium the signal level should be sufficiently 
high to allow measurements in a DAC. This is of particular interest for geophys- 
ical studies, where the determination of the VDOS in combination with low Q 
measurements allows the determination of average sound and average longitudi- 
nal speed and the derivation of the shear velocity without precise knowledge of 
the equation of state. With respect to INS, the amount of material needed is three
to five orders of magnitude less, and anomalous absorption by atoms such as B, Cd,
or Gd, or an anomalously high cross-section as in H, is not present. The validity of the
approach has been checked very recently by IXS measurement on diamond and 
on MgO and by successful comparison with ab-initio and thermodynamical re- 
sults (Bosak and Krisch 2005). 

IXS measurements in solids are not yet numerous, but given the novelty of the
technique two other recent applications should be mentioned. The first is the
measurement of the lattice dynamics of molybdenum in a DAC, where complete
phonon dispersion curves were collected at high pressure and compared
with ab-initio calculations (Farber et al. 2006). The second deals
with the dispersion of acoustic and optical surface phonon modes in the 2H
polytype of NbSe2 measured by IXS under grazing incidence conditions, demonstrating
the surface sensitivity of the method through selective studies of either surface or
bulk lattice dynamics in a single experiment (Murphy et al. 2005).


13.1.3.2.2 Nuclear Inelastic Scattering 

As was stressed in Section 13.1.3.2 for scattering of visible light, momen- 
tum transfer from phonons covers only a minute part of the Brillouin zone. For 
scattering of X-rays, energy changes through phonon interaction are orders of 
magnitude smaller than X-ray energies and therefore extreme resolution is
demanded to resolve the tiny changes. For recoilless nuclear resonance absorption,
the extreme energy resolution of the Mössbauer effect goes without saying. There
was an insurmountable intensity problem earlier, however, due to the width of the 
phonon line being greater by orders of magnitude and therefore the amplitude 
being tiny compared to the Mössbauer line. The breakthrough came through syn- 
chrotron radiation. Instead of measuring scattering from electronic shells, one 
can use resonant scattering from nuclei where increases in scattering amplitude 
of several orders of magnitude are possible. Therefore, the counting rate in nu- 
clear inelastic absorption experiments can be quite high using the present instru- 
mentation with the third generation of synchrotron sources, even from extremely 
small samples. The high brilliance of synchrotron radiation permits the tuning of 
the incoming monochromatized X-ray energy, so that it exactly matches the
nuclear excitation energy and can be resonantly absorbed at the 'Mössbauer level'
of a suitable nucleus, usually the 14.4 keV nuclear level of ⁵⁷Fe.

If a phonon is now created or annihilated by the photon, the subsequent nuclear
resonance occurs only when, after the phonon interaction, the X-ray energy just
matches the energy of the nuclear transition (Minkiewicz et al. 1967).
Therefore, by tuning the X-ray energy, the phonon spectrum of a solid can be scanned.
The incoherent scattering (re-emission after absorption at the nuclear level) of the 
X-rays reproduces the vibrational phonon density of states (VDOS). The accompa- 
nying prompt electronic scattering is separated using the pulse structure of syn- 
chrotron radiation and the relatively long lifetime of nuclear excitation. The inci- 
dent radiation is monochromatized down to an energy width of about 1 meV. 
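
Two numbers make this separation scheme tangible: the relative bandwidth implied by a 1 meV energy width at 14.4 keV, and the fraction of the delayed nuclear signal that survives a time gate opened after the prompt electronic pulse. The Python sketch below computes both. The mean lifetime of about 141 ns for the 14.4 keV level is a literature value that is not given in the text, and the gate delay and function names are likewise assumptions for illustration.

```python
import math

LEVEL_ENERGY_EV = 14.4e3     # nuclear level energy quoted in the text
LEVEL_LIFETIME_NS = 141.0    # assumed mean lifetime of that level (not from the text)

def relative_bandwidth(energy_width_mev, level_ev=LEVEL_ENERGY_EV):
    """Relative energy width dE/E of the monochromatized beam."""
    return energy_width_mev * 1e-3 / level_ev

def delayed_fraction(gate_start_ns, lifetime_ns=LEVEL_LIFETIME_NS):
    """Fraction of nuclear de-excitation events arriving after the gate opens."""
    return math.exp(-gate_start_ns / lifetime_ns)

bandwidth = relative_bandwidth(1.0)    # 1 meV width, as quoted above
surviving = delayed_fraction(10.0)     # gate opened 10 ns after the pulse, assumed

print(f"dE/E = {bandwidth:.1e}, delayed fraction after 10 ns = {surviving:.2f}")
```

Because the prompt electronic scattering is essentially instantaneous on this time scale, a gate of a few nanoseconds rejects it while discarding only a small part of the exponentially decaying nuclear signal.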

The method is termed nuclear inelastic scattering (NIS). A distinguishing
feature of nuclear inelastic scattering compared to other phonon-sensitive
techniques is its unique isotope selectivity: no nuclear level other than the first excited
level of ⁵⁷Fe will match the 14.4 keV incoming X-ray energy. Nuclear inelastic
scattering therefore selects no species other than iron atoms.

The development of the method is described in a number of publications 
(see, e.g., Seto et al. 1995; Sturhahn et al. 1995; Chumakov et al. 1995); a review 
of some of the recent investigations using this method is given by Chumakov and 
Sturhahn (1999) and by Burkel (2000). For the theoretical aspects of the method, 
refer to Sturhahn and Kohn (1999). 

Since NIS is a resonant method, the X-ray scattering amplitude is several
orders of magnitude higher than the Thomson scattering from the electrons. Count
rates in inelastic experiments are usually quite high and even the vibrational
density of states from submonolayer thin samples can be measured. Despite the short
time since the first experiments, the method has been applied to a variety of
systems ranging from crystals to disordered and amorphous materials as well as
from multilayers to monolayers and nanocrystalline islands. The feasibility of
the method was first demonstrated by measurements of the phonon density
of states of an α-iron foil as a function of temperature (Chumakov et al. 1996a) or
pressure (Lübbers et al. 2000). From the huge variety of systems studied since
the mid-1990s, only a small selection can be presented here: for instance,
studies of VDOS in oriented hcp iron (Giefers et al. 2002); of phonon projected
density of states in FeBO3 (Kohn et al. 1998); of VDOS in magnetite (Handke et al.
2005), in iron alloys such as Fe3Al (Fultz et al. 1998), in Pt3Fe (Yue et al. 2002), in
icosahedral quasicrystals (Brand et al. 1999), and in nanocrystalline iron (Fultz
et al. 1997); and of protein dynamics in myoglobin (Achterhold et al. 2002) and
in thin iron films in grazing incidence geometry (Röhlsberger et al. 1999).
Finally one should mention that phonon densities can be measured using not
only the ⁵⁷Fe Mössbauer isotope but also ¹⁶¹Dy (Chumakov et al. 2001), ¹¹⁹Sn
(Barla et al. 2000), and other isotopes, or even in samples containing no resonant
isotope in a modified setup where photons scattered inelastically in the
non-resonant sample are analyzed by a foil containing the resonant isotope in front
of the detector (Chumakov et al. 1996b). The last technique especially seems
very promising once the next, much more powerful, generation of synchrotron
sources starts running.


13.1.3.3 Quasielastic Scattering: Diffusion 

Diffusion in solids is typically investigated on a macroscopic scale by following the
interpenetration of two atomic species across an interface, which yields the chemical
diffusion coefficient. The self-diffusion coefficient can be measured, on the other
hand, by labeling atoms with radioactive isotopes. This technique, also known as 
the tracer or macroscopic method of diffusion investigation (see, e.g., Mehrer 
1990, 1996), deduces atomic events from macroscopic effects, such as, for in- 
stance, the concentration gradient of diffusing isotopes. Diffusion can be studied 
also at the microscopic level, i.e., by revealing the jump vector and the jump rate 
of the atoms. The study of the microscopic jump diffusion mechanism is the top- 
ic of this section. Different scattering methods can be used to resolve the atomic 
dynamics on various time scales: Mössbauer spectroscopy, quasielastic neutron 
scattering and neutron spin-echo, nuclear resonant scattering, quasielastic scat- 
tering of helium atoms, or X-ray photon correlation spectroscopy. 

Whereas quasielastic methods like Mössbauer spectroscopy and neutron
scattering both work in the energy domain, some new methods work in the time
domain, i.e., interference between different frequencies is observed as a function of
time. X-rays from the synchrotron source monochromatized in two steps are
transmitted through the sample, exciting the resonant nuclei. The delayed,
re-emitted radiation is mapped as a function of elapsed time, constituting the time
spectrum of the nuclei in the sample. The method, called nuclear resonance
scattering of synchrotron radiation, is a daughter of Mössbauer spectroscopy. Related
methods working in the time domain are neutron spin-echo spectroscopy and
X-ray photon correlation spectroscopy. One should note that a considerable
proportion of the dynamics investigations in recent years were performed with
synchrotron radiation, which turns out to be an exceptionally versatile tool for
diffusion studies.

In order to verify which diffusion mechanism operates in a specific material it 
is necessary to develop a mathematical model of the diffusion process. This is 
done from knowledge of the system, or it is simply an educated guess. The math- 
ematical model is constructed as a system of differential equations. Usually it can 
be solved analytically and yields the function describing the atomic motion. The 
free parameters in the model are adjusted in such a way that the best possible 
agreement with the experiment is achieved. 

The simplistic picture of atoms jumping on the lattice can be formulated
precisely using the van Hove (1954) correlation function G(r, t)²⁾. If we know the
function G(r,t), we have full information about the diffusing system. The van 
Hove correlation function (or pair correlation function) has the following classical 
meaning. 


G(r, t) d³r is the probability of finding a nucleus in the
volume element r...r + d³r for a given time t, if this or
another nucleus has been at the position r = 0 at time t = 0.
Its self-part G_s(r, t) d³r is the probability of finding a nucleus
in the volume element r...r + d³r for a given time t if the
same particle was at the origin r = 0 at t = 0, averaged over all
the starting positions of the nucleus.


Neglecting the discrete structure of a real material, i.e., for a continuous medium, 
the diffusion of particles must obey Fick’s equation. The solution of this equation 
neglects the discontinuous nature of real materials. In a real material and at not 
too elevated temperatures, diffusion can be considered as jump diffusion as in 
the model of Chudley and Elliott (1961). This model, although originally devel- 
oped for liquids, found more applications in solid-state diffusion with the follow- 
ing assumptions (cf. Chapter 5, Section 5.2). 
• All lattice sites are equivalent, i.e., we have a Bravais
  (translation-invariant) lattice.
• The diffusion is decoupled from lattice vibrations.
• Only jumps to nearest-neighbor sites are allowed; l_j is the jth
  vector connecting these sites.
• The jump time is negligible compared with the residence
  time τ on the lattice site.
• Successive jumps are uncorrelated, i.e., the jump direction of
  the following jump is independent of the preceding one. In
  other words, the present state of the system is determined
  only by the past state at a particular time (Markov process =
  uncorrelated diffusion).

2) r denotes here displacements of atoms (in our case jump vectors) and not positions.


If we denote the jump rate of the particle from one site to any nearest-neighbor site
by τ⁻¹ and the number of neighboring sites (coordination number) by N, then the
probability of finding this particle at position r at time t + Δt is given by Eq. (21),
i.e., two probability terms are included: the first term describes particles that were
at the neighboring site j at a distance l_j from r and jump into r during the time
interval Δt; the second term describes particles that stay at the site r.


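\[
G_s(\mathbf{r}, t+\Delta t) = \frac{1}{N\tau}\sum_{j=1}^{N} G_s(\mathbf{r}+\mathbf{l}_j, t)\,\Delta t + \left(1-\frac{\Delta t}{\tau}\right) G_s(\mathbf{r}, t) \tag{21}
\]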





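In the limit of infinitesimally small Δt the master equation, Eq. (22), is obtained.

\[
\frac{\partial G_s(\mathbf{r}, t)}{\partial t} = \frac{1}{N\tau}\sum_{j=1}^{N} \left\{ G_s(\mathbf{r}+\mathbf{l}_j, t) - G_s(\mathbf{r}, t) \right\} \tag{22}
\]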


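Equation (22) is usually solved by a Fourier transform in space leading to the
differential equation for the so-called intermediate scattering function I_s(k, t)³⁾.

\[
\frac{\partial I_s(\mathbf{k}, t)}{\partial t} = -\frac{1}{N\tau}\sum_{j=1}^{N} \left[ 1 - \exp(-i\mathbf{k}\cdot\mathbf{l}_j) \right] I_s(\mathbf{k}, t) \tag{23}
\]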


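The solution of Eq. (23) with the boundary condition corresponding to
G_s(r, 0) = δ(r) ⇒ I_s(k, 0) = 1 is an exponential function of time, Eq. (24).

\[
I_s(\mathbf{k}, t) = I_s(\mathbf{k}, 0)\,\exp[-\gamma(\mathbf{k})\,t/\tau], \qquad \gamma(\mathbf{k}) = N^{-1}\sum_{j=1}^{N} \left[ 1 - \exp(-i\mathbf{k}\cdot\mathbf{l}_j) \right] \tag{24}
\]
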
I_s(k, t) transformed into the energy domain by a Fourier transform in time yields
the scattering function S(k, ω) [Eq. (25)], which is a Lorentzian function and can
be measured experimentally by Mössbauer spectroscopy or quasielastic neutron
scattering.

\[
S(\mathbf{k}, \omega) \propto \frac{\Gamma(\mathbf{k})/2}{(\Gamma(\mathbf{k})/2)^2 + (\hbar\omega)^2}, \qquad \Gamma(\mathbf{k}) = (2\hbar/\tau)\,\gamma(\mathbf{k}) \tag{25}
\]
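
As a numerical cross-check of Eqs. (22)-(24), the following minimal Python sketch integrates the master equation on a one-dimensional periodic chain (N = 2 nearest neighbors) and compares the resulting intermediate scattering function with the exponential decay of Eq. (24). The chain geometry, the explicit Euler time stepping, and all function and parameter names are illustrative assumptions and are not taken from the works cited above.

```python
import cmath
import math


def evolve_master_equation(num_sites=16, tau=1.0, dt=1e-3, steps=1000):
    """Euler integration of the master equation, Eq. (22), on a periodic 1D chain.

    Assumption: N = 2 neighbors at +a and -a; the ring is only a convenient
    toy geometry for a Bravais lattice.
    """
    g = [0.0] * num_sites
    g[0] = 1.0  # G_s(r, 0) = delta(r): the atom starts at the origin
    for _ in range(steps):
        g = [
            g[r] + dt / (2.0 * tau)
            * (g[(r + 1) % num_sites] + g[(r - 1) % num_sites] - 2.0 * g[r])
            for r in range(num_sites)
        ]
    return g


def intermediate_scattering(g, k, a=1.0):
    """Spatial Fourier transform I_s(k, t) of the site probabilities."""
    return sum(p * cmath.exp(1j * k * r * a) for r, p in enumerate(g))


def gamma_chain(k, a=1.0):
    """gamma(k) = N^-1 sum_j [1 - exp(-i k l_j)] for the two jumps +a and -a."""
    return 1.0 - math.cos(k * a)


if __name__ == "__main__":
    tau, dt, steps, num_sites = 1.0, 1e-3, 1000, 16
    g = evolve_master_equation(num_sites, tau, dt, steps)
    t = dt * steps
    for m in (1, 4, 8):
        k = 2.0 * math.pi * m / num_sites  # wave vectors compatible with the ring
        numeric = intermediate_scattering(g, k).real
        analytic = math.exp(-gamma_chain(k) * t / tau)
        print(f"k = {k:.3f}: numeric {numeric:.4f}, Eq. (24) {analytic:.4f}")
```

The two columns should agree up to the small discretization error of the Euler step; reducing `dt` tightens the agreement.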
3) We use the same terminology for all methods. Thus for Mössbauer spectroscopy
and for quasielastic neutron scattering we use k for the wave vector, I_s(k, t) for the
intermediate scattering function, and S(k, ω) for the scattering function.




The linewidth Γ(k) of this Lorentzian scattering function depends on k·l_j and
hence on the relative orientations of the radiation and the crystalline structure.
This model is not restricted to nearest-neighbor jumps; it can be extended
straightforwardly to jumps into more distant neighbor shells by Eq. (26).

\[
\gamma(\mathbf{k}) = 1 - \sum_{i} \frac{W_i}{N_i} \sum_{j} \exp(-i\mathbf{k}\cdot\mathbf{l}_j) \tag{26}
\]

Here W_i is the probability of a jump to coordination shell i, N_i denotes the
number of lattice sites in coordination shell i, and the inner sum runs over the
jump vectors l_j into that shell. The sum over the probabilities W_i
must be normalized to unity.

We can calculate the line broadening explicitly using Eq. (26) for different
lattices. For nearest-neighbor jumps on a bcc lattice with lattice
constant a, for example, the coordination number is N_1 = 8, and k_x, k_y, and k_z
are the components of the outgoing wave vector k referred to the crystal axes. The
eight possible jump vectors l_j are ±(a/2, a/2, a/2), ±(a/2, −a/2, a/2),
±(−a/2, a/2, a/2), and ±(−a/2, −a/2, a/2). The function γ(k) calculated from
Eq. (26) then reads as Eq. (27).

\[
\gamma(\mathbf{k}) = 1 - \cos\left(\frac{k_x a}{2}\right) \cos\left(\frac{k_y a}{2}\right) \cos\left(\frac{k_z a}{2}\right) \tag{27}
\]
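
The step from the explicit jump vectors to the closed form of Eq. (27) can be verified numerically. The sketch below evaluates the nearest-neighbor sum over the eight bcc jump vectors directly and compares it with the cosine product; it also builds a wave vector in the (110) plane at a given angle from [001], the geometry used for angular scans of the linewidth. The lattice constant of 2.9 Å is only a placeholder value, and all function names are hypothetical.

```python
import cmath
import math
from itertools import product


def gamma_bcc_direct(k, a):
    """1 - (1/8) * sum_j exp(-i k.l_j) over the eight bcc nearest-neighbor jumps."""
    jumps = [tuple(s * a / 2.0 for s in signs) for signs in product((1, -1), repeat=3)]
    total = sum(
        cmath.exp(-1j * sum(kc * lc for kc, lc in zip(k, jump))) for jump in jumps
    )
    return 1.0 - (total / len(jumps)).real


def gamma_bcc_closed(k, a):
    """Closed form of Eq. (27)."""
    kx, ky, kz = k
    return 1.0 - math.cos(kx * a / 2) * math.cos(ky * a / 2) * math.cos(kz * a / 2)


def k_in_110_plane(k_abs, theta):
    """Wave vector of length k_abs at angle theta from [001], within the (110) plane."""
    s = math.sin(theta) / math.sqrt(2.0)
    return (k_abs * s, -k_abs * s, k_abs * math.cos(theta))


if __name__ == "__main__":
    a = 2.9       # Angstrom; placeholder lattice constant (assumption)
    k_abs = 7.3   # 1/Angstrom; wave vector of 14.4 keV radiation
    for degrees in range(0, 91, 15):
        k = k_in_110_plane(k_abs, math.radians(degrees))
        print(degrees, round(gamma_bcc_direct(k, a), 6), round(gamma_bcc_closed(k, a), 6))
```

Multiplying the printed γ(k) by 2ħ/τ gives the linewidth Γ(k) of Eq. (25) for a chosen residence time τ.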


In the same way “structure factors” γ(k) can be calculated for other Bravais
lattices. For example, diffusion on a bcc lattice is realized by iron atoms in
disordered Fe-Al alloys. The diffusional line broadening Γ(k) in disordered bcc
Fe-Al is shown in Fig. 13.4. In this case we can conclude immediately that Fe atoms
jump only via the nearest-neighbor distance and that all lattice sites are
equivalent due to disorder. A comparison with the theoretical line broadening confirms
this simplest microscopic diffusion model.

One should note that for calculating the function I_s(k, t) or Γ(k), only the jump
rate τ⁻¹ and the explicit directions of the atomic jumps l_j are needed. However,
in order to derive from Eq. (26) a function that can be used for calculating nuclear
resonant scattering (NRS) spectra, the further transformation discussed in
Section 13.1.3.3.2 is required.

Fig. 13.4 Angular dependence of the line broadening of disordered
Fe₁₋ₓAlₓ, x = 0.26, 0.25, and 0.17 (Feldwisch 1996): θ is the angle from the
[001] direction in the (110) plane; the solid line was calculated from
Eq. (27) for nearest-neighbor jumps in a bcc lattice.

Intermetallic alloys are compounds of metals whose crystal structures are
different from those of the constituents and which do not form “simple” solid solutions,
i.e., a Bravais lattice with sites occupied statistically by atoms of the alloy
constituents. Intermetallic alloys exhibit instead an additional type of atomic order
consisting of superlattices occupied by only one species (cf. Chapter 2). Intermetallics
form because the bonding between unlike atoms is stronger than that
between like ones. This leads to particular crystal structures with a more or less
ordered distribution of atoms (Ferrero and Saccone 1991). For stoichiometric
compositions and at low temperatures they are well ordered and their degree of
order can be controlled by temperature and/or chemical composition. Due to
their advantageous mechanical behavior (yield stress anomaly; cf. Chapter 6)
and their corrosion resistance, they are of high technological interest for
high-temperature applications (Sauthoff 1995). It is energetically unfavorable to
produce antistructure defects by atoms jumping to “wrong” lattice sites in these
structures. This complicates the diffusion mechanisms, since jumps of a given
atom species leading to wrong sites may be markedly less probable than jumps
to sites on their own sublattice, i.e., there are different jump rates between
sublattices (cf. Chapter 5). In this case it is not sufficient to set up only one master
equation such as Eq. (22) for the Bravais lattice. For a superstructure with m sub- 
lattices one needs m master equations (a jump matrix). The theory of Chudley 
and Elliott (1961) was extended by Rowe et al. (1971) for hydrogen diffusion on 
interstitial sites of a non-Bravais lattice for investigations by quasielastic neutron 
scattering. The Rowe theory assumes equal occupation of different sublattices 
(sites with different “local symmetry” in the unit cell), and attempts to extend 
the theory for systems with differently occupied sublattices have been undertaken 
(Kutner and Sosnowska 1977). Randl et al. (1994) examined the theory in detail 
for diffusion in non-Bravais structures. 

An atom in a non-Bravais lattice can have energetically less and more favored
lattice sites (some jumps can be less probable), i.e., we will have different jump
rates 1/τ_ij, where 1/τ_ij is the jump rate from a site of symmetry i to any
nearest-neighbor site of symmetry j. The jump rates between nearest-neighbor sites are
constrained by detailed balance, which demands that the number of atoms
jumping in a time unit from one sublattice into another must be equal to the
number of reverse jumps according to Eq. (28), where c_i is the probability of the
occupation of the ith sublattice (Σ_i c_i = 1) and each site of the ith sublattice is
surrounded by n_ij sites of the jth sublattice.

\[
\frac{c_i}{n_{ij}\,\tau_{ij}} = \frac{c_j}{n_{ji}\,\tau_{ji}} \tag{28}
\]





The solution of the set of master equations is obtained by calculation of the eigen- 
values and eigenvectors of the appropriate jump matrix (Randl et al. 1994). The 
resulting scattering function is a sum of m Lorentzian functions, each with its
particular width Γ_p and weighting factor w_p.

If the jump mechanism and jump frequencies are known, we can in any case 
calculate the macroscopic diffusion coefficient and compare it with the tracer dif- 
fusion result. The diffusion coefficient in a non-Bravais lattice with m nonequiva- 
lent sublattices can be calculated as a sum of partial diffusion coefficients be- 
tween sublattices [ Eq. (29)]. 


\[
D = \frac{1}{6} \sum_{i,j} c_i\, \frac{l_{ij}^{2}}{\tau_{ij}} \tag{29}
\]


13.1.3.3.1 Quasielastic Methods: Mössbauer Spectroscopy and Neutron Scattering 
Our interest in diffusion studies concentrates on methods which can be used to 
resolve details of the atomic jump process, i.e., features which are not directly ac- 
cessible to tracer diffusion studies. Nowadays there are three different methods 
derived from nuclear physics with this property: 

• quasielastic Mössbauer spectroscopy (QMS)

• quasielastic neutron scattering (QNS)

• nuclear resonant scattering of synchrotron radiation (NRS).


For neutron scattering k is the scattering vector and ħω the energy transfer. As ħω
is very small the scattering process is nearly elastic or “quasielastic”; that is why
the method is called quasielastic neutron scattering (QNS). In analogy we speak
of quasielastic Mössbauer spectroscopy (QMS) because the method is based on
the same theory as QNS, ħω again being very small.

Due to the relatively slow diffusive motion in solids a very good energy resolu- 
tion is required in order to be capable of measuring this slow motion. On the 
other hand, diffusive jumps are over very short distances and, hence, large mo- 
mentum transfers are required. Diffusion processes or, more precisely, jumps of 
atoms are by their nature a random phenomenon, i.e., their influence on the 
measured system can be treated as a perturbation and can lead to a deterioration
of the spectrum. In particular, a broadening of the spectral line in the energy do- 
main due to diffusion jumps can be observed. Importantly, this destructive action 
of diffusion can be investigated as a function of the wave vector. This enables dif- 
ferentiation between various microscopic diffusion mechanisms. 

Quasielastic neutron scattering is a method promoted in the late 1950s
by Brockhouse (1959) and was first applied to diffusion jumps in metals in
the 1980s, whereas quasielastic Mössbauer spectroscopy started in the late 1960s
(Knauer and Mullen 1968). The phenomenon of the emission or absorption of a
γ-ray photon without energy loss due to recoil of the nucleus and without thermal
broadening is known as the Mössbauer effect. For early reviews, see Wertheim
(1964), Flinn (1980), Bauminger et al. (1986), and Nienhaus and Parak (1994). A
γ-quantum emitted by a nucleus decaying from an excited state to the ground
state is absorbed resonantly at the ground level of a second nucleus of the same
isotope. It was Rudolf Mössbauer’s discovery that, for nuclei placed in solids at
low temperature, there is a finite probability that emission or absorption
of the γ-ray photon will take place without absorption or emission of a phonon.
This means that the solid will be in the same internal state before and after the
event, so that, in effect, the recoil is taken up by the crystal as a whole and not by
an individual atom. This makes the recoil energy immeasurably small. The result
is a spectroscopy having the resolution of the lifetime uncertainty of the excited
nuclear state, which is usually in the 10⁻⁹ eV range. In practice, the energy is
scanned by repeatedly moving a radioactive source toward and away from an
absorber. Due to the corresponding Doppler shift, the energy of the γ-rays arriving
at the absorber is varied, causing a varying count rate at the detector behind the
absorber.

For ⁵⁷Fe, by far the most frequently used isotope in Mössbauer spectroscopy,
the energy difference between the excited and ground states, i.e., the energy of
the γ-quantum, is 14.4 keV, corresponding to a length of its wave vector k of
7.3 Å⁻¹. A very much simplified picture can be helpful in understanding this
phenomenon: at low temperature the energy width of the emitted radiation is
determined by the mean lifetime τ₀ of the excited Mössbauer level. If at sufficiently
high temperatures, however, the mean residence time τ of diffusing atoms
between two successive jumps is of the same order of magnitude as, or even smaller
than, the mean lifetime τ₀ of the Mössbauer level, each emitting Mössbauer atom
on average changes its position during the emission process. Thus the wave train
emitted by a diffusing atom is “cut” into several shorter wave trains, which leads
to a greater width via the Heisenberg uncertainty principle. As these wave trains
are emitted by one and the same nucleus, they are coherent. The interference
between wave trains emitted by the same nucleus depends on the relative
orientations of the jump vector and the direction of the wave vectors (cf. Eq. (26) and
Fig. 13.4). Therefore, in certain crystal directions the linewidth is small, while in
others it is very large. In this picture the broadening will be greater, the stronger
the fragmentation of the wave train. A simplified, semiclassical explanation of the
diffusional line broadening is shown in Fig. 13.5.
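
The two numbers quoted above, the wave vector of 7.3 Å⁻¹ and the linewidth in the 10⁻⁹ eV range, follow from k = E/(ħc) and from the lifetime-energy relation Γ₀ = ħ/τ₀. The short sketch below reproduces them; the mean lifetime of about 141 ns for the 14.4 keV level of ⁵⁷Fe is a commonly quoted literature value that is assumed here and is not stated in the text, and the function names are illustrative.

```python
HBAR_EV_S = 6.582119569e-16        # reduced Planck constant in eV s
HBAR_C_EV_ANGSTROM = 1973.269804   # hbar * c in eV Angstrom


def wave_vector(energy_ev):
    """Length of the photon wave vector in 1/Angstrom: k = E / (hbar c)."""
    return energy_ev / HBAR_C_EV_ANGSTROM


def natural_linewidth(lifetime_s):
    """Lifetime-limited energy width in eV: Gamma_0 = hbar / tau_0."""
    return HBAR_EV_S / lifetime_s


if __name__ == "__main__":
    energy_ev = 14.4e3     # 14.4 keV transition of 57Fe
    tau_0 = 141e-9         # s; assumed mean lifetime of the excited level
    print(f"k       = {wave_vector(energy_ev):.2f} 1/Angstrom")
    print(f"Gamma_0 = {natural_linewidth(tau_0):.2e} eV")
```

Diffusional broadening becomes observable once the residence time τ between jumps is comparable to τ₀, i.e., once 2ħ/τ is no longer small compared with Γ₀.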

An analogous argument is possible for QNS where the wave train of neutrons
scattered by a diffusing atom is cut into several wave trains. It will be noticed that
in QNS the calculated scattering function S(k, ω) in Eq. (25) has to be convoluted
with the energy resolution of the given experimental setup. The high energy
resolution of the QNS technique is achieved by the application of backscattering
spectrometers, which are similar to triple-axis spectrometers with the difference
that the scattering angle 2θ at the monochromator as well as at the analyzer is
fixed at nearly 180°. On the one hand, this restricts the range of possible energy
transfers, which can no longer be obtained by changing the angular position of
the monochromator and analyzer. On the other hand, however, it significantly
increases the resolution in energy (Hempelmann 2000).

Fig. 13.5 In a nuclear resonant experiment, the jumping atom itself represents
the source and therefore conserves its phase relative to the emitted wave during
the jump. This is why a) the phase matching with emitted waves from other nuclei
is established by the instantaneous excitation by the X-ray pulse. b) Phase
matching is generally lost after the jump process. This leads to an accelerated
decay of the intensity emitted in the forward direction (NRS) and to the linewidth
broadening (QMS, QNS), respectively (Kaisermayr 2001).

QMS and QNS have proven to be appropriate tools for diffusion studies for
different classes of alloys. QMS has been useful in the studies of iron diffusion in
iron aluminides and silicides. QNS, on the other hand, can be applied for
diffusion investigations in alloys containing nickel, cobalt, and titanium,⁴⁾ provided the
diffusion is sufficiently fast, because the energy resolution of the backscattering
spectrometers IN10 or IN16 at the high-flux reactor ILL Grenoble is lower by a
factor of approximately 100 than the QMS energy resolution (Hempelmann
2000).

4) We are discussing diffusion studies of metallic species in intermetallic alloys here.
Studies of hydrogen diffusion in organic materials and of hydrogen interstitial
diffusion in metals provide, however, the greater part of QNS applications.

Very recently a new neutron spin-echo (NSE) technique was applied
(Kaisermayr et al. 2001a), with which an energy resolution close to that of ⁵⁷Fe QMS is
possible. NSE has been successfully applied to the study of the dynamics in
amorphous systems (proteins, polymers, glassy dynamics, etc.) but it was shown that
it is also suitable for the investigation of diffusion on lattices, giving direct access
to the jump mechanism. NSE is a Fourier method and is sensitive to the
time-dependent correlation function, yielding directly the intermediate scattering
function I_s(k, t), Eq. (24). NSE combines the high energy resolution from QNS with
the high intensity of a beam which is only moderately monochromatic. In NSE
the velocity change of neutrons after scattering by a sample is measured by
comparing the Larmor precession in known magnetic fields before and after the
scattering. This comparison is made for each neutron individually; thus the
resolution of the velocity change can be much better than that corresponding to the
width of the incident beam.

For a review of the QMS method in diffusion studies see, e.g., Vogl (1990, 
1996) and Vogl and Sepiol (2005); for QNS see Springer (1998) and Hempelmann 
(2000) and references therein. 


13.1.3.3.2 Nuclear Resonant Scattering of Synchrotron Radiation 

The theoretical principles of nuclear resonant scattering (NRS) of Mössbauer radiation
were worked out just after the discovery of the Mössbauer effect but until
Ruby (1974) suggested that synchrotron radiation (SR) could be used for a reso- 
nant excitation of nuclei, it was not considered for applications. Only 11 years 
later Gerdau et al. (1985) observed the resonant effect with X-rays for the first 
time. The next step was the experimental observation of the time structure of 
the scattered SR in the forward direction, which, primarily, opened the vast field 
of hyperfine structure for investigation. Finally, Smirnov and Kohn (1995) (also 
Kohn and Smirnov 1998) proposed a theory of NRS in the presence of diffusive 
motion of nuclei. 

The NRS technique combines the properties of a small and intense synchrotron
beam with the spatial and energetic resolution of conventional Mössbauer
spectroscopy and can be regarded as its time-based analogue (Mössbauer
spectroscopy in the time domain). The first experiment showing the applicability
of SR to diffusion studies was performed at the synchrotron radiation source 
ESRF in Grenoble (Sepiol et al. 1996). NRS allows the determination of the diffu- 
sion mechanism on an atomic scale in space and time; see Vogl and Sepiol 
(1999a) for a systematic discussion of theoretical and experimental aspects of 
NRS. 

The principle of NRS is based on the unique sharpness of the nuclear levels (on
the order of 10⁻⁹ eV) if the scattering process is slow. The enormous brilliance of
SR produced by an undulator specially designed for the energy required to excite
the resonant level of a Mössbauer atom (14.4 keV for the ⁵⁷Fe isotope) allows the
extraction of a very narrow band in the range of a few meV from the energy
spectrum. This is achieved with so-called nested monochromators that are especially
designed for NRS and which have been the object of intensive work of X-ray
optics groups over the last decade. To collect photon counts with good time
resolution an avalanche photodiode is usually used as a detector. ⁵⁷Fe isotope nuclei are
excited by synchrotron radiation tuned to the resonance energy. In turn these
excited states decay with their characteristic lifetime modulated by coherence
effects. Due to the pulsed time structure of synchrotron radiation, the
discrimination between electronic (fast) and nuclear (slow) scattering is quite simple.

The most important difference between QMS, QNS, and NRS is that NRS
works in the time domain, whereas the first two operate in the energy domain.
The principal idea of NRS is in the strict sense very similar to that of both
quasielastic methods: the extreme coherence of the synchrotron radiation in the forward
direction after nuclear resonance absorption in the sample is destroyed by
diffusion, which leads to a faster decay of the scattered intensity with respect to an
undisturbed scattering process. From this “diffusionally accelerated” decay,
details of the diffusion process can be derived from the angular dependence of
the decay as a function of the X-ray wave vector. Differences between NRS
and QMS/QNS techniques consist in the mathematical handling of the
intermediate scattering function I_s(k, t) [Eq. (24)] obtained from the explicit jump
diffusion model, which must, however, be transformed twice to provide an
experimentally measurable quantity. These two transformations, I_s(k, t) → φ(k, ω) →
I_FS(k, t), are set out in Eqs. (30).

\[
\varphi(\mathbf{k}, \omega) = \int_{0}^{\infty} dt\, \exp\left[ i\omega t - \frac{t}{2\tau_0} \right] I_s(\mathbf{k}, t) \tag{30}
\]


I_FS(k, t) is the intensity of forward scattered radiation as a function of time and
the auxiliary function φ(k, ω) is called the universal resonance function; it is calculated
by a time-to-energy Laplace transformation of the intermediate scattering
function. Here L is the effective sample thickness, which is proportional to the
temperature-dependent Debye-Waller factor and to the content of the resonant isotope ⁵⁷Fe.
In the thin-sample approximation Eqs. (30) can be solved analytically, e.g., for
the Bravais lattice with the linewidth Γ(k) from Eq. (25). The result is Eq. (31),
which means that the decay rate, i.e., the slope of the logarithm of the intensity,
increases linearly with the diffusional broadening Γ(k) as measured in classical QMS.


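\[
I_{FS}(\mathbf{k}, t) \propto L^{2} \exp\left[ -\left( \frac{1}{\tau_0} + \frac{\Gamma(\mathbf{k})}{\hbar} \right) t \right] \tag{31}
\]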
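
The thin-sample result can be illustrated with a small numerical sketch. It assumes that the universal resonance function is the one-sided (Laplace-type) transform of exp(−t/2τ₀)·I_s(k, t) and that I_s(k, t) = exp(−Γ(k)t/2ħ), which follows from Eqs. (24) and (25); the exact prefactors of the original formulas are not reproduced. Time is measured in units of τ₀ and the broadening in units of ħ/τ₀, and all names are hypothetical.

```python
import cmath


def resonance_function_numeric(omega, width, n=20000):
    """Trapezoidal estimate of int_0^inf dt exp(i*omega*t - t/2) * I_s(t).

    Units: t in tau_0, omega in 1/tau_0, width = Gamma * tau_0 / hbar,
    so that I_s(t) = exp(-width * t / 2) (assumed Bravais-lattice form).
    """
    rate = 0.5 * (1.0 + width)   # decay rate of the scattered amplitude
    t_max = 40.0 / rate          # integrand has dropped to exp(-40) here
    h = t_max / n
    exponent = 1j * omega - rate
    total = 0.5 * (1.0 + cmath.exp(exponent * t_max))
    for i in range(1, n):
        total += cmath.exp(exponent * i * h)
    return total * h


def resonance_function_closed(omega, width):
    """Analytic result of the same integral: a single complex Lorentzian pole."""
    return 1.0 / (0.5 * (1.0 + width) - 1j * omega)


def thin_sample_decay_rate(width):
    """Decay rate of the forward-scattered intensity in units of 1/tau_0."""
    return 1.0 + width


if __name__ == "__main__":
    for width in (0.0, 0.5, 2.0):
        numeric = resonance_function_numeric(1.0, width)
        closed = resonance_function_closed(1.0, width)
        print(width, abs(numeric - closed), thin_sample_decay_rate(width))
```

Under these assumptions the intensity decays with the natural rate 1/τ₀ increased by Γ(k)/ħ, so a plot of the measured decay rate against crystal orientation maps out Γ(k) directly.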


One should note that Eq. (31) is correct only in the thin-sample approximation, 
due to the much more important role of the effective sample thickness in NRS 
than in QMS or QNS. Usually Eqs. (30) can be calculated only numerically, but 
if the sample is enriched in the ⁵⁷Fe isotope, i.e., thick for X-rays, one can obtain
very high count rates and much shorter measuring times than the quasielastic 
methods from Section 13.1.3.3.1. 

While the first studies of diffusion with SR were performed in bulk materials,
the field where the unique properties of SR are truly exploited is the application
of NRS to surface studies. These measurements are carried out in
grazing-incidence condition, i.e., the technique of NRS and grazing-incidence reflection
are combined for studying the structure, the hyperfine parameters, and the
dynamics of thin films (Röhlsberger 1999). It is known that NRS in
grazing-incidence geometry provides depth selectivity, so it can also be used for
investigation of diffusion phenomena on the surface or in near-surface regions of metallic
films containing iron (or other Mössbauer isotopes). Measurements in the
grazing-incidence geometry can be performed only if (a) the beam divergence is
very low and (b) the energetic resolution is in the milli-electronvolt range; both
conditions are very well fulfilled by current X-ray sources.

Two features make grazing-incidence NRS useful for studies of surface
dynamics: (a) the scattering intensity is proportional to the square of the number of
resonant nuclei (i.e., the effective thickness) L² [cf. Eq. (31)]; (b) the thickness scales
as 1/α, α being the very small angle of incidence (usually a few milliradians). The
factor of 1/α increasing the effective layer thickness allows measurements of very
thin layers or even sub-monolayers containing resonant atoms (Sladecek et al.
2004).


13.1.3.3.3 Pure Metals and Dilute Alloys 

The first investigations of the diffusion mechanism were restricted to simple
Bravais systems. Such a system is, for instance, bcc β-titanium, where self-diffusion
of titanium atoms is among the fastest self-diffusion processes in metals
and, moreover, titanium is an excellent incoherent scatterer. The question of the
elementary diffusion jump was settled definitively by the use of QNS (Vogl et al.
1989) and it was not very surprising that the dominant diffusion mechanism
was a jump into a nearest-neighbor (NN) vacancy. However, no other diffusion
mechanism was found which could explain anomalous self-diffusion in β-Ti or
in other bcc metals such as β-Zr or β-Hf. Anomalous effects in bcc metals are
not only extraordinarily fast self-diffusion but also curvature of Arrhenius plots.
These effects were finally explained by phonons. At elevated temperatures the
phonon spectrum of titanium contains some very soft phonon modes
corresponding to large vibration amplitudes “pushing” Ti atoms to vacant NN sites
and in this way considerably lowering the migration energies. Thus the high
diffusivity in β-titanium could be explained.

Diffusion of dilute ⁵⁷Fe atoms in fcc single crystals of Al and Cu has been
investigated by QMS (Mantl et al. 1983; Steinmetz et al. 1986). In this case the
situation is not so simple as in β-titanium because iron atoms are impurities in the
host lattice, attracting vacancies via lattice strain. The extended theory of
impurity-atom diffusion via bound vacancies elaborated by Le Claire (1978) could explain
the broadening of the linewidth measured in different directions of the Al and Cu
single crystals.⁵⁾ Diffusion coefficients obtained from Eq. (29) agree well with the
tracer diffusion data, supporting the validity of the method.


5) Samples in these measurements were sources of radiation resulting from the
decay of ⁵⁷Co as a precursor of ⁵⁷Fe. This method, called emission Mössbauer
spectroscopy, uses the same theoretical description by Eqs. (25) and (26).

13.1.3.3.4 Ordered Alloys


B2 alloys (CsCl structure)

B2 structures are the simplest intermetallic alloy structures; nevertheless
they are extremely interesting because NN jumps in these lattices
always lead to antistructure sites and therefore should be energetically
disfavored. The diffusion mechanism in B2 lattices was a subject of vigorous discussion
in the 1990s (Fähnle et al. 1999; Wolff et al. 1999; Frank et al. 2001; Bester et al.
2002). The most intriguing question for a well-ordered B2 structure is whether
diffusing atoms jump to NN sites, or whether they perform jumps to, e.g.,
second- or third-NN sites belonging to their own sublattice. Quasielastic measurements
on selected intermetallic alloys contributed substantially to the understanding of
atomic jumps in B2 lattices.

Fe₁₋ₓAlₓ alloys crystallize in the B2 structure within a wide homogeneity range
on the iron-rich side. The data obtained by measuring iron diffusion in
well-ordered FeAl by the QMS (Vogl and Sepiol 1994; Feldwisch et al. 1995) and NRS
(Vogl et al. 1998) methods could be fitted only with a superposition of the
functions from Eq. (26) corresponding to [100] and [110] jumps in the ratio of
1.9(±0.1):1. This is a far from obvious result for the Fe atoms, indicating a priority
of effective jumps to third-NN sites over second-NN sites. Fe65Al35, which is still
B2 ordered but, owing to its composition, exhibits a considerable amount of
disorder, provided quite different spectra because of the increased number of excess
iron atoms. The spectra of this alloy clearly consist of two Lorentzian lines (Feldwisch
et al. 1995), which is a sure sign that the lattice sites visited by the iron atom do
not belong to a Bravais lattice. Finally, for the composition close to Fe3Al (see Fig.
13.4), the diffusivity of iron can be fitted very well by simple jumps on the bcc lattice
according to Eq. (27). An atomistic diffusion model based on these results is as follows:
near the ideal B2 composition FeAl, iron atoms diffuse via NN jumps to
antistructure sites with a remarkably short residence time on the aluminum
sublattice. With increasing iron content the residence time of iron atoms on the
aluminum sublattice increases, and in “disordered” Fe3Al iron diffuses on a Bravais
lattice.

A Monte Carlo simulation with vacancy exchange exclusively with NN atoms, 
but also with an interaction between the vacancy and the atoms, fitted experimen- 
tal results perfectly (Weinkamer et al. 1999). The interpretation is as follows: the 
vacancy migrates through the lattice, destroying and restoring the lattice order. It 
interacts particularly with defects just created by its own movement, which causes 
different future paths of the vacancy to be energetically unequal. In the particular
FeAl alloy, the vacancy prefers sequences of NN jumps that result in
effective jumps of iron atoms to third-nearest-neighbor sites ([110] jumps). Recent
defect-structure simulations (Bester et al. 2002) confirm the NN jump as the most 
reasonable elementary diffusion jump in B2 FeAl. 
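
How sequences of NN jumps add up to effective longer jumps can be illustrated with a purely geometric sketch. The enumeration below is an illustration only, not the simulation of Weinkamer et al.: it combines two successive NN jumps on the bcc-based B2 lattice with equal weight and classifies the net displacement. The function names are hypothetical, and the vacancy-atom interactions that bias the real jump statistics are deliberately left out.

```python
from collections import Counter
from itertools import product

# Nearest-neighbour jump vectors of the bcc-based B2 lattice, in units of a/2
NN_JUMPS = list(product((-1, 1), repeat=3))


def classify(displacement):
    """Name the displacement family from the number of non-zero components."""
    nonzero = sum(1 for component in displacement if component != 0)
    families = {
        0: "return to start",
        1: "<100> (second NN)",
        2: "<110> (third NN)",
        3: "<111>",
    }
    return families[nonzero]


def two_jump_statistics():
    """Count the net displacements of all 8 x 8 pairs of successive NN jumps."""
    counts = Counter()
    for first, second in product(NN_JUMPS, repeat=2):
        total = tuple(a + b for a, b in zip(first, second))
        counts[classify(total)] += 1
    return counts


if __name__ == "__main__":
    for family, number in sorted(two_jump_statistics().items()):
        print(f"{family:20s} {number:2d} of 64")
```

With equal weights, second-NN and third-NN end points are reached equally often (24 of the 64 sequences each). A measured preference for one of them therefore cannot come from geometry alone and has to reflect the energetics and correlations of the vacancy path.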

The QNS method was applied successfully to the study of Ni diffusion in NiGa 
(Kaisermayr et al. 2000), and of Co in CoGa (Kaisermayr et al. 2001b). In both 
intermetallic alloys only NN jumps were found and the next-nearest-neighbor 
jumps of Ni suggested by Donaldson and Rowlings (1976) could not be found. 
The conclusion is that transition metal (Ni, Co) atoms in NiGa and CoGa jump
via antistructure sites. 


D0₃ alloys (Fe3Si structure) The D0₃ superstructure of Fe3Si is more complicated
than the B2 superstructure. The cubic supercell can be divided into four
sublattices, i.e., three iron sublattices (α1, α2, and γ) and one silicon (β) sublattice.
The lattice constant of these sublattices of the supercell is twice as large as the
lattice constant of the small bcc cells. Mössbauer studies were the first to detect
that, despite the high degree of order in Fe3Si, iron diffusion in this compound
is very fast (Sepiol and Vogl 1993; Vogl and Sepiol 1999b). The most surprising
result was that iron diffusion becomes slower with decreasing order, i.e., below 25
at.% silicon content. This was later confirmed by tracer studies (Gude and Mehrer
1997) and by studies with NRS (Sepiol et al. 1996, 1998). The spectra
measured by QMS and NRS turn out to be comparatively simple. In contrast to the situation
in B2 structures, an iron atom in Fe3Si can diffuse exclusively between iron
sublattices via NN jumps without creating antistructure defects. The diffusion
mechanism turns out to be a rather conventional one: diffusion in Fe3Si is fast owing to
a large concentration of vacancies (Kümmerle et al. 1995) and relatively soft
phonon modes (Randl et al. 1995) favoring low migration energies, and occurs via
NN jumps. Off-stoichiometric Fe-Si alloys contain far fewer vacancies and
thus diffusion is slower. These experimental results have essentially been confirmed
by the recent ab initio simulations of Dennler and Hafner (2006).

Another compound with a D0₃ structure is Ni3Sb. Nickel diffusion in Ni3Sb,
which contains many vacancies, is the fastest ever found in an intermetallic alloy. 
Results obtained with the QNS method (Vogl et al. 1996a) fitted well under the 
assumption that nickel atoms jump between NN sites, while other jump models 
give noticeably worse results. 

Fast diffusion of Sn was investigated in a Cu3Sn sample (Thiess et al. 2003)
with the NRS method, which also supports the nearest-neighbor jump model.


B8 alloys (NiSb structure) Diffusion of nickel in NiSb (Vogl et al. 1993) and of
iron in FeSb (Sladecek et al. 2001) was studied by the QNS and QMS methods,
respectively. The B8 structure of NiSb and FeSb is rather open and it is not
surprising that the atomic jumps are performed via interstitial sites.


13.1.3.3.5 Amorphous Materials 

Diffusion in amorphous metals and alloys (metallic glasses) has been a subject of
lively discussion since the mid-1990s (Faupel et al. 2003). Generally, glasses are
textbook examples of dense random packing systems. The metastability of these
systems is their most interesting aspect, particularly because of their relevance in
living organisms, but also because of their increasing technical importance (Hilzinger
et al. 1999; Buchanan 2002). Their diffusional behavior is responsible for the
metastability, giving rise to various rearrangement processes at moderately
elevated temperatures. Obviously, understanding the diffusion processes in glasses
is of fundamental importance and, moreover, plays a significant role in their
production. The classical tools for measuring diffusion and dynamics in
glasses were neutron spectroscopy (Springer 1972, 1998) and photon correlation
spectroscopy (Pusey 1991). Optical methods are, however, limited to studies of
nonopaque media at very low wave-vector transfers. For a long time it was not
possible to study long-range atomic transport in metallic glasses because of
their strong tendency to crystallize when heated through the glass transition.
Only recently has the discovery of multicomponent bulk metallic glasses exhibiting
a much greater resistance to crystallization (Inoue et al. 1990; Peker and Johnson
1993) triggered intensive investigations of their dynamical behavior. Since that
time, tracer diffusion measurements have been feasible (Knorr et al.
1999). By means of broadband inelastic neutron scattering, the vibrational
properties and relaxational motion in different metallic glasses could be studied (Meyer
et al. 1996b, 1999; Meyer 2002). The energy resolution is, however, always the
“Achilles heel” of neutron scattering methods. Application of the Mössbauer effect
provides some limited access to a much higher energy resolution, of the order of
nano-electronvolts, but in practice this is limited to the only Mössbauer isotope
that can be reasonably measured, i.e., the ⁵⁷Fe isotope. Fast relaxation in a
metastable metallic melt has been studied with this technique (Meyer 1996a).
Numerous Mössbauer spectroscopy studies have been performed on amorphous or
nanocrystalline soft- and hard-magnetic alloys (see, e.g., Miglierini and Grenèche 1999,
2003; Stankov et al. 2005). Even more frequent have been studies of dynamics in
organic glasses, e.g., by Lichtenegger (1999) and Parak (2003a,b).

In order to elucidate the role of high-resolution scattering methods and time- 
resolved diffraction in the studies of amorphous materials, a few illustrative tech- 
niques will be presented. Most of these techniques are brand new and until now 
only feasibility tests have been performed to present their possibilities but, in our 
opinion, their potential for development justifies their presentation in this review. 

The main feature distinguishing amorphous from crystalline materials is the 
lack of long-range order. This difference, which is sometimes regarded as negligi- 
ble, is, however, the origin of delicate and demanding experimental problems. A 
very basic approach to the calculation of the structure factor will be presented 
below. 

Description of atomic movements by the self-correlation function $G_s(r, t)$ is
actually only a special case (usually conditional on the measuring method) of the
more general description by the pair correlation function $G(r, t)$ (see the
beginning of Section 13.1.3.3). An expression for the coherent scattering function on Bravais
lattices has been given by Ross and Wilson (1978); this problem is also discussed
by Springer (1972, 1998). One can split the pair correlation function (Kaisermayr
2001; Kaisermayr et al. 2001c) into a time-dependent part $G'(r, t)$ and a static
part, as in Eq. (32), where $r_i$ is the ith lattice site, N is the total number of lattice
sites, and c is the concentration of the scattering atoms on the Bravais lattice.


$$G(r, t) = G'(r, t) + c\sum_{i=0}^{N}\delta(r - r_i) \qquad (32)$$
Inserting this equation into Eq. (22) and solving by Fourier transformation with
the boundary condition $G(r, 0) = (1 - c)\,\delta(r) + c\sum_{i=0}^{N}\delta(r - r_i)$, the coherent
scattering function is obtained [Eq. (33)].


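$$S(Q, \omega) = c(1 - c)\,S_{\mathrm{inc}}(Q, \omega) + cN\,\delta(\omega)\,\delta(Q - G) \qquad (33)$$

Here $S_{\mathrm{inc}}(Q, \omega)$ denotes the Lorentzian scattering function of Eq. (25), whose denominator contains $(\hbar\omega)^2$.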


Equation (33) describes coherent scattering on Bravais lattices (Kaisermayr et al.
2001c). The quasielastic term with a Lorentzian lineshape is, apart from the factor
c(1 − c), identical with the scattering function calculated from the self-correlation
function $G_s(r, t)$ for incoherent scattering, Eq. (25). The purely elastic term
describes scattering in Bragg directions, where Q = G. The quasielastic term describes
isotropic diffuse scattering (note the prefactor c(1 − c), characteristic of
Laue diffuse scattering (Schönfeld 1999)). Note that Eq. (33) is derived for an
unrealistic lattice occupied by only one type of scattering atom. A derivation for the case of
a non-Bravais lattice with one or more scattering atoms is also possible and was
obtained for the first time by Kaisermayr (2001) and Kaisermayr et al. (2001c).
The general conclusions are similar to those for Bravais lattices: no quasielastic
broadening, apart from a negligible contribution from diffuse scattering, can
be expected in the Bragg reflections, irrespective of whether these are
fundamental or superstructure peaks. If the lattice is occupied by more than one
scattering element, the different quasielastic parts are obtained by simple
summation of all the elementary contributions. This prediction was confirmed
experimentally by measuring diffusion in the B2-ordered intermetallic phase Co60Ga40
using time-domain interferometry of synchrotron radiation (Kaisermayr et al.
2001a).
From Eq. (33) and from the equivalent derivation for non-Bravais lattices
(Kaisermayr et al. 2001c) the following conclusions can be drawn.
• The coherent scattering function is elastic for scattering in
  fundamental and in superstructure Bragg directions.
• In the regions between the reciprocal lattice points,
  quasielastic diffuse scattering can be observed, i.e., scattering
  due to lattice disorder (Laue diffuse scattering). The
  scattering function S(Q, ω) is calculated in the same way as
  the scattering function for incoherent scattering, i.e., it is
  calculated from the self-correlation function via Eqs. (23)
  and (25).
• The maximum intensity of the quasielastic component is in
  the superstructure lattice direction of the non-Bravais lattice;
  however, it is not possible to measure quasielastic broadening
  in these positions since the diffuse intensity will be
  completely hidden under the elastic Bragg line.
• Observation of the diffuse scattering is difficult because of the
  very low intensities, and thus large detectors are necessary.
  Higher intensities are expected for scattering from samples
  without lattice structure, i.e., from glassy samples. Such a
  glassy sample was measured in the first time-domain
  interferometry experiment of Baron et al. (1997).


It is well known that the energy resolution of X-rays is limited by the laws of
electronic Bragg diffraction (see, e.g., Chumakov et al. 1997; Toellner et al. 1997).
In order to improve the resolution into the nano-electronvolt range, elastic
nuclear resonant scattering has to be applied (see Section 13.1.3.3.2). An ideal
instrument for inelastic scattering with nano-electronvolt energy resolution would
evidently be a resonant triple-axis spectrometer (Burkel 2000). Re-emission
from the nuclear state excited by the synchrotron radiation pulse defines
the energy width, and a nuclear absorber is used as analyzer to study the behavior
of a sample containing no resonant isotopes. A pilot experiment of this kind
was performed by Baron et al. (1997) and is called time-domain interferometry. A
reference interference pattern is temporally modulated by the scattering from the
sample, providing a quantum-beat pattern that varies as a function of the sample
temperature and the momentum transfer. Deconvolution of the beat pattern
directly determines the scattering function S(q, t) and thus the full information
about the dynamics in the sample. The feasibility test was performed on the
glass-forming liquid glycerol, covering the time scale from about 15 ns to about 200 ns,
corresponding to 5-50 neV on the energy scale. Extending the method to the
dynamics of amorphous solid systems via quasielastic scattering of X-rays
is limited essentially only by the beam intensity, which will be increased
significantly in the future by the application of free-electron lasers.
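
The correspondence between the time window and the energy scale quoted above can be checked with a short calculation. The sketch below assumes the simple convention E = ħ/t, which is an assumption made here; the convention used in the original experiment is not stated in the text. The function name is hypothetical.

```python
HBAR_EV_S = 6.582119569e-16  # reduced Planck constant in eV s


def time_to_energy_nev(time_ns):
    """Energy scale hbar / t in nano-electronvolts for a time t in nanoseconds."""
    time_s = time_ns * 1e-9
    return HBAR_EV_S / time_s * 1e9


if __name__ == "__main__":
    for t_ns in (15.0, 200.0):
        print(f"t = {t_ns:6.1f} ns  ->  E = {time_to_energy_nev(t_ns):5.1f} neV")
```

Under this convention the window of 15 ns to 200 ns maps onto roughly 44 neV down to about 3 neV, which agrees in order of magnitude with the 5-50 neV range given in the text.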

With increasing intensity of the beamlines, another method, SR-based
perturbed angular correlation (SRPAC) (Baron et al. 1996), has now become
accessible for applications in glassy states. SRPAC is a scattering variant of
time-differential perturbed angular correlation, a method that uses special radioactive
isotopes with a γ-γ cascade to characterize the lattice location of the probe atom
via the precession of the probe nuclei, which is proportional to the internal nuclear
hyperfine field interactions; see, e.g., Butz (1996).

In SRPAC the intermediate nuclear level is not excited via a cascade originating 
from the decay of a radioactive parent, but from the ground state during resonant 
excitation of a ⁵⁷Fe resonant atom by SR. Directional selection and timing are
obtained in SRPAC from the direction and the timing of the short incident SR pulse.
The intermediate nuclear level split by magnetic dipole or electric quadrupole
interaction allows one to investigate the hyperfine interactions and
rotational dynamics of the nuclear probe atoms.

One of the first experiments with the SRPAC method was performed on 
the rotational dynamics above the glass transition up to the liquid state of the 
molecular glass former dibutyl phthalate (DBP) with ferrocene molecules as 
probes (Sergueev et al. 2006). In the regime of slow relaxation, the damping of 
the hyperfine beats can also be observed by nuclear forward scattering (see
Section 13.1.3.3.2) on ⁵⁷Fe-enriched ferrocene molecules. At higher temperatures,
however, in the regime of fast relaxation, only SRPAC spectra can be measured 
which approach the natural decay in a characteristic way. The probe molecules re- 
produce the dynamics of the glass former and enable extraction of the pure trans- 
lational relaxation rate (Asthalter et al. 2006). 

The potential of SRPAC relies on the fact that single-nucleus scattering 
depends neither on recoil-free emission and absorption, nor on translational 
motion. Therefore, SRPAC allows one to continue Mössbauer investigations of 
hyperfine interactions and rotational dynamics into regions where the Lamb- 
Mössbauer factor vanishes, i.e., in viscous glasses and liquids. It is expected that 
SRPAC using the ⁵⁷Fe resonance can cover a dynamic range of relaxation times
of five orders of magnitude, from ~10 ps up to 1 µs (Blachowski and
Ruebenbauer 2004).

The brilliance of third-generation synchrotron sources is at present still
insufficient for the measurement of diffusion in metallic amorphous materials. It
is, however, important to take this method into account because of its prospective
development once the next generation of synchrotron sources becomes available,
especially the free-electron laser sources, whose brilliance is higher by up to ten
orders of magnitude (cf. Section 13.1.4.1).


13.1.4 Time-Resolved Scattering


High temporal resolution is of increasing experimental interest in order to study 
the kinetics of nonequilibrium alloys or to investigate equilibrium dynamics. 
Real-time investigations have natural advantages over post-facto studies. In real- 
time studies, there is a complete temporal record, ensuring that no kinetic fea- 
tures of the evolution are missed. There is no concern about sample changes in 
the period between the end of processing and the post-facto measurement. This 
is of particular concern if the sample evolution occurs at a high temperature 
where relaxation can be relatively rapid. Finally, in building a quantitative kinetic 
record of the sample evolution for a given set of experimental parameters, real- 
time studies do not suffer from run-to-run processing variations that can occur 
if the kinetic evolution is instead reconstructed through post-facto measurements 
made on samples from different runs. 

Because of the complexity of many growth and processing geometries, it is 
often difficult to implement real-space probes (e.g., electron microscopy or atomic 
force microscopy) for real-time study. Instead, scattering techniques are naturally 
applicable and high time resolution studies are therefore of high interest. The 
definition of what constitutes “high” resolution in this context is open to interpre- 
tation. The important time scale in a given material is, of course, set by the phys- 
ical process of interest. In an alloy undergoing atomic ordering, phase separation, 
or crystallization, diffusion is necessary and sets the fundamental time scale. 
Since atomic diffusivities are thermally activated, this fundamental time scale 


749 


750 


13 High-Resolution Experimental Methods 


can easily vary between times of the nanosecond order near the melting point to 
times greater than the age of the universe at very low temperatures! On the other 
hand, if nonequilibrium lattice vibrations are of interest, then the appropriate 
time scale is femtoseconds. Given the widely disparate time scales relevant to 
different materials, here we will take a relatively broad view of what constitutes 
“high resolution” in the time domain.
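
The enormous spread of diffusive time scales follows directly from thermal activation. The sketch below evaluates an Arrhenius law for the mean time between atomic jumps; the activation energy of 1.0 eV and the attempt time of 0.1 ps are illustrative assumptions and are not taken from any alloy discussed here.

```python
import math

K_B_EV = 8.617333262e-5      # Boltzmann constant in eV/K
AGE_OF_UNIVERSE_S = 4.35e17  # about 13.8 billion years in seconds


def jump_time(temperature_k, activation_ev=1.0, attempt_time_s=1e-13):
    """Mean time between thermally activated jumps, tau = tau0 * exp(E / kT).

    activation_ev and attempt_time_s are hypothetical example values.
    """
    return attempt_time_s * math.exp(activation_ev / (K_B_EV * temperature_k))


if __name__ == "__main__":
    for temperature in (1300.0, 600.0, 300.0, 150.0):
        print(f"T = {temperature:6.0f} K  ->  tau = {jump_time(temperature):.2e} s")
```

With these assumed parameters the jump time is slightly below a nanosecond at 1300 K and exceeds the age of the universe at 150 K, in line with the range described above.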

The current state-of-the-art in traditional scattering studies of metallic alloys is 
discussed first below, including technical possibilities and the type of alloy science 
being performed. 


13.1.4.1 Technical Capabilities 

At least three things are necessary in order to perform time-resolved studies suc- 
cessfully: a source of sufficient intensity, an appropriate physical stimulation to 
initiate the desired alloy response, and appropriate detectors. We begin by briefly 
discussing source technology, since that has been the most rapid to develop. 
While time-resolved neutron scattering studies continue to be important, the con- 
tinuing increase in the X-ray intensity available from synchrotron sources has 
been a primary factor in facilitating the development of time-resolved studies 
since the mid-1980s. The typical X-ray intensity available in the hard X-ray regime
at a third-generation source is of the order of 10¹⁴ photons s⁻¹. The scattering
cross-section plays a pivotal role in determining the time resolution accessible with
reasonable counting statistics. The diffuse scattering from an amorphous or
disordered alloy, or from an alloy surface, is so weak that times shorter than a
millisecond are not usually accessible; indeed, many such studies more
naturally operate on the time scale of seconds. On the other hand, the evolution of an intense
Bragg peak can be monitored on the nanosecond or shorter time scale. However, 
most time-resolved studies of metallic alloy evolution have been on the slower 
time scales. 
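
The link between scattered intensity and accessible time resolution can be made concrete with Poisson counting statistics. The sketch below is a rough estimate under stated assumptions: the fraction of the incident beam that reaches the detector (1e-7 here) is a hypothetical number chosen for illustration, and the function names are not from the source.

```python
import math


def relative_error(counts):
    """Relative statistical error of a Poisson-distributed count, 1 / sqrt(N)."""
    return 1.0 / math.sqrt(counts)


def exposure_time(incident_flux, scattered_fraction, counts):
    """Time in seconds needed to collect the requested number of counts.

    incident_flux is in photons per second; scattered_fraction is the
    (assumed) fraction of incident photons registered by the detector.
    """
    return counts / (incident_flux * scattered_fraction)


if __name__ == "__main__":
    flux = 1e14        # photons per second, as quoted for a third-generation source
    fraction = 1e-7    # hypothetical diffuse-scattering yield into the detector
    counts = 1e4       # gives a 1 % statistical error
    print(f"relative error: {relative_error(counts):.3f}")
    print(f"exposure time : {exposure_time(flux, fraction, counts):.1e} s")
```

For this assumed yield a 1 % measurement takes about a millisecond per time frame; a weaker diffuse signal lengthens the frame proportionally, whereas an intense Bragg peak shortens it by orders of magnitude.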

Due to difficulties in temporally resolving X-ray scattering patterns on fast time 
scales, as well as the desire to synchronize experiments with a “pump” process 
starting a transformation, a pulsed source structure is most often employed
for the highest time resolution experiments. The bunch structure of electrons/ 
positrons in storage rings is one available source of short X-ray pulses. Typical 
X-ray pulses from a single electron/positron bunch in a third-generation storage 
ring have a duration of ~100 ps. In cases where it is desirable to extract pulses of 
shorter duration, it has recently been shown that the X-ray beam intensity can be 
modulated on the subpicosecond time scale using acoustic pulses induced by a 
fast optical laser pulse in a transmitting crystal (DeCamp et al. 2001). New 
fourth-generation sources are already producing intense hard X-ray pulses in the 
femtosecond regime. The subpicosecond pulse source (SPPS) at the Stanford Lin- 
ear Accelerator (SLAC) in Stanford, California, uses chirped electron bunches, 
energy-dispersive magnetic chicanes and undulators to produce 100 fs X-ray 
pulses. Toward 2010, it is expected that 100 fs coherent hard X-ray pulses with
~10¹² photons per pulse will become available from the X-ray free-electron lasers
(XFELs) being constructed at the Linac Coherent Light Source (LCLS) project in
California, the European XFEL Project at the Deutsches Elektronen-Synchrotron
(DESY) in Hamburg, and the SPring-8 XFEL project in Japan. At this point,
few studies of alloys have been contemplated on such fast time scales. Most 
studies on semiconductors so far have focused on phonon behavior and energy 
relaxation from excited electrons to phonons. 

Laser-induced plasmas are an alternative to accelerator-based pulse sources and 
are also undergoing rapid development. Because these sources emit isotropically, 
their intrinsic brightness is lower than that of current accelerator-based sources. 
However, with focusing, plasma X-ray sources have now achieved 10* photons 
per 100 fs pulse. This is sufficient for studies of Bragg peak changes on these 
time scales and these X-ray beams have proven to be a valuable tool for ultrafast 
kinetics studies of phonon dynamics in semiconductors (Bargheer et al. 2004). 

In order to observe the scattering evolution from a material in a meaningful 
manner, it is also usually necessary that the entire illuminated volume is evolving 
uniformly. This requires that the state of the sample be changed suddenly on a 
time scale shorter than that of the physical processes of interest. For the fastest 
processes, such as nonequilibrium laser-induced melting, the only pump avail- 
able is a fast laser pulse. Slower processes, however, can be induced by shock 
loading, sudden changes in electric or magnetic field, or sudden changes in tem- 
perature or pressure. Typically metallic alloy phases are not particularly sensitive 
to pressure (unlike the situation for soft condensed matter), so inducing phase 
transitions by pressure jumps is usually not feasible. Very fast upward jumps in 
temperature are possible with laser heating. Relatively rapid drops in temperature 
are also possible if only a surface layer is heated so that the heat can diffuse back 
through the colder part of the sample. For macroscopic samples, limited thermal 
conductivity can increase cooling times to milliseconds and beyond. Perhaps the 
fastest controllable temperature changes in macroscopic samples are achieved 
with resistive heating of thin ribbons (Brauer et al. 1990). 
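
The cooling times mentioned above can be estimated from the thermal diffusion time t ≈ L²/α across a layer of thickness L. The sketch below uses a thermal diffusivity of 1e-5 m² s⁻¹, an assumed round value of the order typical for metallic alloys and not a figure from the text.

```python
def thermal_diffusion_time(length_m, diffusivity_m2_s=1e-5):
    """Order-of-magnitude time for heat to diffuse over length_m, t = L**2 / alpha.

    diffusivity_m2_s is a hypothetical representative value for a metallic alloy.
    """
    return length_m ** 2 / diffusivity_m2_s


if __name__ == "__main__":
    for label, length in (("1 um heated surface layer", 1e-6),
                          ("100 um ribbon", 1e-4),
                          ("1 mm bulk sample", 1e-3)):
        print(f"{label:26s} -> {thermal_diffusion_time(length):.1e} s")
```

With this assumption a micrometer-thick heated layer cools within about 100 ns, while a 100 µm sample already needs about a millisecond, which illustrates why surface heating and thin ribbons give the fastest temperature changes.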

A final requirement for successful time-resolved scattering studies is the avail- 
ability of appropriate detectors. At the slowest end, CCD-based area detectors have 
traditionally had read times of the order of 1 s, though faster read times are pos- 
sible. It has been shown that in the high-energy X-ray study of diffuse scattering 
from an alloy, an area detector is a very powerful tool for quickly recording the 
scattering over a significant portion of a plane (Reichert et al. 2005). For the fu- 
ture, this offers attractive options for the time-resolved study of diffuse scattering 
evolution. For faster times, silicon avalanche photodiode detectors with nanosec- 
ond resolution are becoming more widely available — these facilitate the use of 
pump-probe studies, though this technique has not yet been widely applied to 
the study of metallic alloys. Streak cameras, in which the recording medium is 
physically moved rapidly behind an aperture during exposure, give the highest 
time resolution accessible and are now capable of subpicosecond resolution. 


13.1.4.2 Time-Resolved Studies — Examples 
Time-resolved scattering has been used to examine a number of different phe- 
nomena in metallic alloys, but studies of diffusive phase transition kinetics have 
been particularly widespread because of their technological importance,
significant theoretical interest, and (sometimes) accessible time scales. In examining
“phase ordering” processes of atomic ordering or phase separation, a distinction 
is usually made between the “early-stage” transformation kinetics, during which 
the material transforms locally to the new phase, and the “late-stage” transforma- 
tion kinetics, during which domains of the ordered phase coarsen to reduce their 
interfacial energy. The late-stage coarsening process is typically much slower than 
the early-stage processes and has been studied widely. In this case, theory and 
simulation predict that the average domain size grows with time as $t^{1/n}$, where
n = 2 for a nonconserved order parameter (e.g., atomic ordering) and n = 3 for a
conserved order parameter (e.g., phase separation). Theory also predicts that the
evolving structure factor obeys dynamic scaling, $S(q, t) = t^{d/n} F(q\,t^{1/n})$,
where d is the spatial dimensionality and F(x) is a scaling function. These general predictions agree well in some
cases with scattering studies of late-stage coarsening kinetics (Shannon et al. 
1992), but other alloys appear to show more complex behavior (Mazumder et al. 
1999); it is not yet clear whether this is due to elastic interactions or other factors. 
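
The two late-stage predictions, the power-law growth of the domain size and the dynamic scaling of the structure factor, can be tested on data in a few lines. The sketch below works on synthetic data only: the Gaussian scaling function, the prefactor, and the function names are hypothetical choices made for illustration.

```python
import math


def fit_growth_exponent(times, sizes):
    """Least-squares slope of log(size) against log(time), i.e. the exponent 1/n."""
    xs = [math.log(t) for t in times]
    ys = [math.log(s) for s in sizes]
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    numerator = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    denominator = sum((x - x_mean) ** 2 for x in xs)
    return numerator / denominator


def scaled_structure_factor(q, t, n, d=3, scaling_function=None):
    """S(q, t) = t**(d/n) * F(q * t**(1/n)); F defaults to a hypothetical Gaussian."""
    if scaling_function is None:
        scaling_function = lambda x: math.exp(-x * x)
    return t ** (d / n) * scaling_function(q * t ** (1.0 / n))


# Synthetic domain sizes for a nonconserved order parameter (n = 2)
times = [10.0, 30.0, 100.0, 300.0, 1000.0]
sizes = [2.0 * t ** 0.5 for t in times]
exponent = fit_growth_exponent(times, sizes)

if __name__ == "__main__":
    print(f"fitted growth exponent 1/n = {exponent:.3f}")
    # Scaling collapse: S / t**(d/n) plotted against q * t**(1/n) is time independent
    x = 0.7
    for t in (10.0, 100.0):
        q = x / t ** 0.5
        print(t, scaled_structure_factor(q, t, n=2) / t ** 1.5)
```

In practice the fitted exponent distinguishes n = 2 from n = 3, and a failure of the rescaled curves to collapse onto one function signals a departure from simple dynamic scaling.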

Fewer experiments have examined in detail the early-stage kinetics. The devel- 
opment of order can proceed by nucleation and growth or by a spinodal (continu- 
ous) process. Following a quench from the disordered phase into the ordered 
phase, fluctuations grow. If the disordered phase is quenched into an unstable re- 
gion of a phase diagram, below a second-order transition line or below a mean- 
field instability (spinodal) line, then some fluctuations are unstable. If the disor- 
dered phase is instead quenched into a metastable region of the phase diagram, 
below a first-order transition line, then most fluctuations will equilibrate to a new 
metastable level. Statistically rare fluctuations will eventually nucleate the ordered 
phase. In some cases, there is a mean-field spinodal, or instability, buried below a 
first-order transition. In such cases, we would expect a crossover in ordering ki- 
netics behavior for disordered samples rapidly quenched to temperatures above 
and below the instability. Ludwig et al. (1988) and Tanaka et al. (1994) have exam- 
ined the equilibration of fluctuations and its relationship to this crossover in the 
case of ordering in the classic Cu-Au alloys. In this case, the initial kinetics is
relaxational in nature, even for quenches to temperatures below the mean-field 
instability point. The simplest theory of ordering or decomposition kinetics in the 
unstable regime is the linear theory of Cahn, Hilliard, Khachaturian, and Cook 
(Cahn and Hilliard 1959; Khachaturian 1968; Cook et al. 1969) (cf. Chapters 5, 7,
and 10). It neglects coupling between fluctuations at different wave vectors q and 
predicts that at each wave vector the structure factor should either relax exponen- 
tially (if fluctuations at this wave vector are stable) or grow exponentially (if fluc- 
tuations at this wave vector are unstable). Many of the time-resolved scattering 
studies of early-stage kinetics in alloys have sought to examine to what extent 
the simple linear theory could describe fluctuation equilibration in the disordered 
phase and the growth of fluctuations during the early stages of a spinodal phase 
transformation. It is now generally believed that nonlinear theories are necessary 
to describe early-stage fluctuation growth quantitatively. Among the clearest
experiments testing nonlinear theory are the time-resolved X-ray studies of
early-stage spinodal decomposition in Al-Zn alloys by Mainville et al. (1997) and Hoyt
et al. (1989). For ordering, Livet et al. (2002) have carefully examined the fluctua- 
tion equilibration process experimentally above the critical point, as well as the 
ordering process just below. 
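
The exponential relaxation or growth predicted by the linear theory can be sketched for the conserved case (decomposition) in the standard Cahn-Hilliard-Cook form, with amplification rate R(q) = −M q² (f″ + 2κq²) and S(q, t) = S∞(q) + [S(q, 0) − S∞(q)] exp[2R(q)t], where S∞(q) = k_BT/(f″ + 2κq²). The symbols M (mobility), f″ (curvature of the free energy), and κ (gradient-energy coefficient) are not defined in this section and are introduced here as assumptions; the numerical values are arbitrary.

```python
import math


def amplification_rate(q, mobility, f2, kappa):
    """Linear-theory rate R(q) = -M q^2 (f'' + 2 kappa q^2) for a conserved field."""
    return -mobility * q ** 2 * (f2 + 2.0 * kappa * q ** 2)


def critical_wave_vector(f2, kappa):
    """Wave vector below which fluctuations grow; zero if f'' >= 0 (stable)."""
    if f2 >= 0.0:
        return 0.0
    return math.sqrt(-f2 / (2.0 * kappa))


def structure_factor(q, t, s0, mobility, f2, kappa, kt):
    """Cook's solution S(q, t) = S_inf + (S0 - S_inf) * exp(2 R(q) t).

    Not defined exactly at the critical wave vector, where f'' + 2 kappa q^2 = 0.
    """
    stiffness = f2 + 2.0 * kappa * q ** 2
    s_inf = kt / stiffness
    rate = amplification_rate(q, mobility, f2, kappa)
    return s_inf + (s0 - s_inf) * math.exp(2.0 * rate * t)


if __name__ == "__main__":
    params = dict(mobility=1.0, f2=-1.0, kappa=0.5)  # arbitrary units, unstable regime
    q_c = critical_wave_vector(params["f2"], params["kappa"])
    for q in (0.5 * q_c, q_c / math.sqrt(2.0), 1.5 * q_c):
        print(f"q = {q:.3f}  R(q) = {amplification_rate(q, **params):+.4f}")
```

Fluctuations with q below the critical wave vector grow exponentially, with the fastest growth at q_c/√2, while those above it relax exponentially towards S∞(q). Systematic deviations of measured S(q, t) from this form are what motivate the nonlinear theories mentioned above.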

Although most experiments have focused on the development of order from 
disorder, there have also been interesting studies of order relaxation following 
temperature changes within the ordered phase (Park et al. 1992; Park 1996) and 
of the dissolution of order (Okuda et al. 1997). 

While much of the time-resolved work in the field has focused on very funda- 
mental comparisons with statistical mechanics models employing simple pair in- 
teractions or Ginzburg-Landau free energies, most alloys exhibit more complex 
ordering kinetics. Gao and Fultz (1993) have identified the formation of a clear 
precursor phase during ordering in Fe-Al alloys. In other cases, time-resolved 
studies have been able to examine the kinetics in the presence of complex phase 
behavior, with competing phase separation and ordering tendencies, as in Fe-Al 
(Allen et al. 1992), or the presence of long-period superlattice (LPS) phases in 
Cu0.79Pd0.21 (Wang et al. 2005). In the latter case, the satellite peaks associated
with the modulated LPS structure are observed to grow faster initially than the 
superlattice peaks associated with the short-range order (Fig. 13.6a). This is 
apparently due to the presence of an instability to modulated order below the 
first-order transition and can be reproduced in Monte Carlo simulations of the 
ordering process (Fig. 13.6b) using effective pair potentials derived from diffuse 
scattering data, as discussed below in Section 13.1.5.2. 

The agreement with the real-time X-ray scattering data suggests that the real- 
space kinetics of the Monte Carlo simulation (see Fig. 13.7) is reproducing the 
actual growth kinetics reasonably well. Figure 13.7 shows the simulated develop- 
ment of the modulated order in real space. Here the lattice has been broken into 
a set of unit cells, and the dominant sublattice on which order exists in each unit 
cell has been identified. A single plane of unit cells has been selected for display. 
The four possible sublattice orderings on the fcc lattice are denoted by the four 
non-black shades; those unit cells without a clear sublattice ordering are denoted 
by black. Antiphase boundaries (APBs) between degenerate ordered regions can 
be either “conservative” or “nonconservative.” A conservative APB separates two 
ordered regions that differ from each other by a translation vector parallel to the 
plane of the APB interface, while a nonconservative APB separates two ordered 
regions that differ from each other by a translation vector with a component perpendicular
to the APB interface. Because the nearest-neighbor atomic environment
is not changed by the presence of conservative APBs, these are typically lower
in energy than nonconservative APBs, and they are the equilibrium APBs in the
1D LPS structure of Cu0.79Pd0.21. Careful examination of the first frame shows
that, at 10 MCS (Monte Carlo steps), local ordered regions are already forming, 
but there are many nonconservative antiphase relationships between neighboring 
ordered regions. This is consistent with the initial growth primarily of the satel- 
lite peaks in the X-ray data. The second frame (30 MCS) shows that, with passing 
time, the conservative APBs have propagated at the expense of the neighboring 

Fig. 13.6 Kinetics of ordering in a Cu0.79Pd0.21 alloy: a) the growth of
X-ray peak intensities for the satellite (due to modulated order) and
superlattice peaks following a quench to 723 K. The satellite peak
initially grows faster than the superlattice peak. b) Results from Monte
Carlo simulations of the ordering kinetics using effective pair
interactions derived from diffuse X-ray scattering.


regions having nonconservative antiphase relationships. This produces elongated 
domains, typical of the true 1D LPS structure. By the third frame (150 MCS), 
we can clearly see small 1D LPS regions of different variants, and of the same
variant, abutting each other. The kinetics is now slowing considerably. The fourth
frame (600 MCS) largely shows the coarsening effects between different 1D LPS 
regions. A particularly interesting boundary occurs slightly to the left and below 
the center. Here two modulated structures confront each other; they have a hori- 
zontal modulation wave vector and a conservative boundary at their interface. 

Fig. 13.7 The evolution of order in one plane of atoms from Monte
Carlo simulations of ordering in 1D LPS Cu0.79Pd0.21 alloy. Times are in
Monte Carlo steps and each cell has been colored to show on
which of the four possible sublattices local ordering has occurred.


While most studies of ordering and phase separation kinetics have focused on 
bulk phenomena, Reichert et al. (1997, 2001a) have used surface-sensitive time- 
resolved X-ray scattering to examine how ordering kinetics differs at the surface. 
Among other effects, they have found distinct anisotropies in the rate of ordering 
perpendicular to the surface and parallel to it. 

The processes of crystal and quasicrystal formation from metallic glasses have 
also been widely studied with time-resolved X-ray scattering. Studies have partic- 
ularly focused, during crystallization from the glass, on transient phase formation 
(Sutton et al. 1989; Brauer et al. 1992; Bruning et al. 1996) and nanocrystallization
(Antonowicz 2005). Crystallization from the liquid has been studied less,
probably because the nucleation occurs in a very heterogeneous manner and sub- 
sequent local growth is rapid. In an interesting experiment, however, Yasuda et al. 
(2004) were able to examine dendritic growth during uniaxial solidification and 
observe the detachment of fragmented dendrite arms (cf. Chapter 3). 

Though not as well studied as atomic ordering and phase separation, diffusion- 
less transformations in alloys are also of great interest and time-resolved studies 
can offer unique insights. For example, Babu et al. (2005) were able to examine 
lattice parameter fluctuations in austenite and how they evolved during the trans- 
formation to bainite. Abe et al. (1994) carefully examined the kinetics of the mar- 
tensitic transition in In-Tl alloys and could make detailed comparisons with 
theory. The studies of transition kinetics between the cubic and the orthorhombic 
phases in Ni3Sb after sudden changes in temperature by Svensson et al. (1997)
elucidate well the continuity between the concepts of athermal and isothermal 
martensitic transformations. 

One of the great strengths of the time-resolved scattering technique is its wide 
applicability. Thus, in addition to its use for more fundamental studies of alloy 
kinetics, it has been increasingly employed as a unique tool for investigating 
applied materials problems. For example, Meneghini et al. (2003) have used
time-resolved scattering to examine microstructure evolution in Co-Cu giant
magnetoresistance (GMR) alloys, Chen et al. (2003) examined the evolution of a nickel
aluminide bond coat at high temperatures, Elmer et al. (2004) tracked phase for- 
mation during the welding of steel, Choo et al. (2004) followed the strain evolu- 
tion during tensile loading and creep in a TiAl-W alloy, and Wada and Tabira 
(2005) examined the structural evolution of gas-storage alloys during chemical 
reactions. 


13.1.5 
Diffuse Scattering from Disordered Alloys 


Many alloys of interest are disordered at the atomic scale, either being amorphous 
or being crystalline without long-range chemical order. In either case, the local 
structure of the material is best studied through diffuse scattering. Diffuse
scattering studies of alloy structures with X-rays and neutrons
have a long history, but the development of synchrotron sources has enabled the 
full development of techniques which were previously quite difficult. The broad 
energy spectrum of synchrotron sources has enabled anomalous X-ray scattering 
to be used to separate different contributions to diffuse scattering. Meanwhile the 
high collimation of synchrotron beams has facilitated the further development of 
small-angle X-ray scattering (SAXS) and surface-sensitive scattering to study long- 
range chemical fluctuations and near-surface structure, respectively. Finally, the 
use of high-energy synchrotron X-rays has enabled the measurement of diffuse 
scattering over entire planes of reciprocal space in a single shot using position-sensitive detectors.
Schweika (1998) has written a general review of the field. 

13.1.5.1 Metallic Glasses and Liquids

The elastic scattering intensity from a single-component amorphous material can 
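be simply written as Eq. (34), where the sum is over the N atoms, $b_i$ is the scattering
factor (or scattering length in the case of neutrons), and $\mathbf{r}_{ij} = \mathbf{r}_i - \mathbf{r}_j$.

$$I_{\mathrm{elas}}(\mathbf{q}) \propto \sum_{i,j} b_i b_j^{*}\, e^{i\mathbf{q}\cdot\mathbf{r}_{ij}} = N|b|^2 S(\mathbf{q}) \tag{34}$$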


For an isotropic material, the structure factor S(q) depends only on the magnitude
of q. The quantity of interest is usually the real-space pair correlation function
g(r), which can be obtained from the experimental S(q) by Fourier transformation
[Eq. (35)], where $\rho_0$ is the average number density.

$$g(r) = \rho_0 + \frac{1}{2\pi^2 r}\int_0^{\infty} q\,[S(q) - 1]\,\sin(qr)\,dq \tag{35}$$


The quantity $4\pi r^2 g(r)\,dr$ gives the average number of atoms around a central
atom at a distance between r and r + dr. Thus, for instance, g(r) yields the aver- 
age number and distance of nearest neighbors. Because of the averaging over ori- 
entation and individual atoms taking place in g(r), some researchers have instead 
used the scattering data as input to inverse Monte Carlo simulations, which then 
yield real-space models of the material structure (cf. Chapter 12). This has been 
particularly useful when examining the intermediate range order in alloys, i.e., 
order on length scales of about 1-2 nm. The pair distribution function is less sen- 
sitive to such structures than is S(q), particularly in the relatively low-q region. 
While an inverse Monte Carlo simulation result is not unique, the procedure 
does allow input of other physical information, such as steric constraints and av- 
erage density, as well as data from other experimental techniques, such as ex- 
tended X-ray absorption fine structure (EXAFS). For example, Sheng et al. (2006) 
have recently used this approach to examine short-range and medium-range 
order in a Ni-P metallic glass. Also helpful in discerning structural features is 
the collection of high-q data from high-energy X-ray or neutron scattering. In per- 
forming the transform to obtain g(r), this ensures that peak widths are not broad- 
ened by the truncation of the integral and better allows individual coordination 
shells to be identified (Billinge 2004). 
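
The effect of truncating the transform can be made concrete with a small numerical sketch. It assumes the convention in which $4\pi r^2 g(r)\,dr$ counts atoms, so that $g(r) = \rho_0 + (2\pi^2 r)^{-1}\int q\,[S(q)-1]\sin(qr)\,dq$ with $\rho_0$ the average number density, and it uses a toy structure factor for a single sharp shell of neighbors (the shell radius, coordination number, and function names are hypothetical; $\rho_0$ is set to zero to isolate the shell). The height of the resulting peak in g(r) grows with the upper integration limit, which is equivalent to the peak becoming narrower.

```python
import math


def pair_correlation(q, s, r, rho0):
    """Sine-transform a sampled S(q) to g(r) at one distance r (trapezoidal rule).

    q and s are equal-length lists on a uniform grid starting at q = 0.
    The finite upper limit q[-1] truncates the integral.
    """
    dq = q[1] - q[0]
    integrand = [qi * (si - 1.0) * math.sin(qi * r) for qi, si in zip(q, s)]
    integral = dq * (sum(integrand) - 0.5 * (integrand[0] + integrand[-1]))
    return rho0 + integral / (2.0 * math.pi ** 2 * r)


def shell_structure_factor(q, n_atoms, shell_radius):
    """Toy S(q): n_atoms neighbors on one infinitely sharp shell (illustrative only)."""
    x = q * shell_radius
    return 1.0 + n_atoms * (math.sin(x) / x if x != 0.0 else 1.0)


def peak_height(q_max, dq=0.01, n_atoms=12, shell_radius=2.5, rho0=0.0):
    """Value of g(r) at the shell radius when S(q) is known only up to q_max."""
    n_points = int(round(q_max / dq)) + 1
    q = [k * dq for k in range(n_points)]
    s = [shell_structure_factor(qk, n_atoms, shell_radius) for qk in q]
    return pair_correlation(q, s, shell_radius, rho0)
```

For example, comparing `peak_height(10.0)` with `peak_height(40.0)` shows a peak roughly four times higher for the larger q range, since the area under the peak is fixed by the coordination number while its width scales inversely with the maximum q.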

In alloys with N chemical components, where each component has a different 
average scattering factor (or length), the scattering intensity can be written as the 
sum of N(N + 1)/2 partial structure factors associated with each kind of pair [Eq. (36)].

$$I(\mathbf{q}) \propto \sum_{\alpha,\beta} b_\alpha b_\beta^{*}\, S_{\alpha\beta}(q) \tag{36}$$

If the individual partial structure factors have been determined, they can be transformed
to yield the chemically specific partial distribution functions $g_{\alpha\beta}(r)$
[Eq. (37)], with $\rho_0$ the average number density.

$$g_{\alpha\beta}(r) = \rho_0 + \frac{1}{2\pi^2 r}\int_0^{\infty} q\,[S_{\alpha\beta}(q) - 1]\,\sin(qr)\,dq \tag{37}$$


Obviously these are more revealing than the total pair correlation function in a 
metallic glass alloy. To determine the individual partial structure factors, the scattering
factors $f_\alpha$ can be varied (Keating 1963), either by isomorphous substitution,
by isotopic substitution to vary the neutron scattering lengths in the case of neutron
scattering, or by utilizing the “anomalous,” or resonant, change in the X-ray
scattering factor near the absorption edges of the elements as in Eq. (38), where
$f_0(q)$ is the atomic form factor, the Fourier transform of the real-space electron
distribution of an atom, and $f'$ and $f''$ are the real and imaginary anomalous
scattering factors, respectively.

$$b_\alpha(q, E) = f_{0\alpha}(q) + f'_\alpha(E) + i f''_\alpha(E) \tag{38}$$


In the case of isomorphous substitution there is always concern that the amor- 
phous material prepared with an isomorphous replacement will not have an iden- 
tical local structure to the original; therefore the methods of isotopic substitution 
or resonant X-ray scattering are more often preferred. Unless isotopes are avail- 
able to change the sign of one or more of the components’ scattering lengths, 
however, the resulting equations are usually rather ill-conditioned, making accurate
determination of the $S_{\alpha\beta}(q)$ difficult. Again, one way to try to overcome this
problem is to use inverse Monte Carlo simulations to fit several independent sets 
of scattering data while also incorporating known physical constraints. 
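
The ill-conditioning can be illustrated with a short sketch for a binary alloy. It assumes real scattering factors and intensities of the form $I = b_A^2 S_{AA} + 2 b_A b_B S_{AB} + b_B^2 S_{BB}$ at a fixed q, so that three measurements with different $(b_A, b_B)$ give a 3 x 3 linear system for the three partial structure factors. The scattering-factor values below are hypothetical numbers chosen only for illustration: one set mimics the small changes available near an absorption edge, the other an isotopic substitution that changes the sign of one scattering length. The condition number (infinity norm) measures how strongly errors in the measured intensities are amplified in the solution.

```python
def weights(b_a, b_b):
    """Weights multiplying (S_AA, S_AB, S_BB) for real scattering factors."""
    return [b_a * b_a, 2.0 * b_a * b_b, b_b * b_b]


def inverse_3x3(m):
    """Inverse of a 3 x 3 matrix from its adjugate and determinant."""
    (a, b, c), (d, e, f), (g, h, i) = m
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)
    adjugate = [
        [e * i - f * h, c * h - b * i, b * f - c * e],
        [f * g - d * i, a * i - c * g, c * d - a * f],
        [d * h - e * g, b * g - a * h, a * e - b * d],
    ]
    return [[x / det for x in row] for row in adjugate]


def inf_norm(m):
    """Maximum absolute row sum of a matrix."""
    return max(sum(abs(x) for x in row) for row in m)


def condition_number(m):
    """Infinity-norm condition number: amplification factor for relative errors."""
    return inf_norm(m) * inf_norm(inverse_3x3(m))


def recover_partials(m, intensities):
    """Solve for (S_AA, S_AB, S_BB) at one q from three measured intensities."""
    inv = inverse_3x3(m)
    return [sum(x * y for x, y in zip(row, intensities)) for row in inv]


# Hypothetical scattering factors (arbitrary units), for illustration only.
resonant = [weights(28.0, 13.0), weights(26.5, 13.0), weights(25.0, 13.0)]
isotopic = [weights(14.4, 5.0), weights(-8.7, 5.0), weights(10.3, 5.0)]
```

With these numbers `condition_number(resonant)` comes out far larger than `condition_number(isotopic)`, in line with the statement that a sign change of one scattering length greatly improves the conditioning of the problem.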

An alternative to taking N(N + 1)/2 independent data sets to determine all the 
partial structure factors fully is to change only the scattering factor of a single 
chemical component and measure the difference, so that all correlations not 
involving that atomic species cancel. This is differential anomalous scattering 
(Fuoss et al. 1981); it is particularly convenient using two X-ray photon energies 
just below the absorption edge of one atomic species we designate A. Then we 
may often neglect the change in scattering that occurs for the other atomic spe- 
cies and Eq. (39) applies, where we assume that the scattering factor is real for 
simplicity. 





$$\Delta I(q) = I(q, E_1) - I(q, E_2) = \Delta f'_A \sum_{\beta} f_\beta S_{A\beta}(q) \tag{39}$$

This can be normalized and transformed to give a “differential distribution function,”
which is sensitive only to the environment around that species of atom.
Recently this approach has been used, for instance, to examine short- and 
medium-range order in complex Zr-Cu-Ni-Ta-Al amorphous alloys (Hufnagel 
and Brennan 2003). 


13.1.5.2 Diffuse Scattering from Disordered Crystalline Alloys 

Disordered crystalline alloys exhibit diffuse scattering because of the nonperiodic 
arrangement of individual atomic species on lattice sites, the displacement of 
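atoms from their ideal lattice positions, and thermal vibrations. Thus the scattered
intensity is given by Eq. (40), where $\mathbf{R}_{ij} = \mathbf{R}_i - \mathbf{R}_j$ is the difference in ideal
lattice position of two atoms i and j in the crystal and $\boldsymbol{\delta}_i$, $\boldsymbol{\delta}_j$ are the displacements
of these atoms from their ideal positions.

$$I(\mathbf{q}) \propto \sum_{i,j} b_i b_j^{*}\, e^{i\mathbf{q}\cdot\mathbf{r}_{ij}} = \sum_{i,j} b_i b_j^{*}\, e^{i\mathbf{q}\cdot\mathbf{R}_{ij}}\, e^{i\mathbf{q}\cdot(\boldsymbol{\delta}_i - \boldsymbol{\delta}_j)} \tag{40}$$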


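In the case of a binary alloy we can write Eq. (41) (Borie and Sparks 1971), where
$c_A$ and $c_B$ are the concentrations of the two species, and $P_{ij}^{\alpha\beta}$ is the (conditional) probability
of finding an atom of species $\beta$ on site j if there is an atom of species $\alpha$ on site i.

$$\begin{aligned}
\left\langle b_i b_j^{*}\, e^{i\mathbf{q}\cdot(\boldsymbol{\delta}_i - \boldsymbol{\delta}_j)} \right\rangle
&= c_A P_{ij}^{AA}\, b_A b_A^{*} \left\langle e^{i\mathbf{q}\cdot(\boldsymbol{\delta}_i^{A} - \boldsymbol{\delta}_j^{A})} \right\rangle
 + c_A P_{ij}^{AB}\, b_A b_B^{*} \left\langle e^{i\mathbf{q}\cdot(\boldsymbol{\delta}_i^{A} - \boldsymbol{\delta}_j^{B})} \right\rangle \\
&\quad + c_B P_{ij}^{BA}\, b_B b_A^{*} \left\langle e^{i\mathbf{q}\cdot(\boldsymbol{\delta}_i^{B} - \boldsymbol{\delta}_j^{A})} \right\rangle
 + c_B P_{ij}^{BB}\, b_B b_B^{*} \left\langle e^{i\mathbf{q}\cdot(\boldsymbol{\delta}_i^{B} - \boldsymbol{\delta}_j^{B})} \right\rangle
\end{aligned} \tag{41}$$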


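The exponentials with the displacements can also be expanded in a power series,
Eq. (42), where $\boldsymbol{\delta}_{ij} = \boldsymbol{\delta}_i - \boldsymbol{\delta}_j$.

$$\left\langle e^{i\mathbf{q}\cdot\boldsymbol{\delta}_{ij}} \right\rangle = 1 + i\left\langle \mathbf{q}\cdot\boldsymbol{\delta}_{ij} \right\rangle - \frac{1}{2}\left\langle (\mathbf{q}\cdot\boldsymbol{\delta}_{ij})^2 \right\rangle + \cdots \tag{42}$$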


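The total intensity can then be written as the sum of a fundamental intensity,
which is independent of the local order, a term dependent only on the short-range
order (SRO), a first-order displacement term (1D), and higher-order displacement
terms (hD), as in Eq. (43), where the first term on the right-hand side is given by
the second line of Eq. (43), using thermal Debye-Waller factors as in Eq. (44).

$$I(\mathbf{q}) = I_{\mathrm{fund}}(\mathbf{q}) + I_{\mathrm{SRO}}(\mathbf{q}) + I_{\mathrm{1D}}(\mathbf{q}) + I_{\mathrm{hD}}(\mathbf{q}),$$

$$I_{\mathrm{fund}}(\mathbf{q}) = \left| c_A b_A e^{-M_A} + c_B b_B e^{-M_B} \right|^2 \sum_{i,j} e^{i\mathbf{q}\cdot\mathbf{R}_{ij}} \tag{43}$$

$$e^{-M_\alpha} = \left\langle e^{i\mathbf{q}\cdot\boldsymbol{\delta}_\alpha} \right\rangle = e^{-\langle (\mathbf{q}\cdot\boldsymbol{\delta}_\alpha)^2 \rangle / 2} \tag{44}$$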

We introduce the Warren-Cowley SRO parameter $\alpha_{lmn} = 1 - P_{lmn}^{AB}/c_B$, where
the indices l, m, n refer to the relative positions of two atoms in lattice units. In
a cubic crystal, the SRO intensity is given by Eq. (45), where $h_1$, $h_2$, $h_3$ are the indices
of the scattering wave vector in reciprocal lattice units and the exponential
term represents an effective Debye-Waller factor between neighbors separated by
l, m, n lattice units.

$$I_{\mathrm{SRO}}(\mathbf{q}) = N c_A c_B\, |b_A - b_B|^2 \sum_{l,m,n} \alpha_{lmn}\, e^{-M_{lmn}} \cos[\pi(h_1 l + h_2 m + h_3 n)] \tag{45}$$
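
As a rough illustration of how the Warren-Cowley parameters shape the diffuse intensity, the following sketch evaluates the SRO sum for an fcc alloy. It makes several simplifying assumptions that are not in the text: only the self term ($\alpha_{000} = 1$) and one common value on the 12 nearest neighbors are kept, the neighbor vectors are expressed in units of half the cubic lattice parameter (the convention that goes with the factor $\pi$ in the cosine), and the effective Debye-Waller factor is set to 1. The scattering factors and the function names are hypothetical.

```python
import math


def fcc_first_shell():
    """The 12 nearest-neighbor vectors of the fcc lattice in units of a/2."""
    shell = set()
    for s1 in (1, -1):
        for s2 in (1, -1):
            shell.update({(s1, s2, 0), (s1, 0, s2), (0, s1, s2)})
    return sorted(shell)


def first_shell_model(alpha_1):
    """alpha_000 = 1 plus a single value alpha_1 on all 12 nearest neighbors."""
    alpha = {(0, 0, 0): 1.0}
    alpha.update({vec: alpha_1 for vec in fcc_first_shell()})
    return alpha


def sro_intensity(h, alpha, c_a, b_a, b_b, n_atoms=1.0):
    """SRO diffuse intensity at the reciprocal-space point h = (h1, h2, h3).

    alpha maps each neighbor vector (l, m, n) to its Warren-Cowley parameter.
    The effective Debye-Waller factor is taken as 1 (an assumption).
    """
    c_b = 1.0 - c_a
    total = sum(a_lmn * math.cos(math.pi * (h[0] * l + h[1] * m + h[2] * n))
                for (l, m, n), a_lmn in alpha.items())
    return n_atoms * c_a * c_b * abs(b_a - b_b) ** 2 * total
```

With a negative nearest-neighbor parameter, for instance `first_shell_model(-0.05)`, the intensity at the (100) superlattice position exceeds that near the origin and the fundamental reflections, as expected for an ordering tendency; a positive value (clustering) reverses the comparison.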


The different factors entering the diffuse scattering, SRO, atomic displacements 
and thermal vibrations, can be separated by using their different symmetries 
in reciprocal space and by varying the scattering factors of the atomic species. To 
vary the scattering factors, isotopic substitution in neutron scattering or resonant 
X-ray scattering can be used. Isomorphous substitution is also possible, but, as 
is the case with diffuse scattering studies of amorphous materials, it can always 
suffer the problem that the structure of separate alloy samples prepared using 
isomorphous elements may not be identical. A relatively recent review of the sub- 
ject, within the context of using resonant X-ray scattering to help separate the dif- 
ferent components, has been written by Ice and Sparks (1999). 

Most diffuse scattering studies examining SRO and displacements have tradi- 
tionally used a point detector to measure the diffuse scattering throughout a 
symmetry-chosen volume of reciprocal space. However, using relatively high 
X-ray or neutron energies in conjunction with an area detector, a relatively flat 
plane of diffuse scattering data can be recorded in a single exposure (Reichert 
et al. 2001b). This considerably facilitates the study of local alloy structure as a 
function of temperature and/or concentration. 

With the information on SRO and displacements available from diffuse scatter- 
ing, much can be learned about the nature of interactions in the alloy. As some 
recent examples, local atomic arrangements in an Fe-Ni Invar alloy have been correlated
with the material's anomalous thermal properties (Robertson et al. 1999),
and Mezger et al. (2006) have been able to determine effective interactions in 
Ni-Pd alloys. Wang et al. (2006) examined the diffuse scattering from classic 
LPS Cu-Pd alloys. These exhibit diffuse peaks at satellite positions associated 
with the modulated order, rather than at the superlattice sites themselves (Figs. 
13.8a, 13.9a). From the diffuse scattering data, they derived effective pair potentials
which exhibit Friedel oscillations (Fig. 13.8b), presumably due to Fermi-surface
nesting in the ⟨110⟩ directions (Sato and Toth 1961). The experimental
effective pair potentials were then used in Monte Carlo simulations to reproduce 
the temperature dependence of the diffuse scattering fine structure in the disor- 
dered phase (Fig. 13.9b) and the details of the ordering kinetics, as discussed 
above in Section 13.1.4.2. 

Fig. 13.8 a) Short-range order diffuse X-ray scattering in disordered
Cu0.79Pd0.21 alloy; b) effective pair interaction between atoms derived
from the diffuse scattering; the pair potential shows Friedel oscillations.

Fig. 13.9 a) Temperature dependence of observed diffuse scattering fine
structure in disordered Cu0.79Pd0.21 alloy; b) results of Monte Carlo
simulations of the diffuse scattering fine structure as a function of
temperature above the spinodal ordering point $T_{sp}$.


13.1.6 
Surface Scattering — Atomic Segregation and Ordering near Surfaces 


While X-rays have traditionally been considered a probe of the bulk structure 
of a material due to their relatively long penetration depth (typically microns), 
the high brightness of synchrotron sources has facilitated the development of 
surface-sensitive X-ray scattering. This uses glancing incidence/exit angles 
comparable to the critical angle of total reflection (of order 1°), or the weak X-ray 
scattering from surfaces at specific reciprocal space positions to achieve its sur- 
face sensitivity. Most of the early applications of the technique focused on surface 
structure in ultrahigh vacuum (UHV); an overview can be found in Robinson and 
Tweet (1992). The variety of surface structural information that can be obtained
from surface-sensitive X-ray diffraction is remarkable. In the case of surface reconstruction
(or any process yielding a surface layer of different structure than
the bulk) the atomic structure of the surface can be determined with two- 
dimensional crystallography, in analogy to traditional three-dimensional crystallo- 
graphic methods. Two-dimensional structures, such as a surface reconstruction 
or the simple termination of a bulk crystal, have Fourier transforms that fall off 
slowly in the normal direction. Thus, instead of Bragg peak spots, surface struc- 
ture gives rise to truncation rods of diffracted intensity normal to the surface. 
These crystal truncation rods are also very sensitive to the structure normal to 
the surface, particularly at the anti-Bragg points, halfway between the normal 
Bragg reciprocal lattice points perpendicular to the surface. At these anti-Bragg 
points, alternating crystal layers give scattering amplitudes with alternating signs, 
so that the scattering there is very sensitive to the surface structure. The rods can 
be analyzed to determine the depth of the surface structure, surface roughness, 
surface chemical segregation, vertical relaxation of outer surface layers, and regis- 
try of the surface structure with the underlying crystal. 
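
The sensitivity at the anti-Bragg point can be seen in a minimal kinematic sketch. It assumes an idealized semi-infinite stack of identical layers with unit scattering amplitude, spaced by one lattice unit along the surface normal, summed as a geometric series (convergence is understood in the limit of vanishing absorption), plus one additional top layer with fractional occupancy. The function names and the occupancy parameter are illustrative and not taken from the text.

```python
import cmath
import math


def ctr_amplitude(l, top_layer_occupancy=0.0):
    """Scattering amplitude along a truncation rod at perpendicular index l.

    Bulk layers sit at heights 0, -1, -2, ...; an optional adlayer with the
    given occupancy sits at height +1. Diverges at integer l (Bragg points).
    """
    bulk = 1.0 / (1.0 - cmath.exp(-2j * math.pi * l))
    adlayer = top_layer_occupancy * cmath.exp(2j * math.pi * l)
    return bulk + adlayer


def ctr_intensity(l, top_layer_occupancy=0.0):
    """Rod intensity, the squared modulus of the amplitude."""
    return abs(ctr_amplitude(l, top_layer_occupancy)) ** 2
```

At l = 0.5 the bulk amplitude is 1/2 and the adlayer contributes with opposite sign, so a half-filled top layer extinguishes the rod intensity entirely, whereas close to a Bragg point (for example l = 0.05) the same half layer changes the intensity by only a few percent.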

Of particular interest for alloy physics is preferential atomic segregation or 
ordering at a surface (Dosch and Reichert 2000). This can become quite complex, 
with ordering and segregation apparently competing in many cases, as has been 
observed in Ni-Al alloys (Drautz et al. 2001). The nature of critical phenomena 
near surfaces, and how it relates to the bulk phase transition, can also be eluci- 
dated, as seen, for instance, in the case of V2H (Trenkler et al. 1998). 


13.1.7 
Scattering from Quasicrystals 


X-ray and neutron scattering have been crucial for elucidating the structure of 
quasicrystals. Kelton (1993) wrote an earlier review, while Steurer (2003,
2004) has reviewed much of the more recent structural work (cf. Chapter 2). In 
general, the diffraction pattern from a perfect quasicrystal consists of a mathe- 
matically dense set of Bragg spots. Often it is useful to reference the observed 
three-dimensional reciprocal space structure to a Bravais lattice in a higher- 
dimensional space. As in conventional three-dimensional crystallography, the 
intensity of Bragg spots can then be used in an effort to determine the atomic 
structure of the quasicrystal. Due to the large number of reflections from a quasi- 
crystal, however, this is difficult. In practice, of course, only a finite set of peaks 
will be sufficiently intense to be observed in a given diffraction experiment. 
Indeed, in real quasicrystals, the diffuse scattering between the most intense 
Bragg peaks can often be dominated by long-wavelength phason modes rather 
than the weaker Bragg peaks from the perfect quasicrystal (de Boissieu and Fran- 
coual 2005). Chemical and local structural disorder may also be significant. Given 
these difficulties in determining the long-range order of perfect quasicrystals, 
much more information has so far been gained about the nature of short-range 
order. 


13.1.8
Outlook 


Development of new experimental techniques has always revitalized old scientific 
disciplines, pushing their evolution and growth in an unexpected way. Over the 
past few decades, the same has happened with X-rays. In the 1970s, following 
decades of steady progress in the theoretical understanding of the interaction of 
X-rays with matter, a technique over a century old (1895: W.C. Röntgen) received 
an enormous impulse, thanks to the development of a new radiation source. It 
was realized that the synchrotron radiation emitted from electrons circulating in 
storage rings is potentially a much more intense and versatile source of X-rays 
than the widely used X-ray tubes. The period since the mid-1970s has been a
time of continuous development of this kind of X-ray source, culminating to 
date in the so-called third-generation synchrotron sources as indicated in Fig. 
13.1. The next generation of sources, based on a free-electron laser (FEL), is 
already operating and will culminate with the hard X-ray FELs expected to be 
turned on near 2010. The unprecedented increase in photon flux from synchro- 
tron sources was certainly the main reason for application of X-rays in an enor- 
mously broad range of materials and life sciences, and it revolutionized the whole 
of X-ray physics. In particular, nanoscience benefits significantly from the new 
brilliant sources through development of spectroscopic methods revealing funda- 
mental dynamical processes in nanoscale materials with unprecedented spatial 
and temporal resolution. The very low emittance defined by the size and the 
angular divergence of the electron beam circulating in the rings contributes to 
the high coherence of the new sources, which, as a consequence, has permitted 
development of time correlation techniques previously feasible with visible-light 
lasers only. New experimental techniques based on inelastic X-ray and nuclear 
resonant scattering reveal fundamental dynamical processes in nanoscale materi- 
als. Within a decade the technique of using X-rays for the study of vibrational 
excitations achieved amazing results and opened a qualitatively “new window” 
of scientific research. 

It is clear that in some applications X-ray methods are and will in the future 
continue to be complementary to other techniques, especially to neutron scatter- 
ing. One should take into account, however, that neutron spectroscopy seems to 
be in a relatively mature phase of its scientific development. Over three decades, 
the maximum intensity of the available neutron sources has not significantly in- 
creased. Moreover, more and more techniques previously reserved for neutrons 
can now be performed with X-rays, e.g., magnon spectroscopy, phonon spectros- 
copy, and the study of magnetic switching and magneto-optics. 

In sum, we are confident that X-ray scattering techniques will dominate the al- 
loy research landscape during many decades of the 21st century. Since a funda- 
mental goal of materials research is to understand and successfully predict the 
evolution of materials during thermal or other processing, the kinetics and dy- 
namics of alloys is likely to continue to grow in interest. Ideally, the continued 
development of computational speed will increasingly facilitate the comparison 
of first-principles predictions with experiment. Looking ahead, we cannot say in
detail what new science will develop if a new generation of radiation sources
with still 10° times higher brilliance appears, but it will certainly be very exciting.


13.2
High-Resolution Microscopy 
Guido Schmitz and James M. Howe 


In this chapter we describe techniques to investigate the micro- and nanostruc- 
ture of metals and alloys. For the most part, the physics of alloys deals with 
defects — local deviations from the ideal structure or composition — since these 
defects are responsible for many application-relevant properties. Point defects, 
like vacancies and impurities, or interstitials and antisite defects, determine the 
kinetics of atomic transport. Structure, shape, and stability of interfaces and sur- 
faces control the reaction mechanism of phase transformations and catalytic pro- 
cesses, respectively. Dislocations, their spatial distribution, and their interaction 
with precipitates determine plasticity, hardness, and toughness of structural ma- 
terials. The size distribution and spatial arrangement of precipitates are decisive 
for pinning mechanisms in superconductors or ferromagnets in order to increase 
the critical current or coercivity, respectively. These examples emphasize the role 
of defects and justify the enormous experimental effort that is frequently under- 
taken to clarify the defect structure of materials. 

To some extent, the analysis of defects is possible by integral methods such 
as scattering techniques or various types of spectroscopy, such as Mössbauer
spectroscopy, positron annihilation, or nuclear magnetic resonance. Also, indirect 
methods such as the measurement of electric conductivity are quite helpful to determine
the number density of known defects. However, in all these cases the interpretation
of data requires an a priori model based on a considerable amount of
prior knowledge about the nature and geometry of the respective defects. Thus,
whenever the nature of defects needs to be identified, different types of defects 
overlap in their effect, or their absolute position within the specimen must be 
clarified, integral techniques are not sufficient. Instead, microscopic methods 
which directly image the defect arrangement or provide local chemical or spectro- 
scopic information are the suitable choice. 

As the number of major microscopy techniques is large and the possibilities 
of their variation even larger, we restrict this discussion to three major
branches of high-resolution microscopy, namely surface analysis by scanning
probe microscopy (SPM), structural imaging by high-resolution transmission
electron microscopy (HRTEM), and local chemical analysis by atom probe tomography
(APT). The outstanding feature common to these techniques is their atomic
resolution. Each represents the state of the art in its field.

The application of scanning probe microscopy in materials science is rapidly 
expanding, so that textbooks and comprehensive reviews are published regularly. 
Thus, we restrict our discussion to a general orientation on the various branches 
of the technique and their importance for materials science. Transmission elec- 
tron microscopy is widely utilized among materials laboratories throughout the 
world. Since the physics of imaging and wave optics is already taught routinely 
in university courses, we focus our review on a dedicated variation of the method,
the in-situ microscopy of structural transformations, to emphasize this important 
aspect of alloy physics. HRTEM is able to image single point defects (see Chapter
5) in materials only in special situations. In contrast, field ion microscopy (FIM)
can be used to study vacancies and surface atoms and therefore extends the avail- 
able resolution to atomic dimensions. When combined with a mass spectrometer, 
the field ion microscope becomes an atom probe (APFIM), easily capable of com- 
positional analysis of local areas of samples. This technique is therefore highly 
complementary to HRTEM and is discussed in Section 13.2.3. The latest version
of analytical FIM, so-called atom probe tomography, is currently used by only a
few groups throughout the world, so we expect the reader to be less familiar with it.
Therefore, a somewhat more detailed description of the physical principles
is presented before recent applications to the physics of alloys are discussed. 


13.2.1 
Surface Analysis by Scanning Probe Microscopy 


Scanning probe microscopy (SPM) has become an absolutely essential tool for the 
analysis of surfaces down to the nanometer scale. The term SPM identifies a family
of techniques that are based on the surprisingly simple principle of mechanical
scanning. Binnig and Rohrer (1982) were the first to demonstrate that the
surface of a conductive material may be scanned at atomic resolution if piezoelectric
actuators are controlled by the tunneling current between a tip-shaped
probe and the sample. Plotting the height of the probe versus its lateral position, 
the topography of the surface could be imaged. The scanning tunneling micro- 
scope (STM) was born. Owing to its atomic resolution, the STM has been used 
extensively to study the growth mechanisms of metallic thin films. Various 
growth modes (the layer-by-layer Frank-van der Merwe mode, Volmer-Weber island
nucleation, or the two-stage Stranski-Krastanov mode) could be easily distinguished
from the topography of the growing film. Furthermore, the nucleation
of first islands and even details of the behavior of individual adatoms, such as 
general surface diffusion or the influence of the Schwoebel barrier at the edges 
of terraces, could be investigated quantitatively. Other important applications are 
the study of atomic reconstruction and order of alloy or intermetallic surfaces 
(Göken et al. 2002). In addition to imaging, local spectroscopy is possible by vary- 
ing the bias voltage. This scanning tunneling spectroscopy (STS) reveals informa- 
tion on filled and empty states of the electronic structure. The local density of 
states, including band gaps and surface states, can be measured. 

Although it is not critical within the context of this book on alloy physics, a gen- 
eral disadvantage of STM is its restriction to conductive materials and surfaces. 
Shortly after the introduction of STM, it was shown that the required control of 
the probe-specimen distance could also be achieved by measuring the atomic 
interaction forces between the sample and probe (Binnig et al. 1986). With this
so-called atomic force microscopy (AFM), a standard resolution in the nanometer 
range is obtained in the so-called repulsive contact mode. Applying a more elaborate
dynamic mode, even atomic resolution has been demonstrated (Giessibl
1995) and the atomic surface structure of nonconductive materials could be imaged
for the first time. The emergence of the AFM triggered the development of
many different branches of scanning probe techniques, among them magnetic or 
electric force microscopy (MFM, EFM) (Hartmann 1999) or scanning near-field 
optical microscopy (SNOM). For the latter, the surface is excited by a local optical 
field a few tens of nanometers in diameter that is scanned across the surface by 
manipulating a suitable optical fiber. The variety of different techniques and mod- 
ifications is so large that providing a thorough overview is difficult. The interested 
reader may find detailed descriptions of scanning probe techniques in textbooks 
(e.g., Magonov and Whangbo 1996; Meyer et al. 2003) or recent reviews (Meyer et
al. 2004). 

AFM and the scanning probe techniques in general do not need a vacuum 
environment. They are performed in many different atmospheres, even under 
liquids, so that materials can be studied easily that cannot be investigated
with conventional electron microscopy because they are not vacuum compatible.
Consequently, the number of applications to soft matter, such as polymer films or
biomaterials in their native environment, is increasing. In addition, AFM often
requires minimal effort in specimen preparation so that this method is also 
advantageous in the analysis of traditional materials, provided that surface infor- 
mation is sufficient to solve the relevant problem. In the context of metallic mate- 
rials, the possibility of measuring local mechanical properties is of particular 
interest, since alloys are still used predominantly as structural materials. By 
replacing the usual cantilever with a diamond nanoindenter that is driven by ca- 
pacitive actuators, local measurements of hardness are possible with a spatial res- 
olution of a few tens of nanometers. This way, even local stress fields at crack tips 
or the plastic properties of embedded particles can be investigated (Kempf et al. 
1998). 

Below, we briefly describe the functional principles of scanning tunneling and 
atomic force microscopy, albeit with a focus on the latter technique. The applica- 
tion to typical problems in the physics of alloys is demonstrated using several ex- 
amples, including surface reconstruction of ordered intermetallics investigated by 
STM, the characterization of heterogeneous microstructures by AFM, the study 
of grain boundary grooving, local measurement of plasticity, and also the imaging 
of magnetic domains and the substructure of domain boundaries with MFM 
techniques. 


13.2.1.1 Functional Principle of Scanning Tunneling and Atomic Force Microscopy 
The various branches of scanning probe microscopy follow a common principle: 
a probe as sharp as possible scans a surface using very accurate mechanics. The 
respective interaction between the specimen and probe is measured and the sig- 
nal is used in a feedback loop to control the height/distance between the surface 
and probe. An image of the surface topology is obtained by plotting the height 
position of the probe versus the scanned area. 
Fig. 13.10 Schematic representations of scanning probe techniques:
a) scanning tunneling microscope; b) atomic force microscope. 


In Fig. 13.10(a), a schematic representation of the original scanning tunneling 
microscope is shown. A metallic tip produced by electrochemical or mechanical 
means is mounted on a piezoelectric drive, which allows the tip to be moved in
3D space with sub-angstrom accuracy. If a bias potential of a few volts is supplied
to the probe with respect to the conducting sample, a tunneling current develops 
as soon as the tip approaches the surface at distances below a few nanometers. 
According to quantum mechanics, the tunneling current is proportional to the 
density of electronic states of the specimen and probe, and proportional to the 
transfer matrix, which depends on the relative spacing. For practical purposes a 
simple exponential decay function (46) is often sufficient to estimate the current. 


$$I(h) \propto \exp[-\kappa_{\mathrm{eff}} \cdot h] \qquad (46)$$


Since the decay constant $\kappa_{\mathrm{eff}}$ is of the order of a reciprocal angstrom, the current
depends very sensitively on the probe height h (i.e., the sample-probe distance). 
During the lateral scan, the height of the tip is controlled via the driving voltage 
of the respective piezoactuator so that the tunneling current is always maintained 
constant at a certain preset value. Plotting the required driving voltage versus 
the lateral position yields an image of the surface topology. If the sharpness of 
the probe tip is sufficient — in ideal cases only one final atom may protrude 
from the apex — even the atomic corrugation of the surface becomes visible, so 
that the lateral resolution amounts to about 2-3 Å. The accuracy of the height
profile can be better than 0.1 Å. In the case of alloys, the local electronic density,
and thus the tunneling current, depend on the atomic species. As a consequence, 
the image contains a superposition of local topology and chemical information, 
which can be used to identify individual atoms. 
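
As a rough numerical illustration of this sensitivity, the following Python sketch evaluates the simple exponential law of Eq. (46). The function names and the value of the decay constant (1 Å⁻¹, taken as "of the order of a reciprocal angstrom") are assumptions chosen for illustration only and do not describe a specific instrument.

```python
import math


def current_ratio(delta_h, kappa=1.0):
    """Return I(h + delta_h) / I(h) for I(h) ~ exp(-kappa * h).

    delta_h is given in angstrom, kappa in reciprocal angstrom
    (kappa = 1.0 is an assumed, order-of-magnitude value).
    """
    return math.exp(-kappa * delta_h)


def height_error(relative_current_error, kappa=1.0):
    """Height error (angstrom) equivalent to a small relative current error.

    Differentiating I ~ exp(-kappa * h) gives |dh| = (dI / I) / kappa.
    """
    return relative_current_error / kappa


# Retracting the tip by 1 angstrom reduces the current to exp(-1), about 37 %.
ratio_one_angstrom = current_ratio(1.0)

# Holding the current constant to within 10 % fixes the height to 0.1 angstrom.
dh = height_error(0.10)
```

Under these assumptions, a current that is held stable to within about 10% already corresponds to the height accuracy of 0.1 Å quoted above.
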
Tunneling currents can only be measured when conductive specimens are
observed. In order to extend the principle of mechanical scanning to insulating 
materials, local atomic interaction forces, instead of tunneling currents, are measured
in an AFM. For this purpose, a fine, tip-shaped probe is attached to an elastic cantilever,
as illustrated in Fig. 13.10(b). In most cases, the cantilever deflection is
measured by means of an optical beam, which is reflected onto a position- 
sensitive detector (Meyer and Amer 1988). Using a four-quadrant device, as indicated
in Fig. 13.10(b), has the interesting advantage that deflection of the beam may
be determined in two independent directions, so that interaction forces normal 
to the surface as well as lateral friction forces may be measured. Alternatively, ca- 
pacitive sensors can be used to detect the deflection of the cantilever. State-of-the- 
art detectors are very precise - a shift of the probe by fractions of an Angstrom is 
readily detected. Since cantilevers are produced by methods of microfabrication 
such as lithography, anisotropic etching, and ion beam sputtering, they may be 
formed in varying geometry and dimensions. Typical spring constants are in the 
range of 10⁻² to 5 × 10² N m⁻¹, so that, using Hooke’s law, interaction forces in the
piconewton range may be measured. 
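
The Hooke's-law estimate can be sketched in a few lines of Python. The spring constants of 0.01 and 500 N/m and the deflection of 0.1 Å are assumed example values (the latter consistent with "fractions of an angstrom"); the function name is hypothetical.

```python
ANGSTROM = 1e-10  # metres


def interaction_force(spring_constant, deflection):
    """Hooke's law: force (N) from spring constant (N/m) and deflection (m)."""
    return spring_constant * deflection


# A soft and a stiff cantilever (assumed values), each deflected by 0.1 angstrom
force_soft = interaction_force(1e-2, 0.1 * ANGSTROM)   # about 1e-13 N (0.1 pN)
force_stiff = interaction_force(5e2, 0.1 * ANGSTROM)   # about 5e-9 N (5 nN)
```

With a soft cantilever, the smallest detectable deflection thus translates into sub-piconewton forces, whereas a stiff cantilever responds only to forces in the nanonewton range.
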

Using modern techniques of microfabrication, the resonance frequency of can- 
tilevers is adjusted in the range of 10-1000 kHz, which allows a typical scanning 
rate of about 500 Hz. If, for example, an area of 30 × 30 µm² is to be scanned
in 512 × 512 pixels, the total measuring time per frame amounts to about 10 min, so
that there is little possibility of increasing the field of view significantly. This is 
one of the important drawbacks of SPM in comparison to electron microscopy. 
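
The frame time follows from simple arithmetic, sketched below in Python. The sketch assumes that the scanning rate of 500 Hz is to be read as a pixel rate; the function name is hypothetical.

```python
def frame_time(pixels_x, pixels_y, pixel_rate_hz):
    """Time (s) to acquire one frame at a given pixel rate (Hz)."""
    return pixels_x * pixels_y / pixel_rate_hz


seconds = frame_time(512, 512, 500.0)   # 524.288 s
minutes = seconds / 60.0                # about 8.7 min
```

Under this assumption the frame time is just under 9 min, i.e., of the order of the 10 min quoted in the text. Doubling the number of pixels per line at the same pixel rate would quadruple it.
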

First and foremost, any microscopy should provide an image of the real specimen.
Thus, any modifications of the specimen structure during the imaging process 
should be carefully avoided. But, in addition to merely imaging, the scanning 
probe technique may be used to intentionally modify the local surface structure 
of the sample. The probe is then applied as a unique nanotool. Pushing and positioning
individual atoms on a surface, like writing with “atomic ink” on a
sheet of micrometer dimensions, is certainly the most exciting example. It has been
performed with low-temperature STM and raises the prospect of data storage at an
extremely high packing density. There are other examples of using the scanning probe
as a kind of nanolaboratory. Since the AFM is a device for producing and measur- 
ing local forces, it is natural to suggest using this instrument for local mechanical 
testing. However, the usual cantilever geometry is not suited for this purpose be- 
cause the maximum forces that can be supplied are not sufficient, and further- 
more, because of the lever geometry, vertical movement is always accompanied 
by a slight lateral shift, which hinders an exact vertical indentation and thus the 
correct measurement of hardness. Finally, the conventional probe tips are just too 
fragile to be used as a nanoindenter. To overcome these limitations, an instrumental
modification known as NI-AFM (nanoindenter AFM) has been successfully applied
in local elasticity and hardness tests. Its principle is illustrated by Fig.
13.11. Instead of a cantilever spring, a capacitive transducer is used to operate a 
small pyramidal diamond indenter (Bhushan et al. 1996) in analogy to a measure- 
ment of conventional Vickers hardness. Forces ranging from 100 nN up to 10 mN 
Fig. 13.11 A capacitive transducer for an NI-AFM.


are produced by the electrostatic field of a capacitor. The penetration depth is 
measured via a capacitive sensor, as indicated by the three electrodes in the sche- 
matic drawing. Before and after the indentation the diamond tip may be used to 
produce a topology image, just as with a standard cantilever AFM. 


13.2.1.2 Modes of Measurement in AFM 

To produce an atomic force image, two particular modes of operation are com- 
mon practice, the so-called contact mode and the dynamic mode. For imaging in 
the contact mode, the cantilever deflection is used directly as a feedback signal. In 
other words, a preset value of interaction force is maintained during the scan, so 
that the image represents a surface of constant interaction force. Usually, the re- 
pulsive core interaction is probed. A typical preset for the force may be 100 pN. 
The important advantage of the contact mode is the ease of operation, i.e., com- 
plex electronics are not required. However, since the probe is in direct contact 
with the sample, the probability of specimen damage is relatively high. On the 
other hand, in direct contact with the surface lateral friction forces may be mea- 
sured, which makes the AFM an important tool for tribology. A simple estimate 
demonstrates that atomic resolution is hardly achieved in contact mode. For real 
atomic resolution, a probe with a protruding area in the range of one atom would 
be required, which leads to a contact area of about 5 Å². Given a force of 100 pN,
a stress of 2 GPa would develop underneath the tip, which is close to the theoret- 
ical strength of either the specimen or probe material. Therefore, probes for the 
contact mode are usually much more blunted than this. If image details of appar- 
ent atomic resolution appear under these conditions, they are usually due to 
Moiré effects and do not provide real information at the angstrom level. In prac- 
tice, the lateral resolution of the contact mode ranges from 1 to 10 nm. 
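
The stress estimate can be checked with the short Python sketch below. The force of 100 pN and the contact area of 5 Å² are the values given in the text; the contact diameter of 5 nm for a blunter tip is an assumed example value, and the function name is hypothetical.

```python
import math

ANGSTROM = 1e-10  # metres


def contact_stress(force, contact_area):
    """Mean contact pressure (Pa) for a force (N) acting on an area (m^2)."""
    return force / contact_area


# Single-atom contact: 100 pN on 5 square angstrom
stress = contact_stress(100e-12, 5.0 * ANGSTROM ** 2)   # about 2e9 Pa = 2 GPa

# Blunter tip with an assumed circular contact of 5 nm diameter
blunt_area = math.pi * (2.5e-9) ** 2
blunt_stress = contact_stress(100e-12, blunt_area)       # about 5 MPa
```

The comparison shows why blunted probes are tolerable in contact mode: spreading the same force over a contact a few nanometers wide lowers the stress by more than two orders of magnitude, at the price of lateral resolution.
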

In the dynamic mode the tip is excited to periodic oscillations near the reso- 
nance frequency of the cantilever. During the major part of the oscillation period 
the probe is far (10-100 nm) from the surface, so that predominantly long-range 
forces, e.g., van der Waals, or magnetic and electrostatic interactions may be 
sensed. Since contact with the specimen is reduced to a short intermittent “tap- 
ping,” the technique is also called the tapping mode. The danger of specimen 
damage is significantly reduced in this dynamic mode. Since periodically varying 
quantities can be measured very accurately with lock-in amplifiers, the dynamic
mode is much more sensitive than the contact mode, and under optimum condi- 
tions even atomic resolution may be obtained. Usually the oscillation amplitude 
or phase with respect to the excitation at constant frequency is used as a feedback 
signal to control the piezodrive of the height position. 

The oscillating cantilever may be described as a damped harmonic oscillator
with an equation of motion given by Eq. (47), in which $\omega_0$, $h_0$, and $\delta_0$ denote the resonance
frequency, average position, and steady-state amplitude of the free oscillator
without specimen interaction; $\gamma$ is a damping factor that is related to the quality
factor of the oscillator.


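$$m\frac{\partial^2 h}{\partial t^2} + \gamma\frac{\partial h}{\partial t} + m\omega_0^2 (h - h_0) = \gamma\omega_0\delta_0 \cos\omega t + \left(F_0 + \frac{\partial F}{\partial h}\, h\right) \qquad (47)$$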


The interaction with the specimen is taken into account by the term in brackets 
on the RHS of Eq. (47), which represents a linear approximation of the interac- 
tion force. Without any lengthy calculation, it is seen by shifting the force terms 
to the LHS of the equation that the constant force term will just modify the zero 
position of the oscillator, while the first derivative, the compliance of the interac- 
tion, will modify the resonance frequency as in Eq. (48). 


$$m\tilde{\omega}^2 = m\omega_0^2 - \frac{\partial F}{\partial h} \qquad (48)$$


Since the amplitude of the probe oscillation is given by Eq. (49) and the phase by 
Eq. (50), the shift of the resonance frequency will also modify the oscillation am- 
plitude and the phase. 
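
How a force gradient changes amplitude and phase at a fixed driving frequency can be illustrated with the standard steady-state solution of a driven, damped harmonic oscillator. The Python sketch below is not a transcription of Eqs. (49) and (50); it assumes an equation of motion of the form m h'' + γ h' + m ω₀² (h − h₀) = γ ω₀ δ₀ cos ωt + F₀ + (∂F/∂h) h, with the drive amplitude chosen such that the free cantilever oscillates with amplitude δ₀ on resonance. The cantilever parameters and function name are hypothetical example values.

```python
import math


def steady_state_response(omega, omega0, mass, gamma, delta0, force_gradient=0.0):
    """Amplitude (m) and phase lag (rad) of the driven, damped oscillator.

    Assumed equation of motion (SI units):
        m h'' + gamma h' + m omega0^2 (h - h0)
            = gamma omega0 delta0 cos(omega t) + F0 + (dF/dh) h
    """
    # Effective resonance: m * omega_eff^2 = m * omega0^2 - dF/dh
    omega_eff_sq = omega0 ** 2 - force_gradient / mass
    stiffness_term = mass * (omega_eff_sq - omega ** 2)
    damping_term = gamma * omega
    amplitude = gamma * omega0 * delta0 / math.hypot(stiffness_term, damping_term)
    phase_lag = math.atan2(damping_term, stiffness_term)
    return amplitude, phase_lag


# Assumed example cantilever: k = 1 N/m, f0 = 100 kHz, Q = 100, delta0 = 10 nm
k, f0, Q, delta0 = 1.0, 100e3, 100.0, 10e-9
omega0 = 2.0 * math.pi * f0
mass = k / omega0 ** 2
gamma = mass * omega0 / Q

# Free cantilever driven at omega0: amplitude delta0, phase lag 90 degrees
free = steady_state_response(omega0, omega0, mass, gamma, delta0)

# A force gradient of k/Q lowers the amplitude at omega0 to delta0/sqrt(2)
# and moves the phase lag to 135 degrees
loaded = steady_state_response(omega0, omega0, mass, gamma, delta0,
                               force_gradient=k / Q)
```

In this example a force gradient of only 1% of the spring constant changes the amplitude by about 30% and the phase by 45°, which indicates why the dynamic mode is so sensitive when the quality factor is high.
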
Frequency, amplitude, and phase are readily measurable quantities that may be used
to map the local variation of $\partial F/\partial h$. Furthermore, the amplitude $\delta$ and in particular
the phase $\alpha$ depend on the damping factor. As a consequence, a phase image
at constant frequency of oscillation will deliver information on the local damping
factor, which depends on the local viscosity or internal friction of the specimen.
Thus, imaging at constant phase angle may also be termed “viscosity microscopy.”
Such viscosity imaging has proven to be particularly useful in distinguishing
soft rubber-like precipitates embedded in a thermoplastic matrix, a typical
microstructure of high-performance polymer construction materials (Bar and 
Meyers 2004). 

13.2.1.3 Cantilever Design for the AFM

In the early days of STM, obtaining sharp probe tips was an art performed in the 
laboratory, but this situation has changed. Nowadays, cantilevers for AFM are pro- 
duced by state-of-the-art microfabrication using the latest lithography and etching 
techniques on Si wafers. They are usually purchased from specialized companies. 
For imaging in the contact mode, a triangular cantilever in combination with an 
Si₃N₄ tip, formed as a square pyramid, is often used (see Fig. 13.12a). Silicon nitride
is much harder than Si and so reduces the risk of probe damage in direct
contact with the sample. The curvature radius at the apex varies from 5 to 50 nm. 
Microscopy in the dynamic mode prefers the geometry shown in Fig. 13.12(b). 
The cantilever is formed as a rectangular beam equipped with a Si tip, which 
may be produced with sharper radii below 10 nm. 

In the design of the cantilever geometry, one has to compromise between a low 
spring constant, which results in high force sensitivity, and a large resonance fre- 
quency, which is important for high scanning rates. The minimum force that can 
be measured is limited by thermal noise to $F_{\min}$, as given by Eq. (51), where $B$, $k_s$,
$f_0$, and $Q$ denote the bandwidth (which means the pixel frequency of the measurement),
spring constant, resonance frequency of the cantilever, and quality
factor, respectively (Yasumura et al. 2000). The latter is defined as usual by
$Q = 2\pi E/\Delta E$, the inverse of the relative energy dissipation per oscillation period.
Fig. 13.12 Cantilever geometries: a) triangular cantilever for the contact
mode equipped with an Si₃N₄ tip formed as a regular square pyramid;
b) rectangular beam with a Si tip as preferred for work in the dynamic 
mode. 
Thus, high force sensitivity requires a small spring constant and at the same time
a high resonance frequency, two requirements which normally oppose each other. 
The only way to decrease the spring constant without lowering the frequency is 
by optimizing the shape of the cantilever in accordance with Eq. (51) (Frederix 
et al. 2004). The spring constant of a rectangular cantilever bar of length $l$, thickness
$t$, and width $w$ is proportional to $wt^3/l^3$, while the frequency is proportional
to $\sqrt{k_s/m} \propto t/l^2$, where $m \propto \rho w t l$ is the cantilever mass. Therefore, obtaining a high sensitivity requires minimizing the
parameter combination $k_s/f_0 \propto wt^2/l$. In other words, very thin and narrow cantilevers
are advantageous to optimize sensitivity, which imposes a challenge to micro- 
fabrication. In addition, the reduction of the spring constant finds a further natu- 
ral limit, if attractive forces should be measured. As soon as the increase in the 
interaction force exceeds the spring constant when approaching the surface, i.e., 
$\partial F/\partial h > k_s$, the height position becomes unstable and the tip is drawn into the
sample surface. As a consequence, measurements close to the sample surface become
impossible if the spring constant is too low. Following typical design rules,
usual spring constants range from 10⁻² to 5 × 10² N m⁻¹ and the resonance
frequency from 10 to 1000 kHz.
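
The trade-off between spring constant and resonance frequency can be made concrete with a small Python sketch. It assumes the widely used expression for the thermomechanical force noise of a cantilever, F_min = sqrt(4 k_s k_B T B / (2π f₀ Q)); whether this matches Eq. (51) in every prefactor is an assumption. The cantilever parameters, dimensions, and function names are hypothetical example values.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant, J/K


def minimum_force(spring_constant, resonance_frequency, quality_factor,
                  bandwidth, temperature=300.0):
    """Thermal-noise-limited force (N), assuming
    F_min = sqrt(4 k_s k_B T B / (2 pi f_0 Q))."""
    return math.sqrt(4.0 * spring_constant * K_B * temperature * bandwidth
                     / (2.0 * math.pi * resonance_frequency * quality_factor))


def geometry_figure(width, thickness, length):
    """Quantity proportional to k_s / f_0 for a rectangular beam: w t^2 / l."""
    return width * thickness ** 2 / length


# Assumed example: k_s = 1 N/m, f_0 = 100 kHz, Q = 100, B = 500 Hz, T = 300 K
f_min = minimum_force(1.0, 100e3, 100.0, 500.0)   # about 0.36 pN

# Halving the thickness at fixed width and length lowers k_s / f_0 fourfold,
# and hence F_min twofold if Q is unchanged.
gain = geometry_figure(30e-6, 2e-6, 200e-6) / geometry_figure(30e-6, 1e-6, 200e-6)
```

Because F_min depends on the ratio k_s/f₀ only through a square root, a fourfold reduction of wt²/l improves the force sensitivity by a factor of two, which illustrates why thin and narrow cantilevers are pursued despite the fabrication effort.
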

In addition to the standard geometries discussed above, various modifications 
are in use for special purposes. For example, in tribology studies cantilevers may 
be equipped with the contact material in question; see, e.g., Fig. 13.13(a), which 
shows a glass sphere attached to a cantilever (Perry 2004), or the tip may be 
coated with a ferromagnetic layer to perform magnetic force microscopy (MFM), 
which probes the interaction with the magnetic stray field of the sample surface. 
By depositing a carbon-hydride supertip as an etching mask and subsequent ion 
beam sputtering, a single magnetic domain 50 nm in diameter may be formed at 
the tip apex as shown in Fig. 13.13(b) (Hartmann 1999). 



Fig. 13.13 Scanning electron micrographs of special cantilever tips:
a) sodium-borosilicate microsphere (5 µm radius) for tribology studies;
b) magnetic supertip deposited on top of a Si tip for high-resolution
MFM. (After Perry 2004; Hartmann 1999.)

13.2.1.4 Exemplary Studies by Scanning Probe Microscopy

By discussing a few experimental studies, the various possibilities of applying 
scanning probe microscopy to the field of metals and alloys are illustrated. Con- 
sidering STM, general AFM, and MFM as well, we try to document the broad 
spectrum of these techniques using a small set of examples. 


13.2.1.4.1 Chemical Contrast by STM and Surface Ordering 
It is clear that STM provides a topological image of the atomic structure. In addi- 
tion, since the tunneling current depends on the density of electronic states of the 
tip and sample, one may expect some chemical contrast between different atomic 
species, provided that the electronic states are not too delocalized. However, this 
expectation is in conflict with the general electronic property of metals. If the cor- 
rugation seen by the STM is estimated with a calculation of the local density of 
states (LDOS), the predicted contrast is usually so low that it is hardly distin- 
guishable from the background noise. As such, it was a surprise that Schmid 
et al. (1993) could nevertheless demonstrate chemical contrast at atomic resolu- 
tion on dense-packed surfaces of PtNi alloys. Meanwhile, chemical contrast has 
been found also on other alloy systems such as PtNi, PtRh, PtCo, PtAu, and 
AgPd (Varga and Schmid 1999). As all these alloys contain at least one noble ele- 
ment known for catalytic activity, knowledge of surface structure, potential segre- 
gation, and chemical ordering may be quite important. 

Figure 13.14 shows an example of an STM image of a Pt₂₅Ni₇₅ specimen. The
image was obtained in a constant current mode. To obtain atomic resolution with 




Fig. 13.14 a) STM image produced at a constant current of 16 nA;
sample bias −0.5 mV. The image size represents an area of
125 × 100 Å²; b) atomic arrangement at the surface of a Ni-37 at.% Pt
alloy at 420 K as predicted by MC simulation. (After Schmid et al.
1993.)
the quality shown, the sample had to be cleaned under UHV conditions by several
cycles of ion sputtering and annealing. It is easily seen without any quantitative
contrast analysis that two kinds of atoms can be distinguished by bright and
dark intensity. If “dark” and “bright” species are counted, the average composition
of the surface layer is determined to be about 50 at.% of each species.
However, the relative amount of “dark” species increases with sputter cleaning 
time. Since it is known that Ni has a higher sputtering probability, this observa- 
tion indicates that the “dark” species represent Pt, while the “bright” ones repre- 
sent Ni atoms. 

The clarity of the chemical contrast is remarkable. If image intensity is converted
into corrugation, a height difference of 0.3 Å between the two species is
found, which is about twice the difference in atomic radius of Ni and Pt, so that
a simple geometric explanation of the contrast is not possible. Furthermore, the
LDOS predicts a significantly smaller height contrast of only 0.03 Å.

Even more puzzling, the better the cleaning of the specimen before investigation,
the lower the probability of obtaining pronounced chemical contrast. Therefore,
Varga and Schmid have suggested an indirect mechanism involving adsorbate
atoms. If an adsorbate atom, for example an oxygen or sulfur atom, is picked up at
random by the probe tip, and this adsorbate has a different chemical affinity to the
different constituents of the alloy, a situation as sketched in Fig. 13.15 may appear.
The adsorbate is slightly drawn toward the surface atom of the attractive
species, establishing a kind of temporary chemical bond, so that the tunneling 
current increases due to the elevated density of states between the surface and 
the adsorbate. 

The experiment discussed is also very interesting from the point of view of al- 
loy physics, since short-range order on surfaces has been documented for the first 
time by direct imaging. Chemical (atomic) order is an important phenomenon 
of intermetallic compounds which determines plasticity and also functional 
properties such as conductivity or magnetization. In the bulk case, order may be 
completely described by the degree of long-range order and its fluctuations. At 
surfaces, however, several additional phenomena may be present. First, the com- 
position may deviate due to segregation to the surface. Second, in intermetallic 
compounds different kinds of crystal planes can be distinguished in certain low- 
indexed stacking directions. What is the preferred terminating layer at the sur- 
face? Last, but not least, one may ask whether the order symmetry of the bulk is 





Fig. 13.15 Principle of contrast formation by chemically selective adsorbates. 
disturbed or reconstructed at the surface. The experimental result in Fig. 13.14(a)
can answer these kinds of questions, if the detected surface arrangement is com- 
pared to Monte Carlo (MC) simulations, e.g., based on embedded atom poten- 
tials, as has been pointed out by Schmid and coworkers. 

In contrast to a random arrangement of both species, the STM image in 
Fig. 13.14 reveals a row-like alignment of like atoms, while, according to the 
bulk phase diagram, the Pt₂₅Ni₇₅ alloy should be a disordered solid solution. In
addition, a Pt content of about 50 at.% is seen in the surface layer, in striking con- 
trast to the mean composition of the sample. All these facts are well understood, 
if it is assumed that the material close to the surface becomes enriched in Pt to 
about 37 at.% by selective sputtering. According to the result of MC simulations 
shown in Fig. 13.14(b), the surface layer will be further enriched under these cir- 
cumstances to about 50 at.% Pt by segregation. At this composition a bulk alloy 
would be ordered in an L1₀ superlattice, of which the (111) net planes are characterized
by rows of like atoms along ⟨011⟩ directions (see the inset of Fig. 13.14a).
As is clear from the experimental image and MC simulation, the terminating 
plane prefers this ordered state of aligning like atoms in nearest-neighbor chains. 
However, since the L1₀ structure is tetragonal, long-range order in the surface
would lead to considerable strain with respect to the cubic bulk material below. 
To avoid the related stress, Nature prefers splitting the ordered surface structure 
into small domains, so that strain vanishes on a coarser scale. In consequence, 
the surface plane appears to be short-range ordered, as is evident from the STM 
image. 


13.2.1.4.2 Microstructure Characterization and Surface Topology by AFM 

Atomic resolution of the STM has been an important prerequisite in the previous 
example. Usually, atomic resolution in AFM is achieved only with considerable 
effort, in vacuum and with special surface preparation. Therefore, apart from a 
few exceptions, the main use of AFM is for studies on a somewhat coarser scale, 
say 1-10 nm. In this regime, AFM has the important advantage of rather simple 
specimen preparation and operation of the instrument, since the investigation 
may be performed under the ambient atmosphere. In this regard, the AFM is 
even replacing optical microscopy for surface metallography, since it offers 
much better resolution. 

The application of AFM in metallography has been checked in a detailed
study (Durst and Göken 2001) by comparison with results from transmission
electron microscopy (TEM) and conventional scanning electron microscopy
(SEM). The decomposed microstructure of a superalloy (“Waspaloy”) was characterized
in terms of the volume fraction, average size, and size distribution of
the precipitates. This specific microstructure had the advantage of a bimodal size
distribution of the γ′ precipitates with 16 nm and about 110 nm mean diameter.
Therefore, particle contrast and resolution could be determined in different size
regimes. Similarly to established practice in SEM work, the surface of the sample
was mechanically polished with 0.25 µm diamond paste and electrochemically
etched to achieve suitable particle contrast by selective etching. In Fig. 13.16(a),
Fig. 13.16 Microstructure of “Waspaloy” with γ′ particles in a
disordered matrix as observed by contact-mode AFM: a) topology
micrograph; b) histogram of pixel intensities corresponding to surface
heights. (After Durst and Göken 2001.)


an AFM image of the bimodal particle microstructure is shown which was pro- 
duced in contact mode. Both particle fractions are clearly imaged with very simi- 
lar brightness. Therefore, only two maxima appear in the histogram of the height 
profile (Fig. 13.16b), corresponding to the matrix and precipitate phase, respec- 
tively. By correct choice of a threshold between them, precipitates are reliably 
distinguished from the matrix independently of their size. This ease of contrast 
interpretation is an important advantage of AFM, when micrographs need to be 
analyzed automatically. 
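
The automatic separation of matrix and precipitate pixels can be sketched as follows. The text only requires a threshold between the two histogram maxima; the iterative intermeans (isodata) rule used here is one common way of choosing it and is not necessarily the procedure of the cited study. The synthetic height data and function names are hypothetical.

```python
def intermeans_threshold(heights, tol=1e-6, max_iter=100):
    """Threshold halfway between the mean heights of the two pixel classes.

    Starts from the global mean and iterates until the threshold no
    longer changes (iterative intermeans / isodata rule).
    """
    threshold = sum(heights) / len(heights)
    for _ in range(max_iter):
        low = [z for z in heights if z <= threshold]
        high = [z for z in heights if z > threshold]
        if not low or not high:
            break  # only one class present, keep the current threshold
        new_threshold = 0.5 * (sum(low) / len(low) + sum(high) / len(high))
        converged = abs(new_threshold - threshold) < tol
        threshold = new_threshold
        if converged:
            break
    return threshold


def area_fraction_above(heights, threshold):
    """Fraction of pixels whose height exceeds the threshold."""
    return sum(1 for z in heights if z > threshold) / len(heights)


# Synthetic example: 70 matrix pixels at 0 nm, 30 precipitate pixels at 16 nm
heights_nm = [0.0] * 70 + [16.0] * 30
threshold_nm = intermeans_threshold(heights_nm)               # 8.0 nm
fraction = area_fraction_above(heights_nm, threshold_nm)      # 0.3
```

For real micrographs the two maxima are broadened by roughness and tip effects, so the result should be checked against the histogram before the area fraction is used further.
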

From the spacing between the two maxima in the histogram, the effect of selective
etching is determined quantitatively. In the example shown, γ′ precipitates
tend to protrude from the matrix level by about 16 nm. Quantitative comparison 
of the size distributions obtained by AFM and both electron microscopy tech- 
niques revealed that the size scale of the AFM is systematically shifted by a cer- 
tain amount. In comparison to electron microscopy, the average size of the small 
and the large particles appeared enlarged by the same amount. The schematic 
drawing in Fig. 13.17 illustrates how this discrepancy is understood as an artifact 
introduced by the finite radius of the probe tip. However, having identified this 
situation, the particle size measured by AFM can be corrected if the tip radius 
and height difference between protruding particles and matrix are known. By ge- 
ometric considerations, the erroneous overestimate x of the particle radius is 
given in Eq. (52) (for definitions of the symbols, see Fig. 13.17). 


$$R = \frac{x^2 + h^2}{2h} \qquad (52)$$
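
The geometric correction can be written out as a short Python sketch. It assumes a spherical tip apex of radius R touching the upper edge of a step of height h ≤ R, so that x² + (R − h)² = R², which is equivalent to R = (x² + h²)/(2h). The tip radius of 20 nm is an assumed example value, the step height of 16 nm is the protrusion quoted in the text, and the function names are hypothetical.

```python
import math


def radius_overestimate(tip_radius, step_height):
    """Lateral broadening x caused by a spherical tip at a step (h <= R)."""
    if not 0.0 <= step_height <= tip_radius:
        raise ValueError("this sketch is valid only for 0 <= h <= R")
    return math.sqrt(2.0 * tip_radius * step_height - step_height ** 2)


def tip_radius_from_calibration(x, step_height):
    """Tip radius R from a measured broadening x at a sharp edge of height h."""
    return (x ** 2 + step_height ** 2) / (2.0 * step_height)


def corrected_particle_radius(apparent_radius, tip_radius, step_height):
    """Particle radius after removing the tip-induced broadening."""
    return apparent_radius - radius_overestimate(tip_radius, step_height)


# Assumed tip radius 20 nm, particles protruding by 16 nm
x_nm = radius_overestimate(20.0, 16.0)                  # about 19.6 nm
r_check = tip_radius_from_calibration(x_nm, 16.0)       # recovers 20 nm
```

Since x depends only on R and h, small and large particles are enlarged by the same amount, in line with the observation described above.
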


The height difference h for a particular measurement can easily be determined 
from a histogram like that presented in Fig. 13.16(b), while the unknown radius 
of the probe tip must be calibrated by measurements of known microstructures 
or on special samples containing sharp vertical edges. The authors (Durst and 
Fig. 13.17 Schematic diagram of particle contrast formation with AFM.
Particles protrude by a height difference h with respect to the matrix
level. The ideal height profile (top) appears experimentally broadened
(bottom) at a scale defined approximately by the tip radius R. (After
Durst and Göken 2001.)


Göken 2001) conclude that AFM is a highly recommendable alternative to optical
microscopy. As the main advantages, they point to a much better spatial resolu- 
tion than is achieved with optical microscopy or even SEM, the uncomplicated 
specimen preparation in comparison to TEM, and the clear image contrast allow- 
ing an easy numerical evaluation of the information. AFM is also used for the 
routine microstructural characterization of technical polymers, which often con- 
sist of a two-phase microstructure of thermoplastic matrix and soft rubber-like 
particles to improve the toughness. In this case, the tapping mode in combina- 
tion with a phase image delivers optimal contrast due to the different viscoelastic 
properties of the polymer materials (Bar and Meyers 2004). 

In order to characterize heterogeneous microstructures as in the previous ex- 
ample, an artificial height contrast has to be produced by specimen preparation. 
Thus, the surface sensitivity of SPM is used only as an indirect means to obtain 
information on volume properties. There are many other cases in which the sur- 
face plays a direct role in the physical mechanism of interest, and surface infor- 
mation is desired. An illustrative example is the study of thermal grain boundary 
grooving, which allows the determination of surface diffusion coefficients and of 
grain boundary energies as well. At the intersection of a grain boundary with the 
free surface, a groove develops during annealing because thermal equilibrium re- 
quires that the tensions of the free surface and the grain boundary compensate 
each other. A general theory that allows the quantitative evaluation of the groov- 
ing profile was developed by Mullins (1957). Therefore, grain boundary grooving 
has already been investigated in the past by means of optical microscopy on cross- 
sections of the surface. 
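
The equilibrium condition at the groove root can be illustrated with a minimal Python sketch. It assumes isotropic surface energies that are equal on both sides of the groove, so that the balance of the vertical tension components gives γ_gb = 2 γ_s sin β, where β is the inclination of each groove flank at the root. This covers only the force balance, not the full time-dependent profile of Mullins' theory; the angles and function names are hypothetical example values.

```python
import math


def boundary_to_surface_energy_ratio(root_angle_deg):
    """gamma_gb / gamma_s from the groove root angle beta (degrees).

    beta is measured between each groove flank and the flat surface.
    Assumes isotropic, equal surface energies on both sides.
    """
    return 2.0 * math.sin(math.radians(root_angle_deg))


def dihedral_angle_deg(root_angle_deg):
    """Dihedral angle psi between the two groove flanks: psi = 180 - 2 beta."""
    return 180.0 - 2.0 * root_angle_deg


ratio_shallow = boundary_to_surface_energy_ratio(2.0)    # about 0.07
ratio_general = boundary_to_surface_energy_ratio(10.0)   # about 0.35
```

A low-energy boundary thus produces a root angle of only a few degrees, which makes clear why the height resolution of the measurement matters for such grooves.
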

However, low-indexed grain misorientations are distinguished by particularly 
small grain boundary energies. The resulting shallow grooves are hardly detect- 
able by optical microscopy, so that AFM provides an important improvement 
in this area. In Fig. 13.18(a), a contact-mode AFM image of a surface region is 
Fig. 13.18 a) AFM image of the surface of a Cu polycrystal, showing a region
with a triple-junction of three grains and the related surface grooves. 
b) Height profile measured by AFM across a grain boundary groove of 
Ni. The solid line represents a solution of Mullins’ model. (After Weber 
et al. 2000.) 


shown, where three Cu grains meet in a triple line (Weber et al. 2000). All three 
grain boundaries are marked by well-developed grooves. An experimental height 
profile across such a grain boundary is presented in Fig. 13.18(b). The trace of the 
profile is well described by Mullins’ theory. By quantitative evaluation of the 
height profiles, Weber and coworkers could determine the energy of various low-sigma
grain boundaries (even Σ3) and the temperature dependence of the surface
diffusion coefficients for Ni and Cu polycrystals. The improved accuracy of the 
AFM method has been utilized also to investigate in detail deviations from Mul- 
lins’ theory that are due to the anisotropy of the surface energy (Rabkin et al. 
2000). 
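The balance of tensions at the groove root and the growth of the groove with annealing time can be illustrated by a short numerical sketch. It assumes an isotropic surface energy, so that the grain boundary energy follows from the dihedral angle ψ at the groove root as γ_gb = 2γ_s cos(ψ/2), and it uses the commonly quoted result of Mullins' surface-diffusion analysis that the distance between the two groove maxima grows as w = 4.6 (Bt)^(1/4). The function names and all numerical inputs are hypothetical and are not taken from Weber et al.

```python
import math

def gb_energy_ratio(dihedral_angle_deg):
    """gamma_gb / gamma_s from the dihedral angle psi at the groove root.

    Assumes isotropic surface energy: gamma_gb = 2 * gamma_s * cos(psi / 2).
    """
    return 2.0 * math.cos(math.radians(dihedral_angle_deg) / 2.0)

def mullins_B_from_width(width_m, time_s):
    """Mullins coefficient B (m^4/s) from the groove width.

    Assumes surface-diffusion-controlled grooving, w = 4.6 * (B * t)**0.25,
    where w is the distance between the two maxima of the groove profile.
    """
    return (width_m / 4.6) ** 4 / time_s

# Hypothetical inputs, for illustration only.
ratio = gb_energy_ratio(160.0)            # shallow groove, dihedral angle 160 deg
B = mullins_B_from_width(2.0e-6, 3600.0)  # 2 um wide groove after a 1 h anneal
print(f"gamma_gb / gamma_s = {ratio:.3f}")
print(f"B = {B:.3e} m^4/s")
```

In Mullins' treatment the coefficient B collects the surface diffusion coefficient, the surface energy, the atomic volume, and the temperature, so a surface diffusion coefficient would follow from B only once those additional quantities are known.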

As a final example of modern metallography by AFM methods, the measurement
of local hardness by nanoindentation is presented. For this, the special
transducer of the NI-AFM technique is used, as discussed previously (see Section
13.2.1.1). Since the transducer may also be used to produce an image of the
topography, a heterogeneous microstructure, suitably etched, may be imaged before and
after the hardness test. In this way, the location of the indent can be controlled
very precisely, even to an accuracy of a few nanometers. In Fig. 13.19, an NI-AFM
image is presented from a study by Göken and coworkers (Göken and
Kempf 1999). The heterogeneous microstructure of a Nimonic superalloy is
shown, which is characterized by ordered cube-shaped γ′ particles (bright) embedded
in a disordered matrix (dark). Due to the small size of the diamond indenter,
hardness measurements could be performed on precipitate and matrix
phases independently. Four indents distributed on the matrix and precipitates
are seen as dark triangles, about 100 nm in size. If the maximum load is reduced
to about 100 µN, the size of the indents may even be reduced to about 20 nm. The
indents located on the matrix phase seem to be slightly larger than those on the γ′
phase, which indicates that the intermetallic compound has a greater hardness.
However, on this short length scale, the boundary of the indents is not well defined,
which makes quantitative evaluation of the indent’s cross-section (usually
used for hardness measurements) difficult. In order to overcome this difficulty,
use of the unique feature of a local force measurement is suggested. Thus,
stress-strain curves are determined separately for each phase, as presented in
Fig. 13.19(b). These curves provide the desired information on irreversible plastic
deformation and confirm the greater hardness of the intermetallic precipitates.
They also yield information on local elasticity.

Fig. 13.19 a) NI-AFM image of a heterogeneous Nimonic superalloy
structure. Indents produced by the nanohardness tester with a
maximum force of 500 µN are evidenced by dark triangles on the matrix
(dark) and the precipitate phase (bright). b) Local stress-strain curves
of the matrix and precipitate phase. (After Göken and Kempf 1999.)


13.2.1.4.3 Imaging of Nanomagnets by Magnetic Force Microscopy 

Using special probes, the AFM principle allows the study of a broad range of particular
interaction forces. Here, magnetic force microscopy (MFM) is discussed as a
variant which is especially important for the study of metals and alloys. In order 
to investigate magnetism on a microscopic scale, a single-domain nanoparticle is 
attached to the probe tip as described previously in Section 13.2.1.3. Usually, the 
dynamic mode with a comparatively large specimen-tip distance is used to avoid 
artifacts, which may be induced by the tip field. Frequency-shift images reflect the 
interaction of the tip with the stray field of the sample as discussed in Section 
13.2.1.2. The MFM technique dates back to 1987 (Martin and Wickramasinghe 
1987; Saenz et al. 1987). Important breakthroughs are the resolution of the inter- 
nal fine structure of domain boundaries (Göddenhenrich et al. 1990) and the 
imaging of individual flux lines in superconducting materials (Moser et al. 1995; 
see Chapter 14, Section 14.6), which were made possible by attaching nanoscale 
ferromagnetic probes to the tip (see Fig. 13.13). Nowadays, MFM is used routinely
in industry for checking the domain structure and quality of data storage devices
such as surfaces of hard disk coatings or giant magneto-resistance (GMR) recording
heads (see Chapter 14, Section 14.4). Its widespread use is particularly due to
the fact that no special specimen preparation is required and that the technique is 
hardly influenced by surface contamination. 

Scientifically, MFM is widely used to investigate artificial magnetic nanostructures,
which currently attract considerable interest due to their potential in
ultrahigh-density storage and spin electronics (see Chapter 14, Section 14.4).
An experimental example (Zhu and Grütter 2004) is shown in Fig. 13.20. A regular
array of single-domain particles made of Permalloy (NiFe) was produced by
lithography. Each particle has an elongated shape, 240 × 90 × 10 nm³ in size, so
that only two well-defined magnetizations parallel to the long axis are allowed by
shape anisotropy. If an external field is applied, the magnetization may be
switched between these two states. In the micrographs, each particle appears as
a pair of black and white dots, which is due to the positive and negative interactions
with the stray field at both ends of the magnetic dipoles. Thus, the direction
black-to-white indicates the individual magnetic state of a particle. The possibility
of switching the elements is seen by a comparison of Figs. 13.20(a) and (b). If
similar pictures are obtained after applying different magnetic fields, a complete
hysteresis loop may be calculated by counting the number of individual “up” and
“down” states, as shown in Fig. 13.20(c), which compares reasonably well with a
hysteresis curve obtained by macroscopic magnetometry, shown in Fig. 13.20(d).

Fig. 13.20 MFM at a regular array of elongated single domain particles:
a) remanent state after applying a field of 304 Oe along the long axis of
the particles; b) after applying 510 Oe. c) Ensemble hysteresis curve
determined as the average of the microscopically observed individual
states. d) Hysteresis curve measured by macroscopic magnetometry.
(After Zhu and Grütter 2004.)
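The counting procedure that turns a series of MFM images into an ensemble hysteresis loop can be sketched in a few lines. The example assumes that each particle has already been classified as “up” (+1) or “down” (-1) from its black-to-white direction; the field values and particle states below are invented for illustration and do not reproduce the data of Zhu and Grütter.

```python
import numpy as np

def ensemble_magnetization(states):
    """Normalized magnetization M/M_s of an array of bistable particles.

    states: sequence of +1 ("up") and -1 ("down"), one entry per particle,
    as read off the black-to-white direction in an MFM image.
    """
    states = np.asarray(states)
    n_up = np.count_nonzero(states == 1)
    n_down = np.count_nonzero(states == -1)
    return (n_up - n_down) / (n_up + n_down)

# Hypothetical example: 10 particles imaged in remanence after three fields (Oe).
images = {
    -600.0: [-1] * 10,            # all "down" (saturated)
    300.0: [-1] * 7 + [1] * 3,    # a few particles have switched
    600.0: [1] * 10,              # all "up"
}
loop = {field: ensemble_magnetization(s) for field, s in images.items()}
for field in sorted(loop):
    print(f"H = {field:7.1f} Oe   M/Ms = {loop[field]:+.2f}")
```

Each image contributes one point of the loop; a full loop requires images taken after both increasing and decreasing field steps.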

In the present example, the spacing of the particles was chosen to avoid their
interaction. The interaction between particles has been studied with chain-like
patterns having a smaller spacing. The observed process may be described as
propagation of information. In addition, the switching of individual elements
could be investigated by MFM under applied external fields. An intermediate
vortex state was observed (Zhu and Grütter 2004).


13.2.2 High-Resolution Transmission Electron Microscopy and Related Techniques


High-resolution transmission electron microscopy (HRTEM) provides unique ca- 
pabilities for determining the atomic structure, composition, and behavior of de- 
fects in alloys at or near the atomic level (Spence 1988; Williams and Carter 1996; 
Fultz and Howe 2002). Such information is critical to understanding the atomic 
structures of dislocations (see Chapters 3 and 6), or the mechanisms of phase 
transformations (see Chapters 5, 7, and 8), for example (Amelinckx and van 
Dyck 1992; Howe 1999). In this section, we describe the technique of HRTEM, 
particularly as related to in-situ dynamic studies, and then show how the tech- 
nique can be used to understand the behavior of materials at the atomic level. 
From the examples, we see that in-situ HRTEM allows direct observation of the 
dynamic behavior of atomic processes, limited mainly by the time resolution of 
the image recording devices (~0.03 s) and the averaging that occurs through the 
specimen along the projection direction. The latest generations of HRTEMs can 
obtain image resolutions of less than 0.1 nm, so that the image resolution is ade- 
quate for most materials problems of interest (Batson et al. 2002; Pennycook et al. 
2006). In fact, HRTEM is such a powerful tool for the characterization of materi- 
als that some microstructural features are defined in terms of their visibility in 
HRTEM images. 

Chemical analysis is not discussed in this section, but current HRTEMs have 
the ability to form electron probes smaller than 0.1 nm, so that chemical analyses 
can be performed near this level of spatial resolution using either energy- 
dispersive X-ray spectroscopy (EDXS) (Joy et al. 1986) or electron energy-loss 
spectroscopy (EELS) (Egerton 1996). This is because the high-energy electrons 
(usually with 100-400 keV of kinetic energy) in an HRTEM cause electronic exci- 
tations of the atoms in the specimen. The important spectroscopic techniques of 
EDXS and EELS make use of these excitations by incorporating suitable detectors 
into the HRTEM. In EDXS, an X-ray spectrum is collected from small regions
of the specimen illuminated with a focused electron probe, using a solid-state detector.
Characteristic X-rays of each element are used to determine the concentrations
of the different elements present in the specimen. In EELS, a magnetic
prism is used to separate the electrons according to their energy losses after hav- 
ing passed through the specimen. Energy loss mechanisms such as plasmon 
excitations and core-electron excitations cause distinct features in EELS. These 
can be used to quantify the elements present, to provide information about
atomic bonding, and to study a variety of other phenomena. The reader is referred
to the following references for more information on these techniques: Joy
et al. (1986); Egerton (1996); Williams and Carter (1996). 

In scanning transmission electron microscopy (STEM), a focused beam of elec- 
trons (typically less than 0.5 nm in diameter) is scanned in a television-style 
raster pattern across the specimen, as in a scanning electron microscope (SEM) 
(Keyse et al. 1998). In synchronization with the raster scan, products resulting 
from the interaction of the electron beam with the specimen are collected, such 
as emitted X-rays, or secondary or backscattered electrons, to form images. Elec- 
trons that pass through the specimen can also be detected to form images that 
are similar to usual HRTEM images. An annular detector can be used to collect 
the scattered transmitted electrons and this leads to so-called Z-contrast imaging 
(Browning et al. 1997). The STEM mode of operation is particularly useful 
for spectroscopic analysis, since it permits the acquisition of a chemical map of 
the sample with subnanometer spatial resolution under optimum conditions. 
For example, an image of the distribution of Fe in a sample can be made by
recording, in synchronization with the raster pattern, either the emission
from the sample of Fe Kα X-rays (with the EDXS spectrometer) or the transmitted
electrons with energy losses greater than that of the Fe L-edge (with the EELS
spectrometer).
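The formation of such a chemical map can be sketched as follows, assuming that a full spectrum has been stored for every probe position of the raster (a so-called spectrum image). The array layout, the function name, and the synthetic data are assumptions made for illustration; a real analysis would also subtract the background under the characteristic signal, which is omitted here.

```python
import numpy as np

def elemental_map(spectrum_image, energy_axis, window):
    """Integrate the counts inside an energy window at every probe position.

    spectrum_image: array of shape (ny, nx, n_channels), one spectrum per pixel
    energy_axis:    array of shape (n_channels,), channel energies (here in keV)
    window:         (e_min, e_max) bracketing the characteristic signal
    """
    e_min, e_max = window
    mask = (energy_axis >= e_min) & (energy_axis <= e_max)
    return spectrum_image[:, :, mask].sum(axis=2)

# Synthetic 4 x 4 scan with 100 channels from 0 to 10 keV (illustration only).
energy = np.linspace(0.0, 10.0, 100)
cube = np.ones((4, 4, 100))                 # flat background, 1 count per channel
peak = (energy >= 6.3) & (energy <= 6.5)    # window around Fe K-alpha (about 6.4 keV)
cube[1:3, 1:3, peak] += 50.0                # Fe-rich 2 x 2 region
fe_map = elemental_map(cube, energy, (6.3, 6.5))
print(fe_map)
```

The same integration applies to an EELS spectrum image if the window is placed above the relevant ionization edge.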

A fully equipped HRTEM has the capability to record the variations in image 
intensity across the specimen using mass thickness or diffraction contrast tech- 
niques, to reveal the atomic structure of materials using so-called high-resolution 
(phase-contrast) imaging or Z-contrast (incoherent) imaging, to obtain electron 
diffraction patterns from small areas of the specimen using a selected-area aper- 
ture or a focused electron probe, and to perform EELS and EDXS measurements 
with a small probe. Additional lenses can be installed in conjunction with an 
EELS spectrometer to create an energy filter, enabling one to form energy-filtered 
TEM images (EFTEM). These images allow one to map the chemical composition 
of a specimen with subnanometer spatial resolution. A block diagram of such an 
HRTEM is shown in Fig. 13.21 (Fultz and Howe 2002). 

Fig. 13.21 Block diagram of a typical TEM with STEM capability (from Fultz and Howe 2002).

In addition to the six main techniques of (a) conventional imaging, (b) phase-contrast
imaging, (c) Z-contrast imaging, (d) electron diffraction, (e) EDXS, and
(f) EELS, it is possible to perform many other analyses in an HRTEM. For example,
when electrons pass through a magnetic specimen, they are deflected slightly
by Lorentz forces, which change direction across a magnetic domain wall. In a
method known as Lorentz microscopy, special adjustments of lens currents permit
imaging of these domain walls. Using in-situ microscopy, phase transformations
and microstructural changes in a specimen can be observed directly as the
specimen is heated, cooled, or deformed in the microscope, using special specimen
stages. Differential pumping can be used to allow the introduction of gases
into the microscope column surrounding a thin foil, making it possible to follow
chemical reactions in an HRTEM in situ. Many of these techniques can be performed
at spatial resolutions of a few tenths of a nanometer. The possibilities
are almost endless, and that is why HRTEM is an indispensable tool in materials
research (Williams and Carter 1996). Since it is not possible to cover all of these
techniques, this section focuses on the theory and practice of HRTEM imaging
and in-situ HRTEM techniques.


13.2.2.1 Principles of Image Formation in and Practical Aspects of High-Resolution Transmission Electron Microscopy


13.2.2.1.1 Principles of Image Formation 

HRTEM is phase contrast imaging (Spence 1988). In phase contrast imaging, 
two or more beams are allowed to interfere to form an image. With two or a row 
of beams contributing, a fringe pattern is obtained, while three or more non-collinear
beams can produce a lattice image. The spacings which one wants to
resolve in the image must be within the resolution limits of the instrument; i.e.,
the lens system must preserve the coherence of the image-forming beams. This 
section gives a general outline of phase contrast imaging. Practical aspects of 
HRTEM imaging and interpretation of HRTEM images are discussed in Section 
13.2.2.1.2.

The definitive feature of HRTEM is perhaps the two-dimensional nature of the 
information it provides. While a specimen may be tilted in conventional TEM 
to obtain three-dimensional information, one is generally limited with HRTEM 
to observations of projected atomic structures along low-index zone axes with pla- 
nar spacings that are within the resolution limits of the microscope. For this rea- 
son, one would like to interpret a HRTEM image directly as a simple map of the 
projected structure (crystal potential). However, such an interpretation is only 
possible for a very narrow range of instrumental parameters and specimen thick- 
nesses. This situation is well described theoretically and so provides the starting 
point for the following discussion (Cowley 1975; Spence 1988). 

A useful approximation in the description of the interaction of the electron
beam with the specimen is found in the assumption of phase changes only, that
is, a phase object. An incident wavefunction $\psi_0$ incurs a phase change proportional
to the crystal potential expressed by Eq. (53), where $\sigma$ is the electron interaction
parameter, $t$ is the specimen thickness, and $\phi_p(x, y)$ is the projected specimen
potential in the x-y plane, normal to the electron beam.


$$\psi = \psi_0 \exp[-i\sigma t\,\phi_p(x, y)] \tag{53}$$


By additionally assuming a weak interaction (and an incident wave of unit
amplitude), one obtains the weak phase object (WPO) approximation, Eq. (54).


$$\psi \approx 1 - i\sigma t\,\phi_p(x, y) \tag{54}$$


Application of the WPO approximation requires that Eq. (55) holds.


$$t \ll \frac{1}{\sigma \phi_p(x, y)} \tag{55}$$
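A quick numerical check illustrates how the quality of the WPO approximation depends on the dimensionless product σtφ_p. The sketch below compares the exact phase-object exit wave of Eq. (53) with its first-order form, Eq. (54), for a unit-amplitude incident wave, and evaluates the image intensity for an idealized lens that shifts the scattered part of the wave by -π/2. The function names are illustrative, and the ideal lens is an assumption of this sketch rather than a model of a real objective lens.

```python
import cmath

def exit_wave(sigma_t_phi):
    """Exact phase-object exit wave for unit incident amplitude, Eq. (53)."""
    return cmath.exp(-1j * sigma_t_phi)

def exit_wave_wpo(sigma_t_phi):
    """Weak-phase-object (first-order) exit wave, Eq. (54)."""
    return 1.0 - 1j * sigma_t_phi

def ideal_image_intensity(wave):
    """Intensity after an ideal lens shifts the scattered part by -pi/2.

    The unscattered part is taken as 1; only (wave - 1) is phase shifted.
    """
    scattered = wave - 1.0
    return abs(1.0 + scattered * cmath.exp(-1j * cmath.pi / 2)) ** 2

for x in (0.01, 0.1, 0.5, 1.0):   # x = sigma * t * phi_p (dimensionless)
    err = abs(exit_wave(x) - exit_wave_wpo(x))
    intensity = ideal_image_intensity(exit_wave(x))
    print(f"x = {x:4.2f}  |psi - psi_WPO| = {err:.2e}  "
          f"I = {intensity:.4f}  1 - 2x = {1 - 2 * x:.4f}")
```

For the first-order wave the intensity is exactly (1 - x)², so the linear term 1 - 2x is a good description only while x remains much smaller than unity, which is the content of Eq. (55).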


Hence, the maximum allowable thickness decreases with increasing atomic number, though
it is less than 10 nm for most materials of interest. The above form of the exit
wavefunction has a straightforward physical interpretation. The dominant transmitted
wave is approximated by $\psi_0$ (unit amplitude) while to this is added a relatively
weak scattered wave of amplitude $\sigma t\,\phi_p(x, y)$ and phase $-\pi/2$ relative to the
unscattered portion, as it is purely imaginary. For such thicknesses and assuming
an ideal Scherzer lens, i.e., one that imposes a phase change of $-\pi/2$ on all diffracted
beams, a linear relationship between image intensity $I(x, y)$ and projected
crystal potential is obtained [Eq. (56)].


$$I(x, y) = 1 - 2\sigma t\,\phi_p(x, y) \tag{56}$$

The effect of the objective lens must then be considered. In an ideal lens, diffracted
beams undergo a phase shift that is strictly a function of their angle from
the optic axis. Assuming the transmitted beam is aligned along the optic axis, this
angle is $2\theta_B$, twice the Bragg angle of the respective spatial frequency. Allowing
for spherical aberration and a lens defocus $\Delta f$, the phase change is given by Eq.
(57),


$$\chi(g) = \pi \lambda g^2 \left[\tfrac{1}{2} \lambda^2 C_s g^2 + \Delta f\right] \tag{57}$$


where $\lambda$ is the electron wavelength, $g$ is the magnitude of the reciprocal lattice
vector, and $C_s$ is the spherical aberration coefficient of the objective lens. By definition,
$\Delta f$ is the distance from the specimen exit plane to the lens object plane.
The defocus is taken as negative for an underfocused lens (obtained by reducing
the lens current).

The effect of the objective lens is then to multiply each diffracted beam by the
phase factor $\exp[i\chi(g)]$. Under the WPO approximation, the transmitted beam
($\chi = 0$) and the real components of the diffracted beams interfere to form the image,
i.e., a diffracted beam’s contribution is proportional to $\sin\chi$. For negative
$\sin\chi$, the diffracted beams interfere destructively with the transmitted beam,
producing “black” atom images, while a positive $\sin\chi$ leads to “white” atoms.

It is usual to plot a linear contrast transfer function (CTF) to describe the
lens action, that is, a plot of $\sin\chi$ versus $g$. For a WPO, the largest number of
beams will contribute to the image by maximizing the portion of the CTF where
$\sin\chi = -1$ or close to it. Requiring that $\sin\chi < -1/\sqrt{2}$ leads to the Scherzer defocus
value, Eq. (58) (Scherzer 1949).


$$\Delta f_{\mathrm{Sch}} = -1.2\,(C_s \lambda)^{1/2} \tag{58}$$

The first zero crossover gives the Scherzer resolution limit [Eq. (59)].

$$d_{\mathrm{Sch}} = 0.7\, C_s^{1/4} \lambda^{3/4} \tag{59}$$


This is the highest resolution at which one may hope to interpret an HRTEM image
directly. Decreasing the defocus value leads to higher-order passbands at $\Delta f_n$
given by Eq. (60), which may be employed to resolve finer detail, $n = 0$ giving the
Scherzer passband.


$$\Delta f_n = -\left[\tfrac{1}{2} C_s \lambda (8n + 3)\right]^{1/2} \tag{60}$$


Since higher passbands exclude some lower-angle scattering, their direct interpre- 
tation is not possible, particularly in the case of defect imaging, for which diffuse 
scattering carries important image information. In addition, electronic instabilities
effectively damp higher frequencies, limiting the number of useful passbands.
This damping limit is usually called the information resolution limit.

As seen above, proper selection of defocus is employed to optimize resolution.
While working at the microscope, one uses the minimum contrast condition as a
reference point. At this defocus setting, diffracted beams are as close to a $-\pi/2$
total phase shift as possible, minimizing interference with the transmitted beam.
In other words, $\sin\chi$ is close to zero over a maximized range. This defocus is given
by Eq. (61) and is about one-third of the Scherzer defocus value.


$$\Delta f_{\min} = -0.44\,(C_s \lambda)^{1/2} \tag{61}$$
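The defocus conditions discussed above are easy to explore numerically. The following sketch evaluates the lens phase shift and the CTF, sin χ, at the Scherzer defocus of Eq. (58). Sign conventions for the defocus differ between texts; here it is assumed that underfocus is negative and enters the phase shift as χ(g) = πλg²(½λ²C_s g² + Δf), so that a negative defocus produces a band with sin χ close to -1. The accelerating voltage (200 kV) and C_s = 1.0 mm are illustrative values only.

```python
import numpy as np

def chi(g, wavelength, cs, defocus):
    """Lens phase shift chi(g); defocus < 0 means underfocus (assumed convention)."""
    return np.pi * wavelength * g**2 * (0.5 * wavelength**2 * cs * g**2 + defocus)

def scherzer_defocus(wavelength, cs):
    """Scherzer defocus, Eq. (58)."""
    return -1.2 * np.sqrt(cs * wavelength)

def first_zero(wavelength, cs, defocus):
    """Spatial frequency g > 0 at which chi returns to zero (first CTF zero)."""
    return np.sqrt(-2.0 * defocus / (cs * wavelength**2))

# Illustrative values (assumptions): 200 kV electrons, Cs = 1.0 mm, lengths in nm.
wavelength = 2.508e-3   # nm, relativistic electron wavelength at 200 kV
cs = 1.0e6              # nm
df = scherzer_defocus(wavelength, cs)
g0 = first_zero(wavelength, cs, df)
g = np.linspace(0.0, g0, 2001)
ctf = np.sin(chi(g, wavelength, cs, df))

print(f"Scherzer defocus    = {df:.1f} nm")
print(f"first CTF zero at d = {1.0 / g0:.3f} nm")
print(f"minimum of sin(chi) = {ctf.min():.3f}")
```

With the prefactor 1.2 in Eq. (58), the first zero of this model CTF corresponds to about 0.65 C_s^(1/4) λ^(3/4), slightly below the value given by the rounded prefactor of Eq. (59). Damping of the CTF by partial coherence, which sets the information limit, is not included in this sketch.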


The above concepts are commonplace in discussions of HRTEM results even 
though the WPO approximation is often invalid because it is quite difficult to pro- 
duce a good specimen only a few nanometers thick. However, they provide an 
acceptable starting point for selecting proper imaging conditions and for image 
interpretation. Proper evaluation of defect atomic positions requires concurrent 
image simulations using hypothetical models until good matching with experi- 
mental images is achieved. 


13.2.2.1.2 Practical Aspects of HRTEM 
In order to interpret HRTEM images, one needs to know how various specimen 
and microscope parameters affect image contrast. There are many factors that 
can potentially contribute to HRTEM image contrast and it is not possible to dis- 
cuss all of them here. Instead, we examine how some of the more important pa- 
rameters that tend to be less easy to control and/or quantify affect image contrast. 
Particular attention is paid to practical guidelines that can be used to obtain as 
much information about defect structures as possible without expending too 
much effort (Fultz and Howe 2002). 
Below are listed most of the specimen and microscope parameters that must be
determined or considered when interpreting HRTEM images:

• sample thickness
• objective lens focus
• beam coherence
• beam convergence
• objective aperture
• crystal orientation
• beam tilt
• interface geometry
• surface effects
• thin-foil relaxation
• exposure time
• specimen drift
• projection problem
• beam damage.


In terms of the microscope, one must optimize and quantify as many of the
microscope parameters as possible, just as one would do for any HRTEM investigation.
Several microscope parameters such as the accelerating voltage, spherical
aberration coefficient of the objective lens, and Gaussian spread of focus are provided
by the manufacturer and can be directly input into image simulation programs.
Other parameters that are fairly straightforward to determine include the
objective lens defocus, which can be determined by taking a fast Fourier transform
(FFT) of an image of the amorphous edge of a specimen; the semi-angle of beam
convergence, which can be determined directly from the diameter of the disks in the
diffraction pattern; and the objective aperture radius (if used) and position, which
can be determined by using a double exposure to photograph the objective
aperture on the diffraction pattern. Although it is possible to quantify the degree
of astigmatism in an HRTEM image, one ideally tries to eliminate any astigmatism,
since it cannot be incorporated into most image simulation programs
(O’Keefe 1984).

One of the most difficult microscope parameters to quantify is the amount of 
beam tilt present during imaging. It has been demonstrated that for most defect 
structures, alignment of the electron beam to within one milliradian of the optic 
axis or better is necessary for reliable interpretation of the atomic structure, par- 
ticularly in ordered crystals with large unit cells (Smith et al. 1983, 1985). While 
the effect of beam tilt can be readily included in image calculations for compari- 
son with experimental images, it is more desirable to minimize this effect before 
imaging. One technique for obtaining the required accuracy in alignment is to 
apply equal and opposite tilts to the incident beam and then compare the result- 
ing images of the amorphous region at the foil edge. The tilts can be adjusted in 
the image mode until the same image appearance is produced for opposite tilts. 
A similar result can be achieved by taking FFTs of the two images and then com- 
paring their diffraction patterns, which should be identical. It is now possible to 
perform this operation online using an iterative computer program, so that beam 
tilt and astigmatism can be corrected to the required levels of accuracy by the 
computer after an initial manual alignment. 
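The comparison of FFTs from images recorded with opposite beam tilts can be reduced to a single similarity number, as in the sketch below. The normalized correlation of the two power spectra used here is only one possible metric and is an assumption of this example, not the procedure of the cited studies; random arrays stand in for images of the amorphous edge.

```python
import numpy as np

def power_spectrum(image):
    """Centered power spectrum (diffractogram) of a real-space image."""
    spectrum = np.fft.fftshift(np.fft.fft2(image - image.mean()))
    return np.abs(spectrum) ** 2

def diffractogram_similarity(image_a, image_b):
    """Normalized correlation (between -1 and 1) of two diffractograms."""
    pa = power_spectrum(image_a).ravel()
    pb = power_spectrum(image_b).ravel()
    pa = pa - pa.mean()
    pb = pb - pb.mean()
    return float(pa @ pb / np.sqrt((pa @ pa) * (pb @ pb)))

rng = np.random.default_rng(0)
noise = rng.normal(size=(128, 128))     # stand-in for an amorphous-edge image
shifted = np.roll(noise, 5, axis=1)     # same structure, laterally displaced
other = rng.normal(size=(128, 128))     # unrelated image

print(diffractogram_similarity(noise, shifted))
print(diffractogram_similarity(noise, other))
```

Because the power spectrum is insensitive to a lateral shift of the image, the metric responds to changes in the transfer function rather than to specimen drift between the two exposures. In an alignment loop, the tilt offsets would be adjusted until the similarity of the two opposite-tilt diffractograms is maximized.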

Suitable specimens are essential: preferably flat, clean, and gradually decreasing
to zero thickness at an amorphous edge. Alignment of the specimen along a
zone axis is critical. This can be particularly challenging with bent specimens, as 
often occurs for metals. Normally, Kikuchi bands or convergent-beam patterns 
are employed for precise tilt adjustment; however, any portion of specimen dis- 
playing such effects is too thick for HRTEM imaging. The thin edge is often 
slightly bent so one must use a selected area aperture, tilting the specimen until 
the intensity of diffracted spots appears to be balanced about the transmitted 
beam. 


13.2.2.2 In-Situ Hot-Stage High-Resolution Transmission Electron Microscopy 
In-situ HRTEM provides unique capabilities for quantifying the dynamic behav- 
ior of materials at the atomic level (Sinclair et al. 1988; Howe 1999; Gai and 
Boyes 2003). This section provides a brief description of particular requirements 
for performing in-situ hot-stage HRTEM studies, used to induce phase transformations
in materials.

The specimen and microscope requirements for in-situ hot-stage HRTEM
imaging are not different from those of static HRTEM, except that one must 
have a heating holder and some method of recording and analyzing dynamic im- 
ages (Butler and Hale 1981; Sinclair et al. 1988). At present, most HRTEMs are 
equipped with charge-coupled device (CCD) or TV-rate cameras that are fiber- 
optically coupled to the HRTEM for recording high-resolution images. The 
simplest and least expensive method for recording in-situ images is to send the 
output from the TV-rate camera into a standard videocassette recorder (VCR). A 
reasonably good VHS format VCR and videotapes can be used to store hours of 
in-situ data for low cost. The disadvantage of this method is that the images often 
need to be converted to digital form for image processing and/or analysis and the 
TV camera response is not linear over a large range. Recording images digitally 
using a CCD camera eliminates these problems, but computer memory and acquisition
rates can become limiting factors. TV-rate cameras generally acquire 30
frames s⁻¹, so that the time resolution of the video recording is ~0.03 s. Faster
video systems are available, but the signal-to-noise ratio in the images then starts to
become an issue. Specimen drift is usually not a significant problem when images
are being acquired at TV rates, but substantial drift makes subsequent batch
processing and computer comparison of digital images difficult. It is possible to 
install drift compensators on the microscope that alleviate this problem, but these 
are not readily available commercially. 

The most versatile specimen holder for materials science applications is a 
double-tilt hot stage, and several manufacturers offer such holders (Williams 
and Carter 1996). A typical water-cooled holder can achieve a temperature of 
800-1000 °C with a full range of tilt. As in static HRTEM, the area of interest ideally
needs to be in a zone-axis orientation in order to interpret the atomic structure
in the images. Knowledge of the specimen temperature in the area of
interest, as well as of the heating/cooling rate when the current is changed, is critical
for quantitative studies; both can differ from the hot-stage thermocouple readout
depending on the sample conductivity, how well the sample is in contact with the holder,
etc. Experience has shown that the sample temperature is usually
within about 20 °C of the thermocouple readout for most holders of this type.

Just about any type of phenomenon that can be induced by temperature 
changes within the limits of the holder can be examined by in-situ hot-stage 
HRTEM. (The same applies to in-situ cooling experiments using liquid-nitrogen 
or liquid-helium holders, although holder vibration due to bubbling can be a consideration
when using cooling liquids.) These phenomena include order/disorder
reactions (Howe et al. 2006), grain-boundary motion (Merkle et al. 2002), melting
and freezing of materials (Sasaki and Sasa 1991), precipitation-dissolution (Howe
and Benson 1995), interfacial reaction (Sinclair 1994), crystallization (Sinclair et
al. 1993), twin and martensitic motion (Howe 1993), oxidation/reduction (Kang
and Eyring 1991), dislocation motion (Sinclair et al. 1988), faceting/roughening
transitions (Gabrisch et al. 2001), nanoparticle reactions (Lee and Mori 2005; see
Chapter 9), coarsening (Iijima 1985), quasimelting (Marks 1994), catalytic reaction
(Gai and Boyes 2003), etc.


13.2.2.3 Examples of HRTEM Studies of Dislocation and Interphase Boundaries


13.2.2.3.1 Disclinations in Mechanically Milled Fe Powder 

Mechanical milling is a technique for producing metallic alloys with ultrafine 
grain sizes by severe plastic deformation (cf. Chapter 8, Section 8.2). These nano- 
crystalline alloys have unique mechanical properties, such as hardnesses and 
yield strengths that are several times higher than those of conventional alloys, 
that make them attractive for a variety of applications (Meyers et al. 2006). The 
mechanisms by which materials deform during mechanical milling to produce 
ultrafine grains are not known, although it is thought that turbulent shear pro- 
cesses requiring crystal rotation are operative and that disclinations contribute to 
this process (Romanov and Vladimirov 1992; Seefeldt 2001). Disclinations in crys- 
talline materials can alternatively be described in terms of individual line defects 
in the atomic structure called dislocations (see Chapter 6). It is possible to image 
the atomic structures of defects such as dislocations and disclinations in crystal- 
line solids using HRTEM (Murayama et al. 2002). 

Figure 13.22(a) shows an HRTEM image of mechanically milled, nanocrystalline
Fe powder. The image is approximately 20.5 nm wide. The grain that occupies
most of the figure is in a ⟨111⟩ orientation. The hexagonal arrangement of
columns of Fe atoms in this orientation is visible as white spots, as verified by
image simulations. White lines were drawn periodically along the three sets of 
edge-on {110} planes in the grain, and these are shown superimposed on the 
HRTEM image in Fig. 13.22(b). Two sets of {110} planes running vertically in 
the figure are straight (or nearly so), but the set that is approximately horizontal 
in the figure bends considerably. Dark lines were drawn on all of the nearly hori- 
zontal, bent {110} planes in Fig. 13.22(b) to indicate their positions accurately. 
The black and white lines in Fig. 13.22(b) were removed from the HRTEM image 
and are shown separately in Figure 13.22(c) for clarity. The discussion that fol- 
lows emphasizes the bent {110} planes shown in Fig. 13.22(c), but it should be 
remembered that these planes are derived from the actual atomic positions visible 
in Fig. 13.22(a). 

The arrows in Fig. 13.22(c) mark two wedge-shaped regions that together form
a partial disclination dipole. The wedge-shaped regions are approximately 3.5 nm
apart. Each of the wedge-shaped regions contains a number of terminating {110}
planes, which are individual dislocations with a Burgers vector b = a/2⟨111⟩ (the
displacement vector that describes the magnitude and direction of their strain
field). The partial disclinations in Fig. 13.22(c) appear to
be wedge disclinations with a Frank vector ω (the rotational vector that
describes the distortional power of the disclination) parallel to the defect
line (in this case, the ⟨111⟩ viewing direction). Unfortunately, it is not possible to
determine exactly the wedge and twist components of the partial disclinations in
Fig. 13.22(c), because an HRTEM image reveals atomic displacements only perpendicular
to the electron beam direction, and there may be displacements parallel to
the beam that are not visible. However, this image demonstrates that it is possible
to observe directly the individual dislocations that constitute partial disclination
dipoles in metals at the atomic level, even in mechanically milled powders that
have undergone severe plastic deformation.

Fig. 13.22 a) An experimental HRTEM image of mechanically milled,
nanocrystalline Fe powder taken near Scherzer defocus. The hexagonal
arrangement of white spots in the image corresponds to columns of Fe atoms
in a ⟨111⟩ orientation. b) White lines shown superimposed periodically on the
three sets of {110} planes in a) to highlight the distortion of the nearly
horizontal set of {110} planes. Black lines were also superimposed on this set
of planes to indicate their position clearly. c) The nearly horizontal black and
white lines in b) removed from the HRTEM image so that they are more clearly
visible. The various labels are explained in the text (from Murayama et al. 2002).

The set of terminating {110} planes that constitute the individual partial discli- 
nations, such as the ones labeled A in Fig. 13.22(c), can also be considered termi- 
nating tilt grain boundaries (Hirth and Lothe 1968). Compared to complete tilt 
grain boundaries, terminating tilt grain boundaries contain missing dislocations
and these are replaced by rotational elastic deformation in the crystal (Michler
et al. 1998). Such a configuration can be interpreted as a wedge of material added
to or removed from an ideal crystal, as illustrated in Fig. 13.23, and this is evident
from the wedge shape of the bent white lines in Fig. 13.22(b,c). The crystal rotation
produced by the partial disclinations (or terminating tilt grain boundaries) in
Fig. 13.22 is also evident. For example, the {110} planes located between the two
partial disclinations (labeled B in Fig. 13.22c) are rotated approximately 9° relative
to the {110} planes located outside the dipole (such as the nearly horizontal
planes labeled C in Fig. 13.22c). This observation provides direct confirmation
that crystals can rotate and thereby undergo turbulent behavior during severe
plastic deformation by the action of partial disclinations.

Fig. 13.23 The elastic distortion associated with a partial wedge disclination (or
terminating tilt grain boundary). a) One set of planes in a perfect crystal, such as the
perfect {110} planes seen in Fig. 13.22(b). b) A wedge-shaped piece of material is
removed from the crystal. c) The new surfaces are allowed to close, to fill the
wedge. The resulting crystal contains a terminating tilt grain boundary, i.e., a
terminating array of edge dislocations, and considerable elastic distortion, which
increases the elastic strain energy of the remaining solid (from Murayama et al. 2002).
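
As a rough consistency check on the approximately 9° rotation quoted in the text, the spacing of edge dislocations in a tilt array can be estimated from Frank's relation for a symmetric tilt boundary, θ = 2 arcsin(b/2D). The sketch below is illustrative only: the bcc Fe lattice parameter (0.2866 nm) and the assumption that the full magnitude of b = a/2⟨111⟩ acts as the edge component are not taken from the source.

```python
import math

A_FE_NM = 0.2866  # assumed lattice parameter of bcc Fe (nm), not from the source


def burgers_magnitude(a_nm):
    """Magnitude of b = a/2<111> in a cubic lattice."""
    return a_nm * math.sqrt(3) / 2


def dislocation_spacing(b_nm, tilt_deg):
    """Dislocation spacing D from Frank's relation theta = 2*asin(b / (2*D))."""
    return b_nm / (2 * math.sin(math.radians(tilt_deg) / 2))


b = burgers_magnitude(A_FE_NM)
d = dislocation_spacing(b, 9.0)  # 9 degrees is the rotation quoted in the text
print(f"|b| = {b:.3f} nm, dislocation spacing for a 9 degree tilt = {d:.2f} nm")
```

Under these assumptions the spacing comes out at roughly 1.6 nm; a different edge component of the Burgers vector would scale the result proportionally.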

The terminating dark lines in Fig. 13.22(c) reveal the arrangement of individual 
dislocations that constitute the partial disclinations associated with the dipole in 
this figure, as well as other, isolated dislocations in the metal. The generation and 
interaction of partial wedge disclinations allows reorientation of crystal volumes 
only several nanometers in size. This mode of deformation on such a fine scale 
probably facilitates the fragmentation and reorientation process of metal grains
undergoing severe plastic deformation (Romanov and Vladimirov 1992; Seefeldt
2001), leading to an ultrafine grain size. Thus, partial disclination defects such
as those in Fig. 13.22 can contribute to both the deformation response and the
strengthening of metals. The generation of partial disclination defects provides
an alternative mechanism to grain boundary sliding, which has been suggested
to allow rotation of nanosize crystals during mechanical milling (Schiotz et al.
1998). It is not possible from the HRTEM image to determine exactly how the
partial disclination dipoles in Fig. 13.22(c) formed, although the dislocations
probably nucleated at pre-existing defects such as grain boundaries or cell walls
in the metal and rearranged into the terminating arrays in Fig. 13.22.


13.2.2.3.2 Interphase Boundaries in Metal Alloys 
Interfaces that separate two crystals that differ in composition, Bravais lattice, 
or both, are referred to as interphase boundaries (Wolf and Yip 1992; Sutton and 
Balluffi 1995; Howe 1997). Understanding the details of how the atoms arrange
at such interfaces and cross from one phase to form the other is essential to
understanding the nature of phase transformations in materials (Wayman et al. 
1994; Aaronson 2002). Interphase boundaries are often divided into three classes 
based on the degree of atomic matching or coherency across the interface (Howe 
et al. 2000; see Chapter 7), as: 
• coherent interfaces, where there is complete continuity of
  atomic planes across the interface between two phases
• partly coherent interfaces, where the mismatch between two
  crystal structures across the interface is accommodated by
  periodic misfit dislocations, i.e., strain localization, in the
  interface, and
• incoherent interfaces, where atomic matching is sufficiently
  poor for there to be no correspondence of atom planes across
  the interface or localization of misfits into dislocation defects.


In addition to this classification, which is based on the degree of atom match- 
ing and/or misfit localization, an interphase boundary may be sharp or diffuse, 
depending on whether the composition or structural changes occur abruptly 
across a single plane, or smoothly over several or more planes (Cahn and Hilliard 
1958; Sutton and Balluffi 1995; Howe 1997). The following examples illustrate 
how HRTEM has been a valuable technique for determining the dynamic behav- 
ior and mechanisms of motion of both diffuse and abrupt interphase boundaries 
displaying different degrees of coherency, as listed above. 


13.2.2.3.3 Diffuse Interface in Cu-Au 

Figure 13.24(a) shows an HRTEM image of a diffuse interphase boundary between
the long-range ordered AuCu-I phase and disordered α phase in a Au-41
at.% Cu alloy sample taken as a frame from a videotape at approximately 305 °C 
during in-situ heating in the HRTEM (Howe et al. 2006). The position where an 
intensity profile 25 pixels wide was taken across the interphase boundary is indi- 
cated by the white rectangle in the figure. The corresponding intensity profile is 
shown in Fig. 13.24(b). In this profile, the ends of the diffuse interphase bound- 
ary on the ordered and disordered sides are indicated by the lines labeled O and 
D, respectively, and the same locations are also indicated in Fig. 13.24(a). Detailed 
HRTEM image simulations (e.g., inset in Fig. 13.15a) show that the Au-rich (001) 
planes in the ordered AuCu-I phase appear as bright lines in the image in Fig. 
13.24(a) and corresponding high-intensity peaks on the left-hand side of the
intensity profile in Fig. 13.24(b). The Cu-rich planes in between are barely visible
because they appear dark in the HRTEM image in Fig. 13.24(a) under these
experimental conditions. The lower intensity peaks spaced only half this distance
apart to the right of position D correspond to the {002} planes in the disordered
α phase. In the region between O and D, the intensities change from left to right,
leading to the interphase boundary thickness labeled L in Fig. 13.24(b).

Fig. 13.24 (a) HRTEM image of an order-disorder interphase boundary at 305 °C with
a simulated image of the interphase boundary for the same sample and microscope
conditions shown superimposed on the experimental image. The Au-rich (001) planes
in the AuCu-I phase appear bright on the left-hand side of the interphase boundary,
as do the {002} planes that are spaced approximately half as far apart in the α phase
on the right-hand side; (b) an intensity profile 25 pixels wide taken across the interphase
boundary over the region indicated by a white box in (a). The positions O and D, which
indicate the ordered and disordered sides of the diffuse interphase boundary, respectively,
are indicated in both (a) and (b) (from Howe et al. 2006).
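
The idea of a 25-pixel-wide intensity profile can be sketched as a simple row average across a strip of the image, followed by a search for fringe peaks. This is a minimal illustration on a synthetic image: the function names, the thresholds, and the peak-picking rule are hypothetical, and the source does not state how positions O and D were actually extracted from the profile.

```python
def strip_profile(image, row_start, width):
    """Average `width` rows of a 2-D intensity array into a 1-D profile."""
    rows = image[row_start:row_start + width]
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]


def peak_positions(profile, threshold):
    """Indices of interior local maxima whose intensity exceeds `threshold`."""
    return [i for i in range(1, len(profile) - 1)
            if profile[i] > threshold
            and profile[i] >= profile[i - 1]
            and profile[i] > profile[i + 1]]


# Synthetic image: bright fringes every 4 pixels on the left (ordered-like),
# weaker fringes every 2 pixels on the right (disordered-like).
row = ([9 if x % 4 == 0 else 1 for x in range(16)]
       + [4 if x % 2 == 0 else 1 for x in range(16)])
image = [row[:] for _ in range(25)]

profile = strip_profile(image, 0, 25)   # 25-pixel-wide strip, as in the text
strong = peak_positions(profile, 6)     # hypothetical threshold for bright fringes
weak = peak_positions(profile, 2)       # hypothetical threshold for all fringes
print("strong fringe peaks:", strong)
print("all fringe peaks:", weak)
```

In this toy profile the strong peaks are spaced twice as far apart as the weak ones, mirroring the factor of two between the Au-rich (001) spacing and the {002} spacing described above.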

The positions O and D in Fig. 13.24 were determined for each frame (time in- 
terval of 0.03 s) over a period of 60 s and these are plotted in Fig. 13.25(a). The 
data in Fig. 13.25(a) show several important features. First, it is evident that the 
disordered side of the diffuse interphase boundary (top graph) generally moves 
more frequently and often over larger distances than the ordered side of the
interphase boundary (bottom graph) with time. The shortest distance present in the
top graph (arrow) is the spacing of one {002} plane in the disordered α phase.
Following the disordered side of the interphase boundary with time, it is evident
that the interphase boundary fluctuates over a distance of one to as many as six
{002} α planes. In contrast, using the {002} plane spacing as a reference, it is
apparent that the ordered side of the interphase boundary in the lower graph
typically moves twice this distance, which is the spacing between the Au-rich (001)
planes in the AuCu-I phase. In addition, it fluctuates with a frequency that is
several times lower than that of the disordered side. Thus, the position of the
disordered side of the interphase boundary rapidly fluctuates between a number of
adjacent {002} α planes with time, while the ordered side fluctuates more slowly
and only between Au-rich planes in the ordered AuCu-I phase.

Fig. 13.25 (a) Graphs showing the positions of the disordered (top) and ordered (bottom)
sides of the diffuse interphase boundary in Fig. 13.24(a) over a period of 60 s;
(b) corresponding graphs showing the interphase boundary mean position (top) and
thickness (bottom) over the same period. The interphase boundary mean positions in
(b) fluctuate around a value of approximately 3.5 nm, which is roughly the midpoint
between the two intensity profiles in Fig. 13.24(b) (from Howe et al. 2006).

Fig. 13.26 HRTEM images showing mechanisms of interface
movement. Fluctuations nucleate homogeneously on the interface and
expand by steps, one (001) AuCu-I plane high, to form (a) a pillbox
shape (white arrow) and (b) a series of steps (see arrows). The 
interface profiles are outlined. 


The different interphase boundary behaviors observed in Fig. 13.25(a) are 
probably due to a combination of several factors occurring at the interphase 
boundary, namely: (a) the tetragonality of the AuCu-I structure, which leads to 
an energetically stable (001) plane that tends to keep the ordered side of the inter- 
phase boundary parallel to this plane; (b) a lower average diffusivity in the AuCu-I 
phase as compared to the disordered phase at 305 °C; and related to this, (c) a 
longer atomic jump distance (0.357 nm) for diffusion perpendicular to the or- 
dered (001) planes in the AuCu-I ordered phase; and (d) the fact that the inter- 
phase boundary must move this same distance between Au-rich planes on the 
ordered side (Schweika et al. 2004; Butrymowicz et al. 1974). A combination of 
these effects leads to a significantly higher activation energy for movement of 
the ordered side of the interphase boundary, causing the experimentally observed 
interphase boundary behavior in Fig. 13.25(a). In fact, if one performs an approximate
calculation for the average diffusivities and corresponding jump frequencies
of the atoms in the ordered and disordered phases, respectively, using the
equation Γ = 6D/λ², where Γ is the jump frequency (s⁻¹), D is the diffusivity
at 305 °C (m² s⁻¹) and λ is the jump distance (m), one finds that the jump
frequency in the disordered phase is approximately 3 × 10° s⁻¹, as compared to
3 × 10* s⁻¹ in the ordered phase, consistent with the considerations mentioned
above. However, these frequencies are approximately four orders of magnitude
greater than the actual frequencies of interphase boundary movement recorded 
for the ordered and disordered sides of the interphase boundary, indicating 
that collective volumes, rather than individual atom jumps, are involved in the 
process. In other words, movement of the interface involves the formation of 
critical-size fluctuations involving many atoms (Howe and Benson 1995). 
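
The arithmetic behind the relation Γ = 6D/λ² can be sketched as follows. The jump distance (0.357 nm) and the average boundary-movement frequency (1.2 s⁻¹) are the values quoted in the text; the diffusivity is a hypothetical placeholder chosen only to show the calculation, not a measured value for Au-Cu at 305 °C.

```python
def jump_frequency(diffusivity_m2_s, jump_distance_m):
    """Atomic jump frequency (1/s) from Gamma = 6*D / lambda**2."""
    return 6.0 * diffusivity_m2_s / jump_distance_m ** 2


LAMBDA_ORDERED_M = 0.357e-9  # jump distance perpendicular to (001), from the text (m)
D_HYPOTHETICAL = 1.0e-15     # placeholder diffusivity (m^2/s), an assumption
OBSERVED_HZ = 1.2            # average boundary-movement frequency from the text (1/s)

gamma = jump_frequency(D_HYPOTHETICAL, LAMBDA_ORDERED_M)
print(f"Gamma = {gamma:.2e} 1/s")
print(f"ratio to observed boundary frequency = {gamma / OBSERVED_HZ:.1e}")
```

Because Γ scales linearly with D and inversely with λ², a longer jump distance in the ordered phase lowers the jump frequency even before any difference in diffusivity is considered.
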

An outline of the ordered side of the interface was drawn for a number of individual
frames, based on the positions of the bright (001) planes. Figure 13.26
shows two different interface profiles that were typically found, reflecting the 
mechanisms by which interface movement (or fluctuation) occurred (Howe et al. 
2006). Figure 13.26(a) shows a situation where a perturbation in the ordered 
phase appeared to nucleate homogeneously on the interface (arrow) away from 
any other features. This perturbation expanded into the disordered phase and lat- 
erally along the order-disorder interface, propagating it forward and resulting in 
the interface shape outlined in Fig. 13.26(a). The image in Fig. 13.26(b) shows a 
second case, where a fluctuation in the ordered phase nucleated near a boundary 
between two different ordered domains. This ordered portion then propagated 
along the order-disorder interface as a series of steps, one (001) plane high (indicated
by arrows in Fig. 13.26b), again resulting in net movement forward. In either 
case, the interface then receded by the opposite movement of such fluctuations 
(or steps). In most of the frames examined in the video, it was found that the 
ordered phase advanced into the disordered phase at more than one region along 
the interface, with nucleation at the domain boundary constituting only a small 
fraction of the fluctuations. Hence, this intersection did not appear to have a large 
effect on the overall interface behavior. 

Based on Fig. 13.26, it is clear that the interface does not fluctuate by single- 
atom jumps, but by a collective mechanism involving the nucleation and spread- 
ing of critical-size (001) steps parallel to the interface plane. This is similar to the 
behavior of highly faceted interfaces in other diffusional transformations such as 
precipitation and crystallization (Howe et al. 1996) as shown below, and different 
from the behavior of fluctuations at solid/liquid interfaces (Hoyt et al. 2003), 
which are not constrained to a certain height, or energy minimum perpendicular 
to the interface (e.g., one (001) AuCu-I plane spacing). It also demonstrates that 
the anisotropy in the interfacial energy of the AuCu-I phase, specifically the (001) 
interphase boundary energy, is sufficient at this temperature to influence the 
morphology and behavior of fluctuations at the diffuse interface. Such behavior 
is consistent with previous observations of interfaces in other similar systems 
(Loiseau et al. 1994; Chatterjee et al. 2004). 

Since HRTEM images such as those in Figs. 13.24 and 13.26 are projections of 
the three-dimensional interface structure, the interface fluctuations seen in pro- 
jection also occur through the specimen thickness along the viewing direction, 
so that the profile in Fig. 13.26(a) for example, most probably represents a set of 
somewhat faceted, concentric pillboxes in three dimensions. It is also quite likely 
that fluctuations in the interface through the specimen thickness are similar, if 
not identical, to what is observed in the plane of projection, since the interface 
plane is (001). The minimum size perturbation (or fluctuation) observed at the 
interface was one (001) plane spacing high (~0.37 nm) and about three to four 
times this wide (1.2-1.6 nm), so that the critical fluctuation volume is on the 
order of 0.6 nm³, or about 40 atoms, assuming a circular pillbox shape, or slightly
larger if a square faceted shape is assumed. Note that this is about the minimum
volume of specimen that one would expect to be observable in a specimen 20 nm
thick. 
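
The critical fluctuation volume quoted above can be reproduced approximately by treating the fluctuation as a circular pillbox one (001) spacing high. In this sketch the height and diameter range come from the text, while the atomic volume (four atoms per cubic cell with a ≈ 0.39 nm) is an assumption introduced only to convert volume into an approximate atom count.

```python
import math


def pillbox_volume(diameter_nm, height_nm):
    """Volume of a circular pillbox (nm^3)."""
    return math.pi * (diameter_nm / 2) ** 2 * height_nm


HEIGHT_NM = 0.37                   # one (001) plane spacing, from the text
ATOMIC_VOLUME_NM3 = 0.39 ** 3 / 4  # assumed: 4 atoms per cubic cell, a ~ 0.39 nm

for diameter in (1.2, 1.4, 1.6):   # width range quoted in the text (nm)
    v = pillbox_volume(diameter, HEIGHT_NM)
    atoms = v / ATOMIC_VOLUME_NM3
    print(f"d = {diameter} nm: V = {v:.2f} nm^3, about {atoms:.0f} atoms")
```

The mid-range diameter gives a volume close to 0.6 nm³ and an atom count near 40 under these assumptions, in line with the estimate in the text.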

The interphase boundary thickness and mean position were determined from 
the data in Fig. 13.25(a) and these are plotted in Fig. 13.25(b). The mean inter- 
phase boundary position was determined as the midpoint between positions O 
and D. The average frequency of interphase boundary movement is 1.2 s⁻¹, with
a maximum frequency of 7.7 s⁻¹ and a minimum frequency of 0.4 s⁻¹. This temporal
variation reflects mainly that of the disordered side, which fluctuates at a
higher frequency. Also note that the mean interphase boundary position moves 
up more often than down in Fig. 13.25(b), again reflecting the character of the 
disordered side of the interphase boundary, which tends to move into the disor- 
dered phase, expanding the interphase boundary. Thus, the interphase boundary 
fluctuations are not symmetric about the mean position, but favor the disordered 
side. The average velocities of the ordered and disordered sides of the interphase 
boundary were also determined over 60 s using the data in Fig. 13.25(a), and were 
found to be 0.3 and 1.3 nm s⁻¹, respectively.
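
The derived quantities described above follow directly from the per-frame positions O and D: the mean position is their midpoint and the thickness is their separation. The sketch below uses invented positions for five frames; the way movement frequency is counted here (frame-to-frame position changes per second) is an assumption, since the source does not spell out its counting rule.

```python
def boundary_metrics(ordered_nm, disordered_nm):
    """Per-frame (mean position, thickness) from the O and D positions."""
    return [((o + d) / 2, d - o) for o, d in zip(ordered_nm, disordered_nm)]


def movement_frequency(positions_nm, frame_interval_s):
    """Frame-to-frame position changes per second (assumed counting rule)."""
    moves = sum(1 for a, b in zip(positions_nm, positions_nm[1:]) if a != b)
    return moves / (frame_interval_s * (len(positions_nm) - 1))


FRAME_INTERVAL_S = 0.03  # time interval between frames, from the text

# Hypothetical positions (nm) for five consecutive frames, for illustration only.
o_side = [2.6, 2.6, 2.6, 3.0, 3.0]
d_side = [4.4, 4.6, 4.4, 4.4, 4.8]

metrics = boundary_metrics(o_side, d_side)
for mean_pos, thickness in metrics:
    print(f"mean = {mean_pos:.2f} nm, thickness = {thickness:.2f} nm")
print(f"D-side frequency = {movement_frequency(d_side, FRAME_INTERVAL_S):.1f} 1/s")
```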

The interphase boundary thickness is shown in the bottom graph in Fig. 
13.25(b). The average thickness is approximately 1.7 nm, or the equivalent of 
seven {002} planes (or about four unit cells) in the disordered α phase, although
the thickness averages slightly more than 2.0 nm during the latter 20 s of the 
video sequence, where the disordered side of the interphase boundary moves a 
few planes further away from the ordered side, and slightly less than 1.7 nm ear- 
lier on. The maximum and minimum interphase boundary thicknesses observed 
were 3.3 and 0.8 nm, respectively, which represent variations in thickness on the 
order of 100%. The average interphase boundary thickness measured from Fig. 
13.25(b) is comparable to recent studies on strain-induced incomplete wetting 
above the critical temperature Tc, where the alloy becomes disordered, at AuCu-I
(001) surfaces (Schweika et al. 2004), and is also similar to previous calculations
for a Au-Cu alloy (Kikuchi and Cahn 1979) and recent atomistic calculations
performed on Al-Li and Al-Ag alloys slightly below Tc, using embedded atom
potentials and Monte Carlo (MC) methods (Sluiter and Kawazoe 1996; Asta and
Hoyt 2000). In these systems, which are similar to the alloy in the Au-Cu case,
the calculations indicate that the diffuse interphase boundary should extend over
four to six unit cells (i.e., L ~ 1.6-2.4 nm). In the Au-Cu system, the
strain-induced incomplete wetting phenomenon leads to order in AuCu (001) surfaces
that extends five to seven planes (i.e., L ~ 1.0-1.4 nm) into the crystal at
temperatures much greater than Tc.


13.2.2.3.4 Partly Coherent Interfaces in Al-Cu 

Recent in-situ hot-stage HRTEM studies of precipitate plates in Al-Cu-Mg-Ag 
and Al-Ag alloys performed both parallel and perpendicular to the plate faces, 
and comparison of these studies with prior HRTEM and conventional in-situ 
hot-stage TEM investigations, have clearly established the terrace-ledge-kink 
(TLK) mechanism as the primary atomic mechanism involved in growth and dissolution
of faceted precipitates in metal alloys (Laird and Aaronson 1969; Howe
et al. 1987; Howe 1997). This process is illustrated schematically in Fig. 13.27,
which shows a perspective view of a precipitate plate growing by ledges that
nucleate on the habit plane and propagate out to the edge, where they stack one
above the other. Figure 13.27 shows two additional views, one perpendicular to
the habit plane of the plate, i.e., face-on, and the other parallel to the habit plane,
i.e., edge-on. It is clear from the two lower illustrations that when the plate is
viewed edge-on parallel to the facets, it is possible to observe the atomic structure
and dynamics of individual ledge motion and also the motion of the ledges
stacked at the plate edges by in-situ HRTEM. When the edge of the plate is
viewed in the face-on orientation, the electron beam is parallel to the ledges and
this makes it possible to observe kinks that form on and propagate along the
ledges. By combining information from these two orientations, it is possible
to obtain a three-dimensional description of the atomic mechanisms of interfacial
motion, as illustrated by the following data for {111} θ plates in an
Al-Cu-Mg-Ag alloy (Howe and Benson 1995; Benson and Howe 1997).

Fig. 13.27 Perspective view (top), face-on (bottom left), and edge-on
(bottom right) views of a precipitate plate growing by a terrace-ledge-kink
mechanism (from Howe et al. 1996).


Structural and kinetic analyses in an edge-on orientation. The orientation
relationship of the {111} θ phase with the matrix is (110)θ||(111)α, [110]θ||[101]α, and
[001]θ||[111]α, which is a low-energy orientation relationship for θ phases
designated as a Vaughan II orientation relationship. Figure 13.28(a) shows an HRTEM
image of a ledge on the face of a θ plate viewed edge-on along a [001]θ||[121]α
direction, as in the lower-right illustration in Fig. 13.27. The ledge is approximately
two {111}α matrix planes high, or half of a unit cell of the θ structure (0.424 nm).
This was the smallest ledge size that was observed on the faces of the θ plates and
higher ledges were often observed (Benson and Howe 1997). At about 220 °C the
ledge was observed to oscillate several times per second over a distance of about
two unit cells of the θ phase along the precipitate face while moving slowly across
the face toward the precipitate edge in the direction indicated by an arrow. In-situ
experiments that were performed perpendicular to the plate face indicate that
the oscillatory motion is due to the formation and annihilation of kinks along
the ledge, as demonstrated in the next section. The videocassette recording also
revealed direct experimental evidence of enhanced atomic motion in the matrix
just ahead of the ledge; this leads to slight blurring in the photograph, which
is visible in the enclosed area in Fig. 13.28(a). It is important to note that the
precipitate structure only one unit cell behind the ledge appears completely
transformed, indicating that the structural and compositional changes which are
necessary for diffusional growth occur simultaneously within a few atomic distances
of the ledge. Although the ledge appeared to move smoothly across the precipitate
face over short periods of time, it displayed start-stop behavior when viewed over
longer times, as shown in Fig. 13.28(b). Such periodic lack of mobility during the
migration of ledges has been observed previously and attributed to a lack of sites
for atomic attachment along the ledges as they align along low-energy matrix
directions, in this case ⟨121⟩α (Garg et al. 1993).

Fig. 13.28 (a) HRTEM image of a single ledge on a θ plate during
growth at about 220 °C, and (b) graph of growth distance versus time
for the ledge in (a) (from Benson and Howe 1997).


Structural and kinetic analyses in a face-on orientation. Figure 13.29 shows
an HRTEM image taken at the edge of a θ plate viewed face-on along a
[110]θ||[111]α direction, as in the lower-left illustration in Fig. 13.27, during an
in-situ hot-stage experiment at about 275 °C. The prominent rectangular pattern
of white spots outlined in Fig. 13.29 with dimensions of 0.244 × 0.429 nm²
relates directly to positions of Cu atoms in the θ structure, as determined by
HRTEM image simulation. During the in-situ HRTEM experiments, the θ plate
was observed to grow by the nucleation of double kinks half a unit cell high
(0.429 nm) along the (110)θ||(101)α plate edge, which propagated along the edge
until they reached the intersecting facet. The arrow in Fig. 13.29 indicates the end
of one such kink. The smallest kinks were one-half of the θ unit cell in height
(one rectangular pattern of white spots about 0.429 nm long) but sometimes two
or three kinks nucleated and/or dissolved in rapid succession in an oscillatory
manner about an average position, similar to the behavior described for the
ledge in Fig. 13.28. It is important to note that the kink in Fig. 13.29 is well
defined to within two or three half unit cells of the {111} θ structure. Although the
image in Fig. 13.29 was taken at the edge of the θ plate where several ledges may
be stacked vertically parallel to the electron beam direction, when this perspective
is combined with the one in Fig. 13.28, it is possible to conclude that the phase
transformation is occurring at kinks in ledges on the θ plates and that the
transformation is completed within a volume as small as about one unit cell of the θ
phase along the [110]θ and [110]θ directions. This volume contains about four
atoms of Cu and eight atoms of Al. Thus, performing in-situ HRTEM allows
observation of the atomic mechanisms of the transformation (the TLK mechanism)
as well as the dynamics of transformation. It is also possible to study the
dynamics of the kinks at the edges of the plates, as described in detail elsewhere (Howe
and Benson 1995; Benson and Howe 1997).

Fig. 13.29 An isolated kink (indicated by the arrow) traveling along the
[110]θ facet of a {111} θ precipitate plate during growth at about 275 °C
(from Howe and Benson 1995).


13.2.2.3.5 Incoherent Interfaces in Ti-Al 

In contrast to the case of coherent and partly coherent interfaces described above, 
the atomic mechanisms by which solid—solid incoherent interfaces move are not 
well understood (Howe et al. 2000; Aaronson 2002; Massalski et al. 2006). This is 
true for both high-angle grain boundaries (Sutton and Balluffi 1995; Schonfelder 
et al. 1997; Merkle 2002) and interphase boundaries. During dissolution or at 
static interfaces when nucleation at the interface is not required for motion, local 
fluctuations can occur due to thermal effects. In-situ HRTEM studies on grain 
boundaries have revealed such reversible fluctuations (Merkle 2002). 

Recent in-situ heating HRTEM experiments performed on massive transforma- 
tion interfaces in a TiAl alloy have shown evidence of both continuous and step- 
wise motion during growth at the atomic level, depending on the orientation 
relationship and interface plane (Howe et al. 2002). These two types of growth be- 
havior were previously observed at much lower levels of resolution during cine- 
matographic studies of the massive transformation in Cu-Ga alloys (Kittl and 
Massalski 1967) and during in-situ heating TEM experiments of massive inter- 
faces in Ag-Al (Perepezko and Massalski 1975) and Cu-Zn alloys (Baro and 
Gleiter 1974). One problem associated with all of the previous, lower-resolution 
in-situ studies is that the actual structures of the massive transformation 
interfaces were not known, but assumed to be incoherent. This assumption has 
been challenged repeatedly in the literature, particularly since many massive 
products display definite crystallographic orientation relationships with the par- 
ent phase (Aaronson 2002). The following results were obtained from frame-by- 
frame analysis of in-situ HRTEM data obtained on a massive transformation 
interface in a Ti-Al alloy growing by a continuous mechanism, i.e., by atom at- 
tachment continuously along the interface rather than at well-defined crystallo- 
graphic facets or steps in the interface (Raffler and Howe 2006). The results re- 
veal new phenomena associated with continuous growth at solid-solid interfaces 
and are compared with existing experimental and atomistic simulation results on 
grain and interphase boundaries at the same level of spatial resolution. 

Figure 13.30 shows an HRTEM image of an α2/γm incoherent interface
obtained in a Ti-46.54 at.% Al alloy that was solutionized at 1360 °C for 1200 s
and quenched with water or ice-brine to nucleate grains of the γm (massive)
phase in the α2 (parent) phase, after the specimen had been heated for about
5400 s (1.5 h) at 575 °C in situ in the TEM (Howe et al. 2002). The interface is
edge-on and nearly parallel to a (220)γm plane, or roughly perpendicular to the
(111)γm planes that are clearly visible and indicated in the γm phase in the image.
The orientation relationship between the two phases across this interface was
approximately: [112]γm||[122]α2 with about 1.4° tilt between the two zone axes; and
(111)γm||(223)α2, and the viewing direction is approximately [112]γm||[122]α2 in
Fig. 13.30. In the video recording, the interface appeared to advance into the α2
phase by continuous overall motion, without evidence of the start-stop growth
behavior which is often observed during motion of coherent and partly coherent
interfaces between solid phases with low-index orientations and interface planes
by steps or ledges, as shown above for θ phases. The interface also appeared to
undulate slightly as it advanced, rather than remaining planar during growth.
These features were examined in detail in the subsequent frame-by-frame
analyses over a period of 50 s (or 1500 frames).

Fig. 13.30 HRTEM image of an α2/γm incoherent interface obtained
after the specimen had been heated for about 5400 s (1.5 h) at 575 °C
in situ in the TEM. The (220)γm interface plane and the (111)γm planes
in the γm phase are indicated. Lines A and B show the positions of two
(111)γm planes that were used to obtain Fig. 13.34 and a typical trace
of the α2/γm interface is outlined in the top half of the image (from
Raffler and Howe 2006).

A typical trace of the α2/γm interface that was analyzed in this study is indicated
by a line in the top-center part of Fig. 13.30. Figure 13.31(a) shows the actual
positions of 10 such traces obtained from each frame of the video recording during
the first 0.3 s. In Fig. 13.31(b), the traces were separated by a distance of 0.52 nm,
starting with the original frame on the right, to better reveal their characteristics.
The positions of various advances and recessions in the traces of the interface are
indicated by arrows in Fig. 13.31(b). Initial examination of the traces gives the
impression that motion of the interface is highly irregular, although every trace
shows at least one location where the interface advances into the α2 phase.
Further examination of the arrows shows that when the interface advances in one
location in a frame, it subsequently recedes in the same location in the next frame.
This pattern of advance and recession leads to an oscillatory behavior in the
interface position as a function of time, as illustrated using only the (111) planes in
the γm phase in Fig. 13.32(a). (Note that the regularity of the (111) planes shown
in Fig. 13.32(a) is only for purposes of illustration and not meant to imply that
the motion of these planes at the interface is so regular.) In addition to the
oscillatory behavior caused by the advances and recessions, movement of the interface
appeared to occur by the spreading of certain advances along the interface. This
behavior is illustrated in Fig. 13.32(b). Both of these features at the interface
appeared to involve the cooperative growth and dissolution among groups of
individual (111) plane edges in the γm phase at the interface.

Fig. 13.31 (a) The positions of 10 traces obtained from each frame of
the video recording during the first 0.3 s. (b) The same traces separated
by a distance of 0.52 nm, starting with the original frame on the right,
to better reveal their characteristics. The positions of various advances
and recessions in the traces of the interface are indicated by arrows
(from Raffler and Howe 2006).

The distances between advancements in the traces of the interface were mea- 
sured in each frame and found to display characteristic spacings of approximately 
1.15 and 1.61 nm. This feature is illustrated in Fig. 13.33(a), where the distances 
between the advances are indicated by solid dots. Figure 13.33(b) shows the same 
traces with broken lines drawn tangentially to the advances and recessions along 
the interface traces and a darker, parallel solid line drawn halfway between these. 
The distances between the broken lines varied from 0.27 to 0.58 nm, with an 
average value of 0.41 nm. If the amplitudes and distances of the advances/ 
recessions at the interface are treated as a wave-like fluctuation, the fluctuation 
has an amplitude and fundamental wavelength of approximately 0.21 nm and 
1.15 nm, respectively. It is interesting to note that the second characteristic
wavelength associated with the fluctuation, 1.61 nm, is approximately √2 times the
fundamental wavelength.
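
The arithmetic behind these numbers can be checked in a few lines. The following is only a sketch using the values quoted above; the variable names are chosen here for illustration, and the amplitude is taken as half the average peak-to-peak distance between the tangent lines.

```python
import math

# Values quoted in the text for the first 0.3 s of the recording (all in nm).
peak_to_peak_mean = 0.41        # average distance between the two tangent lines
fundamental_wavelength = 1.15   # shorter characteristic spacing
second_wavelength = 1.61        # longer characteristic spacing

# The amplitude of a wave-like fluctuation is half its peak-to-peak height.
amplitude = peak_to_peak_mean / 2.0

# Ratio of the two characteristic spacings, compared with sqrt(2).
ratio = second_wavelength / fundamental_wavelength
deviation_from_sqrt2 = abs(ratio - math.sqrt(2)) / math.sqrt(2)

print(f"amplitude = {amplitude:.3f} nm")
print(f"wavelength ratio = {ratio:.3f}, sqrt(2) = {math.sqrt(2):.3f}")
print(f"relative deviation from sqrt(2) = {deviation_from_sqrt2:.1%}")
```

The ratio of the measured spacings agrees with √2 to within the scatter expected from spacings that are themselves only approximate.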
Fig. 13.32 Illustrations of: (a) the oscillatory behavior of the interface as
a function of time, and (b) movement of the interface by the spreading
of an advancement in the directions indicated by the horizontal arrows.
The amplitude (A) and wavelength (λ) of the fluctuations are indicated
in (a) (from Raffler and Howe 2006).





Fig. 13.33 (a) The same traces as in Fig. 13.20, with the distances 
between advances indicated by solid dots. These were used to 
determine the wavelength of the fluctuations. (b) The same traces with 
broken lines drawn tangentially to the advances and recessions along 
the interface traces and a darker, parallel solid line drawn halfway 
between these. These were used to determine the amplitude of the 
fluctuations (from Raffler and Howe 2006).

The interface dynamics was analyzed over a longer period of time by similarly
plotting the interface traces from video frames obtained every 5 s, over the dura- 
tion of the video sequence (50 s). The same interface characteristics were ob- 
served over this longer time interval as for the previous short one, indicating 
that the interface behavior is the same over all time scales within the time resolu- 
tion of the experiment (i.e., 0.03 s). In the 50 s segment analyzed, the distances 
between the advances in the interface were found to be approximately 1.19 and 
1.69 nm and the distances between the advances and recessions of the fluctuation 
varied from 0.32 to 0.53 nm. These values yield an average amplitude and funda- 
mental wavelength for the interface fluctuation of approximately 0.21 nm and 
1.19 nm, respectively, with the higher wavelength of 1.69 nm again being approximately
√2 times the fundamental wavelength. The same analyses performed at
intermediate time intervals yielded similar amplitudes and wavelengths, indicat- 
ing that they are characteristic of the interface over all time scales of observation 
from 0.3 to 50 s. 

The average velocity of the interface based on the initial and final positions of
the interface traces over the 50 s of observation was 0.023 nm s⁻¹. In order to
understand further the instantaneous velocity/behavior of the interface, the edges
of individual (111)γm planes were followed frame-by-frame in two locations at the
interface, indicated by lines parallel to the planes labeled A and B in Fig. 13.30.
Figure 13.34(a,b) shows the resulting plots of these (111)γm plane edges every
30th frame over 1500 frames, or a time of 50 s. These plots clearly illustrate the
somewhat irregular but definite oscillatory behavior of the interface, i.e., its wave-like
fluctuations versus time. In addition, it is apparent that the average velocity
of the interface is constant, as indicated by the straight lines that were fitted and
superimposed on the plots in Fig. 13.34(a,b), although the instantaneous position
of the interface oscillates about the average position with time at these locations.
The equations for the lines are also given in the plots. This behavior was typical
of all such (111)γm planes at the interface and reflects the behaviors illustrated
in Fig. 13.31. It is important to note that although the average interface velocity
was 0.023 nm s⁻¹, the instantaneous velocity at any given time could be more
than an order of magnitude higher, as evidenced by the slopes of the oscillations
in Fig. 13.34.
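
The distinction between average and instantaneous velocity can be illustrated with synthetic data. The sketch below is not the analysis of Raffler and Howe: it assumes a sinusoidal fluctuation with a hypothetical period of 4 s superimposed on the quoted drift of 0.023 nm s⁻¹, and uses a least-squares straight line to recover the average velocity.

```python
import numpy as np

# Acquisition parameters taken from the text: 1500 frames covering 50 s.
n_frames = 1500
duration_s = 50.0
t = np.arange(n_frames) * (duration_s / n_frames)   # time of each frame (s)

# Synthetic plane-edge position (nm). Drift velocity and amplitude are the
# values quoted in the text; the period is an ASSUMED, illustrative value.
v_mean = 0.023      # nm/s, average interface velocity
amplitude = 0.21    # nm, fluctuation amplitude
period_s = 4.0      # s, hypothetical oscillation period
position = v_mean * t + amplitude * np.sin(2.0 * np.pi * t / period_s)

# Average velocity: slope of a least-squares straight line through the data.
slope, intercept = np.polyfit(t, position, 1)

# Instantaneous velocity: finite-difference derivative of the position.
v_inst = np.gradient(position, t)
v_peak = np.max(np.abs(v_inst))

print(f"fitted average velocity:     {slope:.4f} nm/s")
print(f"peak instantaneous velocity: {v_peak:.3f} nm/s")
print(f"peak / average:              {v_peak / slope:.1f}")
```

With these assumed parameters the fitted slope stays close to the imposed drift, while the peak instantaneous velocity exceeds it by more than a factor of ten, which is the qualitative behavior described for Fig. 13.34.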

The results from the frame-by-frame analyses above indicate that this massive
transformation interface, which moves forward continuously as a planar interface
overall, rather than by an obvious ledge mechanism, displays quasiperiodic fluctuations
along its length that can be characterized by a wave-like function with an
amplitude of approximately 0.21 nm (roughly the spacing of the (111)γm planes)
and a fundamental wavelength of approximately 1.15 nm. In addition, a second
wavelength of 1.6 nm, or approximately √2 times the fundamental wavelength
(1.15 nm × √2 ≈ 1.6 nm), is also present at the interface. This oscillatory interface
behavior, which accompanies the interface as it propagates forward with constant
average velocity, is superimposed on the average position of the interface
versus time. The oscillations are not regular, but they display a characteristic amplitude
and wavelength. There was no obvious connection between either the amplitude
or the wavelength at the interface and any structural features in the α2
phase, such as a characteristic ledge spacing or height, although this aspect deserves
further examination. It would also be interesting to compare the behavior
of this interface at other boundary orientations obtained in the in-situ HRTEM
experiments, and this work is currently in progress.

Fig. 13.34 Frame-by-frame plots of the (111)γm plane edge positions at
locations A and B in Fig. 13.30, with data points indicated every 30th
frame over the duration of 1500 frames (50 s). The equations for the
straight lines fitted to the data points in the figure are also given in the
plots (from Raffler and Howe 2006).

The creation of advances at the interface and spreading of some of these ad- 
vances along the interface indicates that the interface must be overcoming some 
barrier to migration, rather than simply moving forward uniformly everywhere. 
As mentioned at the start of this section, the presence of local barriers to interface 
migration typically leads to oscillatory behavior at interfaces in diffusional trans- 
formations. The present interface is different from the previous Al-Cu-Mg-Ag 
diffusional interface in that it is a (relatively) planar, incoherent interface between 
two crystals with a high-index orientation relationship and does not involve long- 
range diffusion, as opposed to being a partly coherent interface between two crystals
with a low-index orientation relationship and cusp-oriented interfaces that
grow by a terrace-ledge-kink mechanism, involving long-range diffusion. In this
regard, it is more like a general, high-angle grain boundary migrating under a
driving force (Sutton and Balluffi 1995; Schönfelder et al. 1997). Unlike the case
of highly faceted cusp-oriented interfaces, it appears that the lack of well-defined 
structure at this interface allows it to undergo regular wave-like spatial fluctua- 
tions in position with time, some of which grow to a critical size and then spread 
along the interface, moving it forward. The size of the critical fluctuations for this 
interface appears to be just greater than approximately 0.21 nm high and 1.69 nm 
wide, since fluctuations larger than these are not commonly observed. In this re- 
gard, its motion is similar to that of a planar interface growing by nucleation and 
spreading of two-dimensional nuclei (Cahn et al. 1964; Benson and Howe 1997; 
Howe 1997), except that the nuclei are not well-defined crystallographically and 
their location is a stochastic process. 

The dynamic characteristics of the present interface are remarkably similar 
to those observed for the massive transformation in Cu-Ga alloys obtained at a 
much lower spatial resolution (i.e., at the micrometer scale), but at a higher time
resolution (at 64 frames s⁻¹) and at much greater velocities (up to 1 mm s⁻¹) using
cinematographic studies (Kittl and Massalski 1967). In fact, if one compares
the interface traces in Figs. 13.31 and 13.33 with those in Figs. 8 and 9 in Kittl 
and Massalski (1967), the traces are essentially identical, indicating scaling of cer- 
tain interface behaviors in these massive transformations over great variations in 
length, velocity, and time. The benefit of the Ti-Al case is that the much higher 
spatial resolution reveals the previously unknown atomic details of interfacial 
structure associated with the dynamic behavior. The oscillatory character of the 
interface and the constant velocity (i.e., the linear displacement versus time in 
Fig. 13.34) also compare reasonably well with results from a recent kinetic MC 
study of a massive transformation interface between fcc and bcc crystals (see 
Figs. 3 and 5 in Bos et al. (2004), for example), although in this study the low- 
index close-packed planes in the two crystals were parallel, so that the interface 
is cusp-oriented and two-dimensional bcc nuclei are clearly present at the inter- 
face at low driving forces. It is also interesting to note that some oscillations are 
present in molecular dynamics simulations of grain-boundary migration in planar,
high-angle twist grain boundaries (see Fig. 5 in Schönfelder et al. (1997), for
example).


13.2.3 
Local Analysis by Atom Probe Tomography 


Whereas the previous sections on microscopy focused on surface and structural 
characterization, we now address a technique dedicated to local chemical analysis. 
Scientists who are planning chemical micro- or nanoanalysis of materials
nowadays have a choice among a vast number of different techniques. These are based
either on inelastic interactions of electrons and ions with matter (e.g., energy-dispersive
X-ray spectroscopy (EDXS), electron energy loss spectroscopy (EELS),
Rutherford backscattering (RBS)) or on direct mass spectrometry (e.g., secondary
ion mass spectrometry (SIMS)). Atom probe tomography (APT) represents a 
branch of the latter group. Since this technique is particularly suited to the inves- 
tigation of metals and alloys and it has the unique capability of producing 3D 
maps with single-atom sensitivity, it is worth choosing this method for a detailed 
description. 

Atom probe tomography applies the principle of field ion microscopy (FIM) 
and represents the latest progress in this area. Thus, the method dates back to 
the pioneering work of E. W. Müller, an ingenious physicist, who invented the
field ion microscope in 1951 (Müller 1951) after years of experiments with elec- 
tron emission microscopy that turned out to be resolution-limited by the finite 
Fermi energy of the electrons. With field ion microscopy, he could already dem- 
onstrate atomic resolution images of tungsten surfaces in 1957 (Müller 1957). He 
and his coworkers were also the first to combine FIM with time-of-flight mass 
spectrometry in 1965, creating in this way a unique tool for quantitative chemical 
analysis in the nanometer range, which attracts particular attention today in the 
era of nanotechnology (Müller et al. 1968). Although it achieves image
magnifications of the order of 10⁶, and thereby atomic resolution, an FIM is a
surprisingly simple instrument if compared to the complex electron optics of 
modern electron microscopes, since not a single imaging lens is needed. Owing 
to its simple projective geometry the instrument does not suffer from all 
the tedious stability problems to which an electron microscopist is accustomed. 
However, only recent progress in detector technology, of which the important 
milestones were the introduction of microchannel plates, CCD cameras, fast 
charge-to-digital converters, and picosecond time measurement, made available 
single-ion detectors with sufficient spatial resolution and quite high detection fre- 
quencies. Using this modern equipment the early atom probe of Müller advanced 
to a modern instrument that allows the almost complete, three-dimensional re- 
construction of the spatial arrangement of a few million of atoms contained in a 
volume of typically 10° nm?. 

Meanwhile various instrumental concepts of 3D atom probes have been 
designed and put into operation. The first instrument functioning well was the 
so-called position-sensitive atom probe (PoSAP) described in 1988 by Cerezo 
et al. (1988). An improved instrument with the capability to process multiple 
events is the tomographic atom probe (TAP) presented somewhat later (1993) by 
Blavette and coworkers (Deconihout et al. 1993). Since this instrument was deliv- 
ered to locations throughout the world it has become popular and has lent its 
name to the general term of the method: “atom probe tomography.” Nowadays 
technical development is in continuous progress and new instruments are introduced
regularly. The most recent achievement is the so-called local electrode atom
probe (LEAP) (Kelly et al. 2004). This instrument moderates the serious restric- 
tions in specimen geometry, namely that a needle of high aspect ratio is needed, 
by placing a micrometer-sized electrode in front of an array of microtips. Also, the 
total number of atoms analyzed per measurement, and thus the size of the reconstructed
volume, increase by one to two orders of magnitude.

This section can provide only a rather short overview of the latest atom probe
methods. First, the physical principles of the measurement tools are described
and the algorithms for 3D reconstruction are introduced. Then, the accuracy of
the method and its limitations are discussed on the basis of simulated measurements.
Finally, in Section 13.2.3.7, a few selected case studies are presented
to demonstrate the application of the method to the physics of alloys. Readers 
who are interested in further details are referred to recent textbooks (Miller et al. 
1996; Miller 2000) on the method and also to basic literature on field ion micros- 
copy (Tsong 1990). 


13.2.3.1 The Functional Principle of Atom Probe Tomography 

All FIM techniques make use of the fact that electrical fields are concentrated 
at tips of sharp curvature. With moderate voltages supplied, enormous field 
strengths in the range of some 10 V nm⁻¹ are easily obtained at the apex of
nanometer-sized tips, whereas fields of such a magnitude would never be ob- 
tained with macroscopic geometries. So, a typical field ion microscope consists 
of an ultrahigh vacuum chamber with a specimen stage holding the sample tip, 
a high-voltage supply, and a viewing screen with the capability to image single ion 
impacts; see Fig. 13.35. A positive potential is supplied to the metallic specimen 
while the entrance face of the screen is kept at ground. In order to reduce thermal 
energies, a cryostat is required to cool the tip to about 20 K. The field at the tip
surface is controlled by the voltage supplied, according to Eq. (62), in which V
and β denote the voltage supplied to the tip and a shape factor, respectively, and
R is the radius of curvature of the tip apex.

\[ E_{\mathrm{surf}} = \frac{V}{\beta R} \tag{62} \]

Fig. 13.35 Principle of a field ion microscope.


For electrodes of ideal spherical symmetry this shape factor would be exactly 
equal to 1. But in reality the shaft part of the needle-shaped specimens leads to a 
field modification that is reflected by a β factor in the range 7-10 depending on
the specimen’s geometry. With increasing distance from the surface, the field de- 
cays logarithmically, which means that the dominant part of the field already 
drops in the first few millimeters. 
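
To give a feeling for the magnitudes involved, the surface field can be evaluated for a representative tip. This is a minimal sketch assuming that Eq. (62) has the form E = V/(βR), with R the apex radius; the operating point of 10 kV and R = 50 nm is chosen here for illustration and is not taken from the text.

```python
def surface_field_v_per_nm(voltage_v: float, beta: float, radius_nm: float) -> float:
    """Field at the tip apex, E = V / (beta * R), returned in V/nm."""
    if beta <= 0 or radius_nm <= 0:
        raise ValueError("beta and radius must be positive")
    return voltage_v / (beta * radius_nm)


# Illustrative (assumed) operating point: 10 kV on a tip of 50 nm apex radius.
voltage_v = 10_000.0
radius_nm = 50.0

for beta in (7.0, 10.0):   # range of shape factors quoted in the text
    field = surface_field_v_per_nm(voltage_v, beta, radius_nm)
    print(f"beta = {beta:4.1f}: E_surf = {field:5.1f} V/nm")

# For comparison: an ideal sphere (beta = 1) of 1 cm radius at the same voltage.
macroscopic = surface_field_v_per_nm(voltage_v, 1.0, 1.0e7)   # 1 cm = 1e7 nm
print(f"1 cm sphere: E_surf = {macroscopic:.1e} V/nm")
```

The nanometer-sized apex reaches fields of a few tens of V nm⁻¹, whereas the macroscopic electrode at the same voltage stays several orders of magnitude below that level.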

In order to produce a field ion micrograph of the tip surface, an imaging gas, 
usually He, Ne, or a mixture of both, is introduced into the vacuum chamber. The 
gas atoms are polarized within the inhomogeneous field close to the apex and 
drawn toward the tip. The energetic situation for the shell electrons of a gas atom 
is sketched in Fig. 13.36. If a positive voltage is supplied to the tip, the electronic 
states of the gas atom become elevated with respect to the Fermi energy of the 
metallic sample. Provided the field strength is sufficient and the atom is at a suit- 
able distance, a finite probability exists that an electron tunnels from the atomic 
state into the band structure of the metallic specimen. After having lost an electron,
the positively charged gas atoms are accelerated toward the imaging screen.
There, by means of a multichannel plate and a phosphor anode, each ion
impact produces a visible light flash via the principle of secondary electron
multiplication.

Fig. 13.36 Electronic states of the metallic sample and an image gas
atom close to the tip. Atomic states are elevated by the field E with
respect to the metallic states. φ and I denote the work function of the
metal and the ionization energy of the gas atom, respectively.

The potential well for the electron transfer by tunneling is sensitively controlled 
by the local field strength. As a consequence, the ionization rate of the gas atoms 
is a function of (a) the tip voltage and (b) the surface topography. Because of the 
discrete atomic structure of the surface, the local field varies periodically along 
the surface with a wavelength in correlation with the interatomic distance. In par- 
ticular, the edges of the atomic terraces are protruding features, and thus regions 
of elevated field strength and pronounced ionization rate. 

The ion trajectory is determined by the shape of the electric field. Since the 
thermal energy of the ions is rather low compared to the energy gain within the 
field, the ions are practically starting at rest. As a consequence, there is a one-to- 
one correspondence between the location of ionization at the tip surface and the 
imaged position on the screen. Regions of high ionization rate at the specimen 
surface are represented by locations of high brightness on the screen. As a conse- 
quence, protruding edges of lattice planes, the atomic terraces, are imaged as 
bright concentric rings surrounding low-indexed poles in the case of crystalline 
lattice structures. This is illustrated by Fig. 13.37, which presents a comparison 
of a field ion micrograph and a corresponding ball model of the imaged structure. 

Fig. 13.37 Ball model of a spherical apex (left) in comparison to an
experimental field ion micrograph. Protruding atoms are represented by
bright dots. The image structure comprising concentric rings is a
natural consequence of the crystalline packing and the tip-shaped
geometry. Panel labels: R = 10 nm and R = 15 nm. (By courtesy of V. Vovk,
University of Münster.)

To achieve a clear field ion micrograph the so-called best imaging field must be
established at the tip surface. To a first approximation this field strength is characteristic
of the imaging gas (e.g., 30 V nm⁻¹ and 41 V nm⁻¹ for Ne and He, respectively).
If the field strength exceeds this imaging level, a further process is
initiated, so-called field evaporation. Reaching a critical evaporation field characteristic
of the material under investigation, the specimen atoms themselves are
ionized, field desorbed and accelerated toward the screen, similarly to the pre- 
viously discussed imaging gas atoms. This field evaporation mechanism repre- 
sents the basis of the local chemical analysis by an atom probe. The evaporation 
is triggered by high-voltage pulses of a few nanoseconds’ duration. With a suit- 
able control of the pulse amplitude the evaporation rate can be properly con- 
trolled (e.g., to about 0.01 atoms per pulse), so that the probability of multiple 
events becomes quite low. Thus, atoms may be counted and evaluated one by one. 
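
How rare multiple events become at such a low evaporation rate can be estimated with a simple counting model. The sketch below assumes that the number of ions evaporated per pulse follows a Poisson distribution; this statistical model is an assumption made here for illustration and is not stated in the text.

```python
import math


def pulse_statistics(mean_ions_per_pulse: float) -> dict:
    """Probabilities of empty, single and multiple events per pulse (Poisson model)."""
    lam = mean_ions_per_pulse
    p_empty = math.exp(-lam)
    p_single = lam * math.exp(-lam)
    p_multiple = 1.0 - p_empty - p_single
    return {
        "empty": p_empty,
        "single": p_single,
        "multiple": p_multiple,
        # Fraction of the pulses that produced any ion which produced more than one.
        "multiple_given_hit": p_multiple / (1.0 - p_empty),
    }


stats = pulse_statistics(0.01)   # evaporation rate quoted in the text
for name, value in stats.items():
    print(f"{name:>20s}: {value:.3e}")
```

Under this assumption only about one in two hundred successful pulses carries more than one ion, which is why atoms can be counted one by one.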

In principle, such an analytical instrument, the 3D atom probe, consists of 
the same components as the FIM shown in Fig. 13.35. Only the screen has to be 
replaced by a position-sensitive detector system of single-ion sensitivity, and the 
high-voltage supply must allow the required voltage pulsing. Using the voltage 
pulse as the start signal for an accurate time measurement and the detector sig- 
nal for stopping, the time-of-flight (ToF) of the ions and thus their mass and 
chemical identity can be measured individually. The flight path between tip and 
detector amounts to about 0.5 m, leading to typical flight times in the microsec- 
ond range. During a measurement, up to several million detector events, the re- 
spective flight times, and the hitting positions are recorded. From these raw data 
the original spatial arrangement of the different atomic species is reconstructed 
subsequently to the measurement by efficient numerical methods which will be 
discussed in more detail below. 
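
The link between flight time and mass can be sketched with elementary energy conservation, neV = mv²/2. The example below assumes that the ion reaches its full kinetic energy immediately at the tip (the short acceleration region is neglected) and uses an illustrative tip voltage of 10 kV; function names are chosen here and do not refer to any instrument software.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge (C)
AMU = 1.66053906660e-27      # atomic mass unit (kg)


def flight_time_s(mass_amu: float, charge_state: int, voltage_v: float, path_m: float) -> float:
    """Flight time of an ion of given mass and charge state over a field-free path."""
    speed = math.sqrt(2.0 * charge_state * E_CHARGE * voltage_v / (mass_amu * AMU))
    return path_m / speed


def mass_to_charge_amu(time_s: float, voltage_v: float, path_m: float) -> float:
    """Mass-to-charge-state ratio m/n (in atomic mass units) from a measured flight time."""
    return 2.0 * E_CHARGE * voltage_v * time_s**2 / (AMU * path_m**2)


# Illustrative case: singly charged ion of mass 27 u, 10 kV, 0.5 m flight path.
t_flight = flight_time_s(mass_amu=27.0, charge_state=1, voltage_v=10_000.0, path_m=0.5)
recovered = mass_to_charge_amu(t_flight, voltage_v=10_000.0, path_m=0.5)

print(f"flight time: {t_flight * 1e6:.2f} microseconds")
print(f"recovered m/n: {recovered:.2f} u")
```

Note that the time of flight yields only the ratio m/n, so that the charge state has to be inferred from the known isotope masses of the elements present.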
Fig. 13.38 Design of a modern tomographic atom probe. The dedicated
chamber layout makes it possible to switch between different detector
geometries: the straight flight tube allowing a short flight path to
optimize the open angle of the instrument; and the reflectron
arrangement yielding an improved mass resolution.

Figure 13.38 is a schematic drawing of a modern conventional 3D atom probe,
for illustration. In contrast to the concepts introduced above, two modifications
should be noticed.

• Instead of a straight flight tube, a reflectron geometry is used
with an ion mirror which leads to parabola-shaped ion
trajectories. This geometry compensates for fluctuations in
the kinetic energy of individual ions. A faster ion will
penetrate deeper into the mirror field and so will have a
longer flight path. If length and voltage of the reflectron are
adjusted correctly, ions of identical mass but leaving the tip
with slightly different velocities will hit the detector after
identical flight times. In this way, the mass resolution is
significantly improved.

• The evaporation pulse is supplied as a negative pulse to an
extraction electrode in front of the tip, which allows shorter
and better defined pulse shapes.
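
The energy compensation of the reflectron can be demonstrated with a one-dimensional toy model. The sketch below assumes a field-free drift of total length L followed by a uniform retarding field entered at normal incidence (the real instrument uses inclined, parabola-shaped trajectories); for this idealized single-stage mirror the flight time becomes independent of energy to first order when the drift length equals four times the penetration depth. All parameter values are illustrative.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge (C)
AMU = 1.66053906660e-27      # atomic mass unit (kg)


def ion_speed(energy_v, mass_amu, charge_state):
    """Speed (m/s) of an ion with kinetic energy charge_state * e * energy_v."""
    return math.sqrt(2.0 * charge_state * E_CHARGE * energy_v / (mass_amu * AMU))


def time_straight(energy_v, mass_amu, charge_state, drift_m):
    """Flight time through a field-free tube of length drift_m."""
    return drift_m / ion_speed(energy_v, mass_amu, charge_state)


def time_reflectron(energy_v, mass_amu, charge_state, drift_m, mirror_field_v_per_m):
    """Field-free drift plus the turn-around time in a uniform retarding field."""
    v = ion_speed(energy_v, mass_amu, charge_state)
    decel = charge_state * E_CHARGE * mirror_field_v_per_m / (mass_amu * AMU)
    return drift_m / v + 2.0 * v / decel


def relative_spread(time_fn, nominal_v, rel_width=0.005, samples=11):
    """Relative flight-time spread for energies within +/- rel_width of nominal."""
    energies = [nominal_v * (1.0 + rel_width * (2.0 * i / (samples - 1) - 1.0))
                for i in range(samples)]
    times = [time_fn(u) for u in energies]
    return (max(times) - min(times)) / time_fn(nominal_v)


nominal_v, drift_m = 10_000.0, 0.5          # assumed energy (eV per charge) and drift length
mass_amu, charge_state = 27.0, 1
mirror_field = 4.0 * nominal_v / drift_m    # makes drift length = 4 x penetration depth

spread_straight = relative_spread(
    lambda u: time_straight(u, mass_amu, charge_state, drift_m), nominal_v)
spread_reflectron = relative_spread(
    lambda u: time_reflectron(u, mass_amu, charge_state, drift_m, mirror_field), nominal_v)

print(f"straight tube: {spread_straight:.2e}")
print(f"reflectron:    {spread_reflectron:.2e}")
```

In this model an energy spread of ±0.5% produces a flight-time spread of about 0.5% in the straight tube, whereas only a second-order residue remains with the mirror.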


13.2.3.2 Two-Dimensional Single-Ion Detector Systems

The rapid development of atom probe tomography in recent years was only made 
possible by the remarkable evolution of spatially resolving detector systems with 
single-ion sensitivity. In the last two decades several detector concepts have been 
proposed and put into operation. Systems that are currently in use will be dis- 
cussed briefly. 

All available detector concepts are based on a stack of two to three “multichan- 
nel plates” (MCPs). Such an MCP is an amplifier working according to the prin- 
ciple of secondary electron multiplication. It is made of thousands of small glass 
tubes, about 25 µm in diameter and 1 mm in length, which are packed in parallel
alignment to form a plate of about 1 mm thickness (see Fig. 13.39). The front and 
back sides of the plate are coated by thin metallic films serving as electrodes to 
supply a voltage in the region of 1 kV. Usually the tubes are tilted by about 10° 
to the plate normal to achieve a higher detection probability. An ion hitting the 
inner wall of such a glass tube will induce a few secondary electrons, which are 
accelerated further by the supplied field. On their way toward the back of the 
plate, they hit the glass wall several times and produce further secondary electrons.
This cascade process will finally yield a cloud of about 10⁴ electrons. If a
stack of two or three MCPs is used instead of a single one, the individual amplification
factors will be multiplied, so that a single ion hitting the front with sufficient
energy will finally produce a cloud of about 10⁶ elementary charges, which
is sufficient for further electronic evaluation. An MCP is a fast device. With opti- 
mized electronic circuitry the rise time of single ion pulses is in the region of 
100 ps. The spatial resolution of the MCP is controlled by the dimension of the 
glass tubes. 

Fig. 13.39 Functional principle of a multichannel plate (MCP), an
arrangement of thousands of secondary electron multipliers working in
parallel. Each of them is made of a tiny glass tube, 25 µm in diameter.

For mere imaging, as for conventional FIM, it is sufficient to place a phosphor-coated
screen behind the exit face of the MCP, so that each electron cloud
produces a short light flash. To obtain quantitative positional information, either
the optical information of the screen can be recorded by a CCD camera or the
electron clouds need to be measured directly by suitable electronics. For the latter 
case, a multiple anode array is placed behind the MCP and connected to sensitive 
preamplifiers and fast converters to transform analog charge or time information
into digital data. Various concepts may be distinguished by their different
layouts of the anode array and the complexity of the electronics. Historically, the
first system to function well was the so-called position-sensitive atom probe
(PoSAP) built around a “wedge and strip anode” (Cerezo et al. 1988). The name
of this anode type is self-explanatory regarding the layout sketched in Fig. 
13.40(a). The geometry is designed in such a way that the relative fractions of
the total charge measured on the three electrodes Qx, Qy, and Qz vary with the
position of the electron cloud. For the layout shown, the position may be calculated
in a straightforward manner by applying Eqs. (63). (Here and in the following,
uppercase letters represent detector coordinates while positions on the specimen
are described by lowercase letters.)

\[ X \propto \frac{Q_x}{Q_x + Q_y + Q_z}, \qquad Y \propto \frac{Q_y}{Q_y + Q_z} \tag{63} \]

Fig. 13.40 Anode layouts for the readout of the positional information
of a channel plate: (a) wedge and strip detector; (b) TAP detector;
(c) delay line detector. (Instead of the double-wire Lecher lines, only a
single wire is shown to illustrate the principle.)

Since only three independent anodes are used, the required electronics is rea- 
sonably simple. However, the layout has the important drawback that the total 
area of each electrode and consequently the respective capacitances are quite large
so that the drain of charges after the impact takes a considerable time. If two 
ions hit the detector within this time gap, they cannot be separated. To reduce 
the risk of such double events, the operator is forced to perform the evaporation 
of the tip very slowly. 

To circumvent this problem, the so-called tomographic atom probe (TAP) detec- 
tor was designed shortly afterward as a square-anode array of 96 independent 
electrodes (see Fig. 13.40b). Choosing the correct distance between the MCP and 
anode, the size of the electron cloud is adjusted so that an individual electron 
cloud hits at least three or four electrodes and its central position may be deter- 
mined by weighting the charge fractions on the respective electrodes. Since 
many other unused electrodes are available, besides those that are hit by a first 
event, most multiple events can be separated. Due to this parallel design the pos- 
sible analysis rate of the TAP detector becomes much higher than with the earlier 
PoSAP, but of course this is at the cost of much more elaborate electronic equip- 
ment. Nevertheless, there is still a certain probability that the charge clouds of 
neighboring events may overlap and cannot be resolved. 
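
The weighting of charge fractions mentioned above amounts to a center-of-charge calculation. The following is a minimal sketch of that idea, not the actual TAP evaluation software; the electrode pitch of 8 mm, the index convention, and the charge values are hypothetical.

```python
def charge_centroid(charges, pitch_mm):
    """Charge-weighted mean position (mm) of the electrodes hit by one electron cloud.

    `charges` maps (column, row) electrode indices to the measured charge
    (arbitrary units); electrode centers sit at (index + 0.5) * pitch_mm.
    """
    total = sum(charges.values())
    if total <= 0:
        raise ValueError("no charge collected")
    x = sum((col + 0.5) * pitch_mm * q for (col, _row), q in charges.items()) / total
    y = sum((row + 0.5) * pitch_mm * q for (_col, row), q in charges.items()) / total
    return x, y


# Hypothetical event: one cloud spread over four neighboring electrodes.
hit = {(4, 4): 0.4, (5, 4): 0.3, (4, 5): 0.2, (5, 5): 0.1}
x_mm, y_mm = charge_centroid(hit, pitch_mm=8.0)
print(f"impact position: X = {x_mm:.1f} mm, Y = {y_mm:.1f} mm")
```

Because the cloud is shared among several electrodes, the position is obtained with a resolution much finer than the electrode pitch, provided the clouds of two ions do not overlap.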

To improve the spatial resolution, systems have been built which combine 
anode arrays of low resolution with a gated CCD camera that localizes individual 
light flashes on the screen, with high accuracy. However, since the gating of a 
CCD camera is rather slow, the flight time and in particular the sequence of im- 
pacts hitting the detector after a pulse cannot be determined from the optical in- 
formation of the camera alone. The CCD data have to be complemented by the 
measurement of hitting times on an auxiliary anode array and correlated suitably. 
To realize this concept, in the so-called optical PoSAP (Cerezo et al. 1994) the in- 
formation of the screen is divided by a beam splitter. In this way, part of the im- 
age intensity can be used to determine the impact flashes on an auxiliary anode 
array using the scintillation principle; another part is evaluated by the CCD cam- 
era as sketched in Fig. 13.41. From the figure it becomes obvious that rather com- 
plex equipment is required. As an alternative, optically transparent electrodes 
with phosphor coatings may be used (Deconihout et al. 1998). Nevertheless, in 
both cases the data rate is limited by the rather slow scanning rates of the CCD 
camera so that the optical principle is not competitive, at least at present.

Fig. 13.41 Principle of the optical PoSAP detector. The combination of a
CCD camera and a resistive anode array allows high positional accuracy 
and at the same time multievent detection. 


Most recent instruments are equipped with a detector applying the delay line 
principle which was proposed first in 1987 (Keller et al. 1987). Instead of flat elec- 
trode areas, two independent double-wire spirals are used which are wound along 
the X and Y axes of the detector used to determine the X and Y position, as indi- 
cated schematically in Fig. 13.40(c). Each double wire represents a Lecher line, on 
which the pulse signal propagates with the velocity of light. Both ends of the 
Lecher line are connected to a fast multichannel time-to-digital converter (TDC) 
with subnanosecond resolution. If the ion hits at the center of the detector, the 
pulse signals will propagate symmetrically to both ends of the Lecher line and 
will reach the TDC at exactly the same time, whereas for an asymmetric impact 
position the two time signals will differ considerably. In Fig. 13.42 the sum and 
the difference of time signals are presented as they have been collected for many 
independent impacts on a circular detector of 120 mm diameter and spiral spac- 
ing of the anode of 1 mm. The sum of the two time signals represents an instru- 
mental constant and corresponds approximately to the propagating time from 
one end of the line to the other. A time interval of 150 ns can be measured well. 
The time difference between the two signals is proportional to the position according
to Eq. (64), where t_l and t_r are the arrival times of the pulse at the two
ends of the line and the calibration parameter v_⊥ denotes the propagation
velocity along the spiral axis, which amounts to about 0.4 mm ns⁻¹.

\[ X = v_{\perp}\,(t_l - t_r) \tag{64} \]


An analogous equation holds for the Y direction. Since a fast time-to-digital 
converter is already required for ToF mass spectrometry, delay line detectors are 
a very economical solution. Furthermore, they allow very high data rates, so that 
evaporation pulses may be applied with frequencies of up to 200 kHz. 
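
The evaluation of a delay line signal can be sketched as follows. The example assumes that Eq. (64) has the form X = v(t_l − t_r), takes the calibration velocity of 0.4 mm ns⁻¹ and the time sum of 150 ns from the text, and introduces a hypothetical tolerance for the time-sum check; the sign convention is also an assumption.

```python
V_CAL_MM_PER_NS = 0.4   # calibration velocity quoted in the text
T_SUM_NS = 150.0        # approximate constant sum of both arrival times


def delay_line_position(t_left_ns, t_right_ns, tol_ns=2.0):
    """Return the impact coordinate X (mm) from the two arrival times.

    Returns None when the time sum deviates from the instrumental constant
    by more than tol_ns, i.e. when the two signals cannot stem from one hit.
    """
    if abs((t_left_ns + t_right_ns) - T_SUM_NS) > tol_ns:
        return None
    return V_CAL_MM_PER_NS * (t_left_ns - t_right_ns)


print(delay_line_position(75.0, 75.0))     # impact at the detector center
print(delay_line_position(112.5, 37.5))    # off-center impact
print(delay_line_position(150.0, 0.0))     # impact at the edge of the detector
print(delay_line_position(100.0, 90.0))    # inconsistent time sum, rejected
```

With these numbers the full range of time differences, ±150 ns, maps onto ±60 mm, in agreement with the detector diameter of 120 mm.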
Fig. 13.42 Evaluation of time signals of the delay line. The sum (top) of
the two signals is a constant which may be used to relate time signals
correctly to individual events. The difference is a direct measure for the
position. Due to a circular channel plate, the frequency distribution
resembles a sinusoidal curve. (By courtesy of P. Stender, University of
Münster.)


13.2.3.3 Ion Trajectories and Image Magnification
In order to understand the properties of field ion micrographs and the quality of 
the volume reconstruction, the ion trajectories must be discussed in more detail. 
An idealized specimen will have axial symmetry, so that the potential and the 
field may be considered in a two-dimensional space with r and z, the coordinates 
perpendicular and parallel to the rotational axis of symmetry, respectively. For 
convenience, we split the expression for the electrical potential $\Phi$ into the
absolute tip voltage $V$ and a spatial distribution function $g$ so that Eq. (65)
holds, and the equations of motion can be written in accordance with classical
mechanics as Eq. (66), where $n$ and $m$ denote the charge state and the mass of
the ion, respectively.


$$\Phi(r, z) = V \cdot g(r, z) \tag{65}$$

$$\frac{d^2 r}{dt^2} = -\frac{neV}{m}\,\frac{\partial g}{\partial r}, \qquad \frac{d^2 z}{dt^2} = -\frac{neV}{m}\,\frac{\partial g}{\partial z} \tag{66}$$

Without any further calculation it is seen that the acceleration in both coordinate
directions depends on mass, charge state, and voltage in the same way. Thus, for
an ion initially at rest, the shape of the trajectory is independent of all these
variables. All the different species follow the same trajectory; only the required
flight time is characteristic of the given charge state and mass. This has the
important consequence that the fundamental imaging relationship between tip and
detector is universal for all species and that, apart from minor modifications due
to a slight difference in initial position, imaging gas and specimen atoms follow
the same trajectory. Therefore, the position-sensitive detector system may
be calibrated by means of field ion micrographs.
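
The universality of the trajectory can be made explicit by a short scaling argument. The rescaled time $\tau$ is introduced here only for illustration and is not part of the original derivation:

$$\tau := t\,\sqrt{\frac{neV}{m}} \quad\Longrightarrow\quad \frac{d^2 r}{d\tau^2} = -\frac{\partial g}{\partial r}, \qquad \frac{d^2 z}{d\tau^2} = -\frac{\partial g}{\partial z}$$

For an ion starting at rest, the initial conditions expressed in $\tau$ contain no parameters either, so the path $(r(\tau), z(\tau))$ is the same for all species. Only the conversion back to real time, $t = \tau\,\sqrt{m/(neV)}$, depends on $m/n$ and $V$, which is the dependence exploited in ToF mass spectrometry.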

A quite reasonable geometric representation of a well-prepared tip may be 
given by a truncated cone closed by a hemispherical cap, as indicated in Fig. 
13.43. In addition, field lines and model trajectories are shown. Since the ions 
start at rest, their motion follows the field lines initially. This means that they 
leave the spherical apex in a radial direction. Later, they deviate from the curva- 
ture of the field lines because of the forces of inertia and the trajectory becomes 
almost straight. While the initial position of the ion at the spherical cap is
described by the polar angle $\vartheta$, the imaged position at the detector may be
characterized by a smaller angle $\vartheta'$ (see Fig. 13.43). In order to calculate the
initial position on the tip’s surface from the image, only the function between the two angles
must be known. This function varies in detail from tip to tip as the individual 

Fig. 13.43 Electrical field and trajectory of an evaporated atom. The
impact position and initial location at the tip surface are related by a
point projection. The center of projection is shifted relative to the
center of the spherical cap by uR. As an alternative the ratio between
the polar angle $\vartheta$ and the projection angle $\vartheta'$ may be used for
evaluation.

Fig. 13.44 Relationship between measured angle $\vartheta'$ and polar angle $\vartheta$
as determined for electropolished tungsten tips. The polar angle has
been determined from crystallography. Tip axis aligned parallel to [011].
(By courtesy of P. Stender, University of Münster.)


shape does. However, from a practical point of view, it turns out that a simple
proportionality holds, which is conveniently described by an image compression
factor $\kappa$, defined by Eq. (67).

$$\kappa := \vartheta' / \vartheta \tag{67}$$

The compression factor can be calibrated by means of field ion micrographs of
single crystalline specimens, in which low-indexed lattice directions are identified
from symmetry considerations. See, for example, the indexed poles on the
micrograph of Fig. 13.37. The crystallographic angle between different poles and thus
the polar angle $\vartheta$ is known from crystallography, while the detection angle $\vartheta'$ is
determined from the position of the pole on the FIM micrograph and the flight
distance $L$ between tip and detector. Typical data determined for an
electropolished and field-developed tungsten tip are shown in Fig. 13.44. Obviously, the
linear relationship of Eq. (67) is well fulfilled. For this individual specimen a
compression factor of $\kappa = 0.54$ is determined.

It is easy to quantify the magnification of the analytical microscope on the basis
of the geometric model. The polar distance on the image, $D = L \sin\vartheta' \approx L\,\vartheta'$,
has to be compared to the distance at the hemispherical cap, $d \approx R\,\vartheta$, where
$R$ denotes the current radius of the apex. Thus, the magnification is given by
Eq. (68).


$$M = \frac{L\,\kappa}{R} \tag{68}$$

Recalling that a typical tip radius amounts to about 30 nm and the distance
between detector and tip may reach 0.5 m, a magnification of $10^7$ is easily obtained.
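
As a quick numerical check of this estimate, the following sketch evaluates the magnification for the values quoted in the text (30 nm tip radius, 0.5 m flight distance, compression factor 0.54). The function name and the 0.2 nm lattice spacing used for illustration are assumptions.

```python
def magnification(flight_length_m, tip_radius_m, kappa):
    """Image magnification M = kappa * L / R, cf. Eq. (68)."""
    return kappa * flight_length_m / tip_radius_m

M = magnification(flight_length_m=0.5, tip_radius_m=30e-9, kappa=0.54)
print(f"M = {M:.1e}")   # about 9e6, i.e. of the order of 10^7

# An illustrative lattice spacing of 0.2 nm on the tip is spread over
# roughly 2 mm on the detector:
print(f"{0.2e-9 * M * 1e3:.1f} mm")
```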

In Eq. (68) an interesting detail is noteworthy. The magnification of the micro- 
scope itself depends on the tip radius of the specimen. In other words, the speci- 
men represents the essential lens of the microscope. Therefore, atom probe to- 
mography will only function in a reliable manner if specimens are prepared 
carefully and can be well reproduced in shape. Furthermore, the tip radius in- 
creases during the measurement, as the specimen field-evaporates continuously. 
As a consequence, the magnification decreases during the measurement. In or- 
der to reconstruct the spatial arrangement of the atoms after the measurement, 
the development of the radius must be recorded or estimated in a suitable way. 
The introduced geometric model of a tip neglects the roughness of the surface 
on the atomic scale. In reality, edges of the atomic terraces and a faceting of the 
spherical surface — low-indexed surface orientations are emphasized in size due 
to anisotropy of the evaporation probability — will lead to slight modifications of 
the trajectories, which will be discussed later. 


13.2.3.4 Tomographic Reconstruction 

The atomic reconstruction of the investigated volume can be performed with sur- 
prisingly simple algorithms, which are outlined in this section. To reduce the 
mathematical effort, we assume that the specimen axis is aligned perpendicular 
to the detector plane, which is a suitable approximation for the majority of exper- 
imental work. The outlined scheme is based on the work of Bas et al. (1995). The 
general case, taking into account a relative tilt between tip and detector, can be 
treated in an analogous way. The interested reader may find appropriate formulae 
in Miller (2000) and Al-Kassab et al. (2003). 

The evaluation of data is conveniently subdivided into three steps: 1) the spe- 
cific mass m/n is calculated from the ToF, 2) the lateral position at the tip surface 
is calculated from the impact position at the detector, and 3) the depth scale along 
the symmetry axis of the specimen is determined from the data sequence. In the 
following, we express the impact position at the detector by Cartesian coordinates 
X and Y (see also Fig. 13.43 for further geometric parameters). 

Since the field lines are concentrated at the tip apex, the ions have already taken 
up the major fraction of their kinetic energy during the very first millimeter of 
their trajectory. Later the motion is almost straight and uniform, so that from 
conservation of energy the specific mass is calculated to sufficient approximation 
by Eq. (69). 


$$\frac{m}{n} = \frac{2\,t^2\,e\,(V_\mathrm{tip} + V_\mathrm{pulse})}{L^2 + X^2 + Y^2} \tag{69}$$


Using the image compression factor introduced earlier and the appropriate geo- 
metric relation for the detection angle [Eq. (70)], 


$$\tan\vartheta' = \sqrt{X^2 + Y^2}\,/\,L, \tag{70}$$
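the Cartesian coordinates of the position at the tip’s surface are determined by
Eqs. (71).


$$x = \frac{X}{\sqrt{X^2 + Y^2}}\,R \sin\vartheta = \frac{X}{\sqrt{X^2 + Y^2}}\,R \sin(\vartheta'/\kappa)$$

$$y = \frac{Y}{\sqrt{X^2 + Y^2}}\,R \sin(\vartheta'/\kappa) \tag{71}$$

$$\tilde{z} = R\,\bigl(1 - \cos(\vartheta'/\kappa)\bigr)$$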


In the last equation the axial coordinate $z$ has been marked by a tilde to express
that at this step the axial position is only a preliminary one. It is given relative to
the position of the tip front $z_0$, but as this reference point shifts during the
measurement, we have to correct the depth position in the final evaluation step. With
each evaporated atom the specimen is eroded by one atomic volume. Thus, the
number of atoms already detected represents a natural depth scale. To establish
this scale, the actual image magnification, which relates the sensitive area of the
detector to the investigated area at the apex, needs to be taken into account.
Expressing all this in a differential equation, we obtain Eq. (72), where $\Omega$ and $N$
denote the average volume per atom and the number of detected atoms,
respectively. The factor $p$ takes into account the fact that the detector has a limited
probability for detection ($p \approx 0.5$), and the magnification $M$ is calculated by
means of Eq. (68).


$$dz_0 = \frac{\Omega}{p\,A_\mathrm{measured}}\,dN = \frac{\Omega\,M^2}{p\,A_\mathrm{detector}}\,dN \tag{72}$$





Applying Eq. (72), the total shift of the tip front relative to its initial position at the
beginning of the measurement is found by integration and the final z coordinate
results from summing the two contributions: $z = z_0 + \tilde{z}$.
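
The three evaluation steps can be sketched as follows. This is a minimal illustration under the assumptions stated in the text (specimen axis perpendicular to the detector, constant radius during the step); the function names and all numerical inputs in the usage lines are hypothetical.

```python
import math

E_CHARGE = 1.602176634e-19   # elementary charge in C
AMU = 1.66053906660e-27      # atomic mass unit in kg

def mass_to_charge_amu(t_flight_s, voltage_v, L_m, X_m, Y_m):
    """Step 1, Eq. (69): specific mass m/n in atomic mass units."""
    m_over_n = 2.0 * E_CHARGE * voltage_v * t_flight_s**2 / (L_m**2 + X_m**2 + Y_m**2)
    return m_over_n / AMU

def tip_coordinates(X_m, Y_m, L_m, R_m, kappa):
    """Step 2, Eqs. (70) and (71): lateral position and preliminary depth."""
    rho = math.hypot(X_m, Y_m)
    if rho == 0.0:
        return 0.0, 0.0, 0.0
    theta = math.atan(rho / L_m) / kappa        # polar angle on the cap
    x = X_m / rho * R_m * math.sin(theta)
    y = Y_m / rho * R_m * math.sin(theta)
    z_tilde = R_m * (1.0 - math.cos(theta))
    return x, y, z_tilde

def depth_increment(omega_m3, p_detect, detector_area_m2, L_m, R_m, kappa):
    """Step 3, Eq. (72): advance of the tip front per detected atom."""
    M = kappa * L_m / R_m
    return omega_m3 * M**2 / (p_detect * detector_area_m2)

# Illustrative values: 0.5 m flight path, 30 nm radius, kappa = 0.54
x, y, z_tilde = tip_coordinates(0.03, 0.04, 0.5, 30e-9, 0.54)
dz0 = depth_increment(1.6e-29, 0.5, math.pi * 0.06**2, 0.5, 30e-9, 0.54)
print(x, y, z_tilde, dz0)
```

In a full reconstruction `dz0` would be accumulated atom by atom and added to `z_tilde`, with the radius updated along the way.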

In order to evaluate Eqs. (71) or (72), the instantaneous tip radius R must be 
known. If the evaporation properties of the investigated material are reasonably 
homogeneous, this radius can be concluded from the total voltage (tip plus elec- 
trode voltage) that is required to obtain a given evaporation rate. The evaporation 
field strength at this rate is known as a material parameter so, by inversion of Eq. 
(62) we can get Eq. (73) to determine the actual tip radius. 


$$R = \frac{U_\mathrm{tot}}{\beta\,E_\mathrm{evap}} \tag{73}$$


However, this method is only feasible if the evaporation field strength $E_\mathrm{evap}$ stays
constant during the measurement. In heterogeneous specimens, e.g., thin-film 
multilayers, this prerequisite is usually not fulfilled. In this case, the evolution 
of the radius must be estimated under the assumption of a constant cone angle 
of the tip. Having measured the initial radius by transmission electron micros- 

Fig. 13.45 Reconstruction of the atomic arrangement of a Cu/Py
(Py = Ni₇₉Fe₂₁, Permalloy) multilayer deposited onto a tungsten
substrate. The atomic positions within the marked box are plotted in
higher magnification on the right. (After Ene et al. 2005.)


copy or estimated it from the tip voltage when the measurement starts, the in- 
crease of radius can be calculated based on simple geometry (see Fig. 13.43) 
with Eq. (74). 


$$dR = \frac{\sin\gamma}{1 - \sin\gamma}\,dz_0 \tag{74}$$

A typical reconstruction of experimental data, calculated by the method outlined, 
is shown in Fig. 13.45. In this case the volume was taken from a Cu/NiFe
multilayer specimen. In most reported measurements the analyzed region corresponds
to a rectangular prism of typically 15 × 15 × 100 nm³. The position of each
detected atom is marked by a gray-scale-coded dot.

After the spatial distribution of the atoms has been reconstructed, various aver- 
ages, composition profiles, 2D compositional maps, and isoconcentration sur- 
faces may be derived by sorting and counting the different species in suitable 
subvolumes. As shown in the detail of Fig. 13.45 (right-hand side), the recon- 
struction often reveals low-index lattice planes that are orientated almost perpen- 
dicular to the tip axis. If the required parameters — either the evaporation field 
strength and field compression factor or the initial radius and shaft angle — are 
chosen correctly, the spacing between the resolved planes indeed corresponds to 
the spacing of the lattice structure under consideration. Fourier methods are be- 
ing used in order to detect such periodicities inside the reconstructed volumes. 
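
A minimal sketch of such a Fourier test is given below. It scans a set of trial spacings and picks the one with the strongest Fourier amplitude of the depth coordinates. The function name, the trial range, and the synthetic data (planes 0.2 nm apart with 0.02 nm scatter) are assumptions made for illustration; real evaluations typically work on the full 3D data.

```python
import cmath
import math
import random

def dominant_spacing(z_values, candidates):
    """Return the trial spacing d that maximises |sum_j exp(2*pi*i*z_j/d)|,
    i.e. the strongest Fourier component of the depth distribution."""
    def amplitude(d):
        return abs(sum(cmath.exp(2j * math.pi * z / d) for z in z_values))
    return max(candidates, key=amplitude)

# Synthetic depth coordinates in nm: 100 planes, 0.2 nm apart, with scatter
random.seed(0)
z = [0.2 * random.randrange(100) + random.gauss(0.0, 0.02) for _ in range(2000)]

# Trial spacings from 0.150 to 0.299 nm; the range excludes higher harmonics
candidates = [0.15 + 0.001 * i for i in range(150)]
print(round(dominant_spacing(z, candidates), 3))   # close to 0.2
```

If the reconstruction parameters are chosen correctly, the spacing found in this way should match the known lattice plane distance; a systematic deviation points to a wrong depth calibration.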


13.2.3.5 Accuracy of the Reconstruction
As already discussed, the fundamental imaging law relies on a good description 
of the shape of the specimen, which needs to be produced by continuous 
field evaporation prior to the measurement. This process is also called the “field 
development” of the specimen. As soon as a steady state of tip development is 
reached, the tip geometry represents a surface of constant evaporation rate. For a 
homogeneous material, the tip is then bounded by an isosurface of the field 
strength. If this process of development is performed too fast or insufficiently, a 
variety of artifacts may be produced. Particularly dangerous is the partial fracture 
of a specimen before or during the measurement; such fracture usually produces 
surface topologies that are unsuitable for a reliable analysis. Furthermore, the 
depth scale will be erroneously calibrated due to the intermediate loss of material. 
Even if the experimentalist obeys all the rules of good experimental practice, 
the positioning of the atoms within the reconstruction is never perfect, at least 
as long as the evaluation scheme outlined in Section 13.2.3.4 is used. It is based 
on two important assumptions: (a) the tip apex may be represented by a perfect 
sphere; (b) the critical field strength required to evaporate the atoms is homoge- 
neous along the surface. The first assumption is only true as long as the atomic- 
scale roughness of the surface can be neglected. The second assumption is no
longer valid when alloys or intermetallic phases are investigated that comprise
several atomic species of different evaporation probabilities. Figure 13.46 represents
the situation when only a few atoms of a lattice plane are left, still sticking to the 
next plane. In this case, the electrical field that controls the trajectory of the atom 
next in the evaporation line will definitely deviate from the idealized field of the 
spherical surface due to the local disturbance of the remaining protruding atoms. 
Thus, to calculate the trajectories exactly, the structure which should be deter- 
mined from just these trajectories needs to be known in advance. In other 
words, we are faced with an implicit conundrum: This is one of the problems to 





Fig. 13.46 Schematic field distribution on a low-indexed lattice plane 
just before it is completely evaporated. The last-but-one atom is 
accelerated by a field which is significantly deformed by the last atom. 

be solved in the future. Meanwhile, for a sound evaluation of atom probe data,
the only promising strategy is to simulate the evaporation sequence of hypotheti- 
cal specimen structures and to compare the simulated reconstructions with those 
of real experiments. This procedure is quite analogous to the usual practice in 
high-resolution electron microscopy, where experimental images are compared 
to those simulated for hypothetical structures until a good match is found. 

For that task, Vurpillot et al. (2000, 2001) derived a simulation scheme that al- 
lowed evaluation of the spatial accuracy of the tomography and the influence of 
heterogeneous evaporation properties on theoretical grounds. As the simplest ge- 
ometrical model, which still reflects the microscopic features on the atomic scale, 
they suggest constructing the apex from a simple cubic arrangement of Wigner- 
Seitz cells. In a first step, the electrical field between the model tip and the detec- 
tor is calculated by solving Poisson’s equation numerically for the electrical poten- 
tial in vacuum by means of a finite element method. The electrical field strength 
at the positions of the surface atoms is determined, and the position of highest 
field strength selected. In a second step, the atom, that means the Wigner-Seitz 
cell at this specific position, is removed from the apex model and the field is re- 
calculated for the new configuration. In a final step, the trajectory of the selected 
ion is calculated between its initial position at the apex and the detector, based on 
classical mechanics and considering the acceleration of the ion within the electri- 
cal field. In the case of alloys or heterogeneous systems, the different evaporation 
probabilities of the atomic species must be taken into account. This means that, 
before the position of highest field is selected in the first step, the electrical field 
is scaled artificially by a factor varying from atom to atom in order to reflect the 
respective evaporation probability. 
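
The control flow of this scheme can be sketched as below. This is a toy illustration only, not the code of Vurpillot et al.: the numerical field solution is replaced by a user-supplied callable (`solve_field`, hypothetical), the species-dependent scaling is modeled by dividing the local field by an assumed evaporation field per species, and the trajectory calculation is omitted.

```python
def simulate_evaporation(surface_atoms, solve_field, evaporation_field, n_events):
    """Toy evaporation loop.

    surface_atoms     : dict atom_id -> species label
    solve_field       : callable(set of atom_ids) -> dict atom_id -> local field
                        (stand-in for the numerical field solution)
    evaporation_field : dict species -> field required to evaporate that species
    Returns the evaporation sequence as a list of (atom_id, species).
    """
    remaining = dict(surface_atoms)
    sequence = []
    for _ in range(min(n_events, len(remaining))):
        field = solve_field(set(remaining))          # recalculated after each removal
        # scaled field: species that evaporate easily are favoured
        next_atom = max(remaining,
                        key=lambda a: field[a] / evaporation_field[remaining[a]])
        sequence.append((next_atom, remaining.pop(next_atom)))
    return sequence

def toy_field(atom_ids):
    # Crude stand-in: atoms on an integer chain, field enhanced at exposed ends
    return {a: 1.0 + 0.5 * ((a - 1 not in atom_ids) + (a + 1 not in atom_ids))
            for a in atom_ids}

atoms = {0: "A", 1: "A", 2: "B", 3: "A", 4: "A"}
print(simulate_evaporation(atoms, toy_field, {"A": 1.0, "B": 0.6}, 5))
```

In this toy chain the B atom evaporates first although it does not sit at an exposed end, which mimics the preferential evaporation of a low-field species in an alloy.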

By repeating this scheme recursively to predict the impact positions of several 
thousand atoms, an atom probe measurement can be simulated quite realistically. 
General features of experimental data are well reproduced, although an artificially 
short flight distance between tip and detector and rather small specimens of 
about 20 nm radius had to be used to limit the computational effort. In Fig. 
13.47 the impact positions of several thousands of atoms on the detector are 
shown, as calculated from the simulated evaporation of a few lattice planes. In 
contrast to the naive expectation, the spatial density of the events on the detector 
is by no means homogeneous. Instead, lines of significant redistribution of the 
atoms are seen, which are related to low-index zones of the crystal structure. Ob- 
viously the redistribution is related to the special situation sketched in Fig. 13.46. 
The last few atoms sticking on a flat, low-index surface are affected by severe field 
distortions, and therefore their trajectory is disturbed significantly in comparison 
to a simple point projection. 

These subtle redistributions of the atoms limit the spatial resolution of the 
atom probe method and indicate to the experimentalist that specimens should 
be tilted during the measurement in such a way that low-indexed poles are 
avoided inside the measured volume. Deviations of the atom trajectories by the 
local surface topology are the main limiting factor for the instrument’s resolu- 
tion. This is made particularly clear by the positional shift of atom positions in an 

Fig. 13.47 Simulated impact positions on the detector area, during
evaporation of a few lattice planes of a simple cubic model alloy. Tip 
axis aligned along [001]. (After Vurpillot et al. 2000.) 


alloy compared to that of a pure specimen, as represented in Fig. 13.48. Because 
of the different evaporation probabilities of the two species, the local fields for the 
alloy are distorted differently than for a pure specimen. Thus, the reconstructed 
positions of the atoms are different for both model tips. However, as can be seen 
from Fig. 13.48, most positions agree to within about one lattice constant. Only a 

Fig. 13.48 Shift of reconstructed atom positions of an AB model alloy
with respect to their ideal position in a homogeneous specimen (x in
units of lattice constant). (After Vurpillot et al. 2000.)

minor fraction of atoms originating from zone line positions are shifted by much
larger amounts, up to five lattice constants. In this way, the simulation indicates 
the lateral accuracy of the tomography to be slightly better than one lattice con- 
stant, as long as poles and zone lines are avoided. 


13.2.3.6 Specimen Preparation 

Since atom probe tomography requires a dedicated sample geometry, possible 
preparation methods should be discussed briefly. Sharp, needle-shaped speci- 
mens are used for any field ion microscopy technique. Traditionally, these needles 
have been produced by electropolishing of thin metallic wires under in-situ 
control by means of optical microscopy. But at present thin films and other com- 
plex nanostructures, for which no conventional wire is usually available, are 
becoming more and more important. Thus, the art of specimen preparation is 
under rapid development. For many recent studies on thin-film materials, layer 
systems have been deposited onto tungsten tips, which serve as a substrate. To 
achieve an optimum shape, the freshly prepared tungsten substrates are
field-developed before the deposition. Since considerable stress is induced by the
electrical field during the measurement, many investigations are hindered by
insufficient mechanical stability of the interface between the substrate tip and coating.
Thus, this interface needs special care. Sometimes an interlayer, often chromium, 
is used as an adhesion aid; ion-beam cleaning of the substrates immediately be- 
fore deposition has also proven to be advantageous (Schleiwies and Schmitz 
2002). In Fig. 13.49(a), a typical thin-film specimen produced in this way, an
Al/Cu/Al trilayer deposited on tungsten, is presented. In this geometry, the main
analysis direction is aligned normal to the interfaces, so that these specimens 
are especially suited to investigation of reactive diffusion by local depth profiles. 





Fig. 13.49 (a) Example of an Al/Cu/Al trilayer on a tungsten substrate
tip, deposited by ion beam sputtering technique (by courtesy of C. Ene,
Göttingen). (b) Scanning electron micrograph of a layer specimen
prepared by electron beam lithography (by courtesy of J. Schleiwies,
Göttingen).

However, when interpreting the analytical results, one has to keep in mind that
the films are deposited onto a curved surface. Usually the curvature induces a 
rather small grain size, so that the microstructure is not directly comparable to 
that of thin films deposited onto conventional planar wafer substrates (Lang and 
Schmitz 2003). 

If this variation in microstructure cannot be tolerated, for example because the 
properties of technical devices are to be determined, tips have to be cut by lithog- 
raphy (Hono et al. 1993) or focused ion-beam techniques. A planar layer system 
is first coated with a suitable photoresist, exposed to electron beam lithography, 
and developed chemically to obtain a suitable etching mask. A typical sample is 
shown in Fig. 13.49(b). The tip is attached to a “handle” about 100 µm in size.
The wedge-shaped needles pointing to the right-hand side taper to a maximum 
thickness of 100 nm at the apex. After etching by means of an ion beam, the 
tips are removed from the substrate and glued to a supporting wire. In this spec- 
imen geometry the interesting interfaces are aligned parallel to the main direc- 
tion of analysis; thus the method is well suited to investigation of interfacial 
roughness or pinholes in multilayers of small periodicity. 

As an alternative approach, Larson (2001) described deposition of multilayers 
on the planar top surface of rectangular Si posts having a typical cross-section
of 10 × 10 µm². Since this size is already much larger than the film
thicknesses typically investigated, the growth conditions on top of the posts should
correspond to those on larger planar substrates. To allow the atom probe
measurement, the posts with their coated top surfaces need to be further sharpened in
a final preparation step. Focused ion beams have been used successfully in 
this case. Since energetic Ga beams of 15-30 keV are applied, the ion beam intro- 
duces considerable irradiation damage, which leads for example to ballistic mix- 
ing (cf. Chapter 8) close to the surface of the needle. Therefore one has to be care- 
ful to avoid artifacts. Usually this means that the atom probe measurement 
should include sufficiently large-volume regions located deep below the damaged 
surface region. 


13.2.3.7 Examples of Studies by Atom Probe Tomography 

With the following examples, the application of atom probe tomography to essen- 
tial problems of alloy physics will be documented. The topics selected here cannot 
be a complete survey over all the activities in the field, and the choice is certainly 
biased by the individual interests of the authors. Nevertheless, we try to include a 
fairly broad range of phenomena and to highlight the remarkable strength of the 
method in terms of a real three-dimensional analysis with outstanding spatial res- 
olution. The historical development should also be reflected somehow. 


13.2.3.7.1 Decomposition in Supersaturated Alloys 

In order to understand the early stages of decomposition, two fundamental 
mechanisms are distinguished from a theoretical point of view. In the limit of 
weak supersaturation, nuclei with a solute concentration close to thermodynamic 
equilibrium need to be formed by statistical fluctuations in order to initialize
the decomposition. With increasing solute content, the activation barrier to
nucleation decreases, until it vanishes at a critical composition, called the spinodal.
Within the spinodal composition regime, decomposition starts spontaneously. 
The mathematical model of this process dates back to as early as 1958 (Cahn 
and Hilliard 1958). First, the fundamental evolution equation was solved in a 
linear approximation, which makes it possible to represent the composition field 
by a superposition of compositional waves. Waves within a certain range of wave- 
lengths will increase in amplitude by uphill diffusion, which is a consequence of 
the thermodynamic driving forces of the alloy system. The wavelength of fastest 
growth will dominate the microstructure, so that periodic structures develop. (For 
spinodal decomposition, see also Chapter 7.) 
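
For orientation, the linear approximation can be written out in its standard textbook form. The symbols are introduced here for illustration and are not taken from this chapter: $\mathcal{M}$ is an atomic mobility, $f''$ the second derivative of the homogeneous free energy density with respect to composition, and $K$ the gradient energy coefficient.

$$\frac{\partial c}{\partial t} = \mathcal{M}\,\nabla^2\!\left(f''\,c - 2K\,\nabla^2 c\right), \qquad \delta c \propto e^{R(k)\,t}\cos(kx), \qquad R(k) = -\mathcal{M}\,k^2\left(f'' + 2K\,k^2\right)$$

Inside the spinodal $f'' < 0$, so that $R(k) > 0$ for all wave numbers below $k_c = \sqrt{-f''/(2K)}$. The amplification factor is largest at $k_m = k_c/\sqrt{2}$, which corresponds to the dominating wavelength $\lambda_m = 2\pi/k_m$.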

Many experimental studies have been focused on the kinetics of spinodal de- 
composition. Especially, for systems of long-range interaction such as polymer 
blends, the Cahn-Hilliard theory was well confirmed by the experimental data 
(Binder 2001). But in the case of binary alloys, it has been a long-term issue to 
prove the predicted continuous increase in compositional amplitudes by direct 
local analysis. If initial heterogeneities are too small to be resolved by the micro- 
scopic analysis, an intermediate average composition is determined. During 
coarsening, an increasing fraction of heterogeneities will overcome the limit 
of resolution. Thus, there is a real probability that an experimentally observed 
growth of concentration amplitudes is just a side effect of limited resolution of 
the instrument and coarsening of the microstructure. Interestingly, one had to 
wait for the development of atom probe tomography to make possible a suffi- 
ciently detailed analysis of morphology and local composition for a direct compar- 
ison with the theory of spinodal decomposition to be made. 

A complete and very detailed study in this field has been presented by Hyde, 
Miller, and coworkers on supersaturated FeCr alloys (Miller et al. 1995). Although 
this study was done with the early PoSAP instrument and therefore suffered from 
some experimental limitations, it is nevertheless worth reporting here, since this 
work also documents one of the very early direct comparisons of atom probe data 
sets to Monte Carlo (MC) simulations. Such a comparison is particularly advanta- 
geous, because both methods work with data sets of discrete atoms, and their 
data represent roughly the same number of atoms, say about 10°. Meanwhile,
this strategy has become common practice for the interpretation of results by 
atom probe tomography. In the study discussed here, the comparison to simu- 
lated atomic structures was also used to estimate the influence of experimental 
limitations, such as finite resolution and low data rate of the PoSAP, which leads 
to unbalanced detection probabilities for both species (Cr 30%, Fe 50%). For the 
MC simulation, a Metropolis algorithm based on pairwise interaction and the
swapping of randomly selected pairs of nearest neighbors was used. Besides 
the simulations, the experimental results were also interpreted by numerical solu- 
tions of the nonlinear Cahn-Hilliard-Cook theory of spinodal decomposition. 
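
The principle of such a simulation can be sketched with a deliberately simplified model. The code below is not the algorithm of the cited study: it uses a two-dimensional periodic square lattice instead of a three-dimensional crystal, a single hypothetical pair-interaction parameter `eps` (energy cost of an unlike nearest-neighbor bond), and arbitrary values for temperature and lattice size.

```python
import math
import random

NEIGHBOURS = ((1, 0), (-1, 0), (0, 1), (0, -1))

def unlike_bonds(lattice, i, j):
    """Number of unlike nearest neighbours of site (i, j), periodic boundaries."""
    n = len(lattice)
    return sum(lattice[i][j] != lattice[(i + di) % n][(j + dj) % n]
               for di, dj in NEIGHBOURS)

def total_unlike_bonds(lattice):
    n = len(lattice)
    return sum(unlike_bonds(lattice, i, j) for i in range(n) for j in range(n)) // 2

def metropolis_swap_sweep(lattice, eps, kT, rng):
    """One sweep of attempted swaps between randomly selected nearest neighbours."""
    n = len(lattice)
    for _ in range(n * n):
        i, j = rng.randrange(n), rng.randrange(n)
        di, dj = rng.choice(NEIGHBOURS)
        k, l = (i + di) % n, (j + dj) % n
        if lattice[i][j] == lattice[k][l]:
            continue                                   # nothing would change
        before = unlike_bonds(lattice, i, j) + unlike_bonds(lattice, k, l)
        lattice[i][j], lattice[k][l] = lattice[k][l], lattice[i][j]
        after = unlike_bonds(lattice, i, j) + unlike_bonds(lattice, k, l)
        dE = eps * (after - before)
        # Metropolis criterion: accept downhill moves, uphill with exp(-dE/kT)
        if dE > 0 and rng.random() >= math.exp(-dE / kT):
            lattice[i][j], lattice[k][l] = lattice[k][l], lattice[i][j]   # reject

rng = random.Random(1)
n = 16
lattice = [[1 if rng.random() < 0.45 else 0 for _ in range(n)] for _ in range(n)]
n_solute, bonds_start = sum(map(sum, lattice)), total_unlike_bonds(lattice)
for _ in range(50):
    metropolis_swap_sweep(lattice, eps=1.0, kT=0.4, rng=rng)
print(bonds_start, total_unlike_bonds(lattice))   # unlike bonds drop as domains form
```

Because atoms are only exchanged, the overall composition is conserved, which is the essential difference from spin-flip dynamics and the reason why coarsening is diffusion-limited.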

Different FeCr alloys were prepared, solution-treated, and isothermally an- 
nealed at a temperature of 773 K. The following discussion is restricted to the al- 
loy with a composition close to the middle of the miscibility gap (Fe-45 at.% Cr), 




Fig. 13.50 Atom probe tomography of Fe-45 at.% Cr after 500 h of 
annealing at 773 K (a, b) in comparison with a corresponding 
microstructure simulated by a Monte Carlo algorithm based on nearest- 
neighbor swapping (c). For differences between (a) and (b), see the 
text. (After Miller et al. 1995.) 


which is distinguished by the highest thermodynamic driving force. In Fig. 
13.50(a), a tomographic reconstruction of the atomic arrangement is shown as de- 
termined from a specimen annealed for 500 h. The decomposition into Fe- and 
Cr-rich regions is obvious. Isoconcentration surfaces are very helpful in working 
out the interconnectivity of the 3D morphology, which is typical of a process of 
decomposition into almost balanced volume fractions of the two phases. To deter- 
mine local compositions, a regular array of grid points is spanned throughout the 
investigated volume. 

Around each grid point, a cube-shaped subvolume is defined, and the composi- 
tion attributed to this point is calculated by counting all the different atomic spe- 
cies within the respective subvolume. Having determined a 3D composition field 
in this way, 1D profiles, 2D composition maps, and also isoconcentration surfaces 
are easily determined by data reduction. Since the statistical accuracy of a concen- 
tration value scales with the reciprocal square root of the number N of evaluated 
atoms [Eq. (75)], 


$$\Delta c = \sqrt{\frac{c\,(1 - c)}{N}} \tag{75}$$


when choosing the subvolume size one needs to compromise between low statis- 
tical scattering and high spatial resolution. Usually, subvolumes have a dimen- 
sion of one to two nanometers, while the grid spacing amounts to only a few 
angstroms. Thus, the composition field represents a moving average. Isoconcen- 
tration surfaces calculated in this way from the experimental data are presented 
in Fig. 13.50(b), and those determined from comparative MC data in Fig. 13.50(c). 
In both cases identical evaluation methods have been used. In a qualitative sense,
both representations indicate comparable microstructures that are characterized 
by interconnected Cr-rich domains, the size of which varies over a considerable 
range of length scales. 
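
The evaluation of a single grid point, including the statistical error of Eq. (75), can be sketched as follows. The data layout (a list of coordinate and species tuples) and the function name are assumptions for illustration.

```python
import math

def local_composition(atoms, centre, edge_nm, solute="Cr"):
    """Composition in a cube of edge length edge_nm around a grid point.

    atoms is a list of (x, y, z, species) tuples with coordinates in nm.
    Returns (c, delta_c, N); c and delta_c are None for an empty cube.
    """
    h = edge_nm / 2.0
    inside = [s for x, y, z, s in atoms
              if abs(x - centre[0]) <= h
              and abs(y - centre[1]) <= h
              and abs(z - centre[2]) <= h]
    n = len(inside)
    if n == 0:
        return None, None, 0
    c = sum(s == solute for s in inside) / n
    return c, math.sqrt(c * (1.0 - c) / n), n

# Trade-off between scatter and resolution for c = 0.45:
for n_atoms in (50, 200, 800):
    print(n_atoms, round(math.sqrt(0.45 * 0.55 / n_atoms), 3))
```

Doubling the cube edge increases the number of atoms eightfold and thus reduces the statistical scatter by a factor of about 2.8, at the price of a correspondingly coarser spatial resolution.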

In order to check the validity of the established theories, Hyde et al. (1995) per- 
formed a very detailed quantitative evaluation of parameters that are suitable to 
characterize the microstructure. The size and interspacing of the Cr-rich domains 
were determined by the autocorrelation technique. For the small analysis vol- 
umes of conventional 1D atom probes, the statistics of autocorrelation functions 
have been notoriously bad. This situation changed significantly with the introduc- 
tion of the 3D versions of the instrument. Statistical scattering becomes signifi- 
cantly reduced due to the increased number of atoms involved. Now radial auto- 
correlation functions R, [Eq. (71)] may be determined by evaluating the atoms in 
spherical shells around arbitrarily chosen central atoms of the measured data set. 

Fig. 13.51 Autocorrelation analysis of heterogeneous FeCr
microstructures as measured by atom probe tomography (a,c) and
simulated with a Monte Carlo technique (b,d). The correlation functions
indicate the average domain size (first minimum) and the typical
spacing between Cr-rich regions (first maximum) after various
annealing times. The respective kinetics are presented in the lower row
(c,d). (After Hyde et al. 1995.)

As shown in Fig. 13.51 these radial autocorrelation functions develop clearly
distinguished first minima and maxima that are interpreted as the average size and
spacing of Cr-rich domains, respectively. The growth of these parameters with
annealing time demonstrates the coarsening of the structure. Double logarithmic
plots (Fig. 13.51c,d) show power law behavior in both the experiment and the MC
simulation. With an exponent of 0.22 ± 0.05, the experimental kinetics
agrees very well with the MC simulation, for which the exponent amounts to 
0.21. The same autocorrelation analysis was done for microstructures calculated 
by the nonlinear continuum theory. Surprisingly, a growth exponent of 0.33 is 
found, which resembles conventional coarsening kinetics. This is in remarkable 
disagreement with the experimental finding. 
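
Reading a growth exponent off a double logarithmic plot amounts to a straight-line fit of log(length) against log(time). A minimal sketch follows; the function name and the synthetic data are assumptions made for illustration and are not the measured values of Hyde et al. (1995).

```python
import math

def power_law_exponent(times, lengths):
    """Least-squares slope of log(length) versus log(time)."""
    xs = [math.log(t) for t in times]
    ys = [math.log(length) for length in lengths]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    sxx = sum((x - x_mean) ** 2 for x in xs)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    return sxy / sxx

# Synthetic data only: domain size following L = 1.5 * t**0.22
times_h = [4.0, 24.0, 100.0, 500.0]
sizes_nm = [1.5 * t ** 0.22 for t in times_h]
exponent = power_law_exponent(times_h, sizes_nm)
print(f"growth exponent = {exponent:.2f}")
```

With noisy experimental data the same fit also yields an uncertainty of the slope, which is what a quoted error bar on the exponent would express.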

Fig. 13.52 Frequency of composition distribution as determined from 
(a) experimental analysis of Fe-45 at.% Cr alloy during annealing at 
773 K; (b) simulated structures. Pronounced decomposition is reflected 
by the appearance of two distinct maxima. (After Hyde et al. 1995.) 

Fig. 13.53 Growth of concentration amplitude during spinodal 
decomposition as determined (a) from the atom probe analysis; (b) by 
Monte Carlo simulation; (c) by the nonlinear Cahn-Hilliard continuum 
model. (After Hyde et al. 1995.) 

As has already been pointed out, a critical test for the spinodal mechanism 
would be the observation of growing composition amplitudes. To analyze these 
amplitudes, the frequency distribution of the compositions found for the 
subvolumes mentioned has been plotted in Fig. 13.52 for the experimental and 
MC-simulated data sets. Beginning with the homogeneous alloy represented by a 
unique probability maximum at the average composition, the decomposition 
leads to double-peaked frequency distributions in later stages; see, for example, 
the very clear curve after 500 h of annealing in the experimental data 
(Fig. 13.52a). Following an earlier suggestion by Langer, Hyde approximates these 
frequency distributions by two overlapping Gaussian peaks that represent the two 
phases of the heterogeneous microstructure (Hyde et al. 1995). The centers of 
these peaks mark two compositions μ₁ and μ₂, the difference of which quantifies 
the amplitude of compositional segregation. Despite considerable experimental 
fluctuations, the compositional amplitude of the atom probe data sets increases 
significantly during annealing, as does the amplitude of the MC-simulated 
data (Fig. 13.53a,b). By contrast, the conventional Cahn-Hilliard theory 
yields a very different prediction. In this latter case, the concentration amplitude 
is expected to increase in a step-like manner after a well-defined incubation 
period (see Fig. 13.53c). The deficits of the widely accepted Cahn-Hilliard 
continuum model in reproducing the experimentally determined behavior are quite 
remarkable. The authors of the study suggest that the obvious discrepancy may be 
due to interfacial transport, which is completely neglected in the Cahn-Hilliard 
theory. 
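
The two-Gaussian evaluation of the composition amplitude can be illustrated as follows. This sketch is not the fitting routine of Hyde et al. (1995): it fits the peaks directly to a list of subvolume compositions by expectation-maximization instead of fitting a histogram, and the function name, the synthetic sample, and all numerical values are assumptions.

```python
import math
import random

def fit_two_gaussians(values, n_iter=100):
    """Fit two overlapping Gaussians to a list of values by
    expectation-maximization; returns ((mu1, sigma1), (mu2, sigma2))
    with mu1 <= mu2."""
    lo, hi = min(values), max(values)
    mu = [lo + 0.25 * (hi - lo), lo + 0.75 * (hi - lo)]
    sigma = [0.25 * (hi - lo)] * 2
    weight = [0.5, 0.5]
    for _ in range(n_iter):
        # E-step: probability that each value belongs to peak 0
        resp = []
        for v in values:
            p = [weight[k] / sigma[k]
                 * math.exp(-0.5 * ((v - mu[k]) / sigma[k]) ** 2)
                 for k in (0, 1)]
            total = p[0] + p[1]
            resp.append(p[0] / total if total > 0.0 else 0.5)
        # M-step: update weight, center, and width of both peaks
        for k in (0, 1):
            r = resp if k == 0 else [1.0 - x for x in resp]
            r_sum = max(sum(r), 1e-12)
            mu[k] = sum(ri * v for ri, v in zip(r, values)) / r_sum
            var = sum(ri * (v - mu[k]) ** 2 for ri, v in zip(r, values)) / r_sum
            sigma[k] = max(math.sqrt(var), 1e-6)
            weight[k] = r_sum / len(values)
    peaks = sorted(zip(mu, sigma))
    return peaks[0], peaks[1]

# Synthetic subvolume compositions (atomic fraction of Cr), for illustration only.
random.seed(0)
sample = ([random.gauss(0.30, 0.04) for _ in range(1000)]
          + [random.gauss(0.65, 0.04) for _ in range(1000)])
(mu1, s1), (mu2, s2) = fit_two_gaussians(sample)
amplitude = mu2 - mu1
print(f"peak centers {mu1:.3f} and {mu2:.3f}, amplitude {amplitude:.3f}")
```

Repeating such a fit for each annealing time gives the amplitude as a function of time, which is the quantity compared between experiment, simulation, and continuum theory.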


13.2.3.7.2 Nucleation of the First Product Phase 

Owing to the technological trend toward miniaturization, the very early stages 
of reactive diffusion at thin-film interfaces have shifted into the focus of 
research. Frequently it has been argued that the thermodynamic driving force to form the first 
reaction product is usually so high that the critical thickness of nucleation ranges 
down to the size of a lattice constant, or even smaller. Consequently, nucleation 
should not be a rate-controlling step at all. However, the first evidence that this 
is not true came from calorimetric studies of reactive diffusion in metallic thin 
films. Although only one product phase forms, double-peaked heat releases were 
observed during the reactions in several binary systems (Bergmann et al. 2001; 
Roy and Sen 1992; Michaelsen et al. 1997). This experimental finding was inter- 
preted by Coffey et al. (1989) as a two-stage mechanism. In the first stage, nuclei 
form at the initial interface and quickly grow in lateral directions, whereas the 
second heat release is attributed to parabolic thickness growth by volume diffu- 
sion. The process of nucleation is still quite unclear. Several mechanisms have 
been proposed to explain the apparent reduction in driving force in the presence 
of a sharp composition gradient (Desré and Yavari 1990; Gusak 1990; Hodaj and 
Desré 1996). As a common feature, they predict a critical composition gradient 
that must be established by interdiffusion before nucleation of the first 
intermetallic compound becomes possible. However, experimental verification of 
this interpretation has been missing. 

A recent nanoanalytical study (Vovk et al. 2004) aimed to shed some 
light on the early nucleation stages. The reaction couple Al/Co was chosen as a 
particularly clear model system, since a double-peaked heat release has been 
reported in the literature, and furthermore the first product phase Al₉Co₂ is an 
exact stoichiometric compound with an existence range smaller than 1 at.%. The 
latter fact is expected to increase the influence of a composition gradient. For the 
experiments, bilayers of Co and Al, each 20-30 nm in thickness, were deposited 
on tungsten substrate tips. In Fig. 13.54 two examples of 3D reconstructions of 
the Co/Al interface are presented. Although the layers are deposited on curved 
surfaces, the initial interface appears practically flat, since the radius of curvature 
is still significantly larger than the width of the rectangular prism analyzed. This 
flat interface is preserved for short annealings, so that the earliest stages may be 
characterized by one-dimensional composition profiles determined normal to the 
interface as shown in Fig. 13.55. Due to the outstanding resolution, minor 
chemical modifications at the interface become noticeable. After a heat treatment 
of 5 min at 300 °C, the zone of interfacial mixing has broadened from 
about 1 nm in the as-prepared state to 3.5 nm, indicating a significant mixing of 
the components. However, the composition profile in this annealing state is well 
fitted by an error function. It clearly follows that only interdiffusion is 
observed, instead of the formation of a new intermetallic reaction product, which 
would appear as a plateau of almost constant composition. 

Fig. 13.54 (a), (b) Atomic reconstructions of the Al/Co interface (a) in 
the as-prepared state and (b) after annealing at 300 °C for 5 min. 
Positions of individual atoms are marked by gray-coded dots. (c) Sketch 
of specimen geometry. (After Vovk et al. 2004.) 

Nucleation of a new phase is first observed in some of the measurements after 
5 min annealing at 300 °C. In these cases, globular particles are detected at the 
interface toward the Al side (see Fig. 13.54b). The fact that these particles appear 
only in some of the measurements after identical annealing conditions 
emphasizes the statistical nature of a nucleation process. Furthermore, from the 
volume reconstructions it becomes clear that nucleation takes place at heterogeneous 
sites at the interface as sketched in Fig. 13.54(c). A composition profile across 
the newly formed phase (right-hand broken line of Fig. 13.54b) identifies the 
product as Al₉Co₂ (see Fig. 13.55), while a profile determined across the 
remaining interface (left-hand broken line in Fig. 13.54b) confirms the interdiffusion 
at a depth of 3-4 nm as described previously. In this way, atom probe tomography 
yields a clear demonstration that there is significant interdiffusion before 
nucleation of the product. According to the experimental analysis, the critical diffusion 
depth before nucleation starts amounts to 3.5 nm in the case of Al/Co at 300 °C. 

Fig. 13.55 (a) Composition profiles determined perpendicular to the 
initial Al/Co interface after several annealing stages. (b) Composition 
profiles determined along the left- and right-hand broken lines in 
Fig. 13.54(b), respectively. (After Vovk et al. 2004.) 

If the theoretical nucleation thickness of the intermetallic product is estimated 
by the usual balance between volume driving force and interfacial energy, a value 
of d = 2σ/g_v = 0.2 nm is predicted (Pasichnyy et al. 2005). In view of this value 
it is very surprising that the product Al₉Co₂ is formed only after the intermixed 
zone has already reached a minimum thickness of 3.5 nm. However, by a 
quantitative argument (Pasichnyy et al. 2005), it could be demonstrated that this 
behavior is fully consistent with a polymorphic nucleation mechanism. This 
mechanism assumes that the nucleus of the new phase is produced by transforming 
the lattice structure into that of the product phase without modifying the local 
composition. Since any nucleus must have a minimum size in order to overcome 
the nucleation barrier, the ideal stoichiometric composition can only be achieved 
in the center of the nucleus, while in the boundary region of the nucleus the 
composition must deviate due to the existing concentration gradient. This 
situation is sketched in Fig. 13.56, which shows that the composition range 
translates directly into a depth range. In other words, nucleation is only probable 
within a thin-layer fraction of the total diffusion zone, and the thickness of this layer 
shrinks with increasing concentration gradient. In consequence, high 
concentration gradients will prevent nucleation. 

Fig. 13.56 Schematic Gibbs free energy curves (top) and idealized 
linear composition profile (bottom). Polymorphic transformation leads 
to reduction of energy only within the gray shaded composition range, 
which is established only in a thin layer within the interdiffusion zone. 
x is the position of the nucleus within the diffusion zone. 

For the quantitative calculation, the free energy required to form a nucleus 
was calculated by integrating the difference between the two Gibbs free energies 
(see Fig. 13.56) over the volume of the nucleus. For clarity, the nucleus may be 
approximated by a cube, so that the rather simple formula in Eq. (77) is obtained, 
in which 2R, x_c, and σ are the width of the cube, the position of the center of the 
nucleus within the diffusion zone, and the specific interfacial energy, respectively. 

$$\Delta G(R, x_c) = 24R^2\,\sigma + 4R^2 \int_{x_c-R}^{x_c+R} \Delta g\bigl(c(x)\bigr)\,\mathrm{d}x \qquad (77)$$
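
To make the effect of the gradient in Eq. (77) concrete, the expression can be evaluated numerically for a toy model. Everything specific in this sketch is an assumption chosen for illustration: the parabolic form of the free-energy difference, the linear clamped composition profile, the parameter values (in eV and nm), and the choice to center the nucleus at the stoichiometric composition. It does not reproduce the thermodynamic functions used by Pasichnyy et al. (2005).

```python
def delta_g(c, c_s=2.0 / 11.0, g0=6.0, k=400.0):
    """Hypothetical free-energy change per volume (eV/nm^3) of the
    polymorphic transformation at local Co fraction c; it is negative
    only in a narrow range around the stoichiometric value c_s."""
    return -g0 + k * (c - c_s) ** 2

def nucleus_energy(R, width, sigma=0.6, c_s=2.0 / 11.0, steps=400):
    """Eq. (77) for a cube of edge 2R centered where c(x) = c_s, inside a
    linear composition profile that rises from 0 to 1 over `width` (nm)."""
    x_c = c_s * width
    dx = 2.0 * R / steps
    integral = 0.0
    for i in range(steps):  # midpoint rule over [x_c - R, x_c + R]
        x = x_c - R + (i + 0.5) * dx
        c = min(1.0, max(0.0, x / width))
        integral += delta_g(c) * dx
    return 24.0 * R ** 2 * sigma + 4.0 * R ** 2 * integral

radii = [0.02 * n for n in range(1, 36)]  # half-widths from 0.02 to 0.70 nm
summary = {}
for w in (3.0, 3.5, 4.0):
    energies = [nucleus_energy(R, w) for R in radii]
    monotonic = all(b > a for a, b in zip(energies, energies[1:]))
    negative = any(e < 0.0 for e in energies)
    summary[w] = (monotonic, negative)
    print(f"width {w} nm: monotonic increase = {monotonic}, "
          f"energy gain possible = {negative}")
```

With these assumed parameters the toy model shows the same qualitative sequence as the steeper-to-shallower gradients discussed in the text: a monotonically rising energy for the narrowest zone, a shallow local minimum at positive energy for the intermediate one, and a negative minimum for the widest one. The absolute barrier heights of the sketch carry no physical meaning.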


Numerical results of Eq. (77) are presented in Fig. 13.57 for three different widths 
of the diffusion zone: 3.0, 3.5, and 4.0 nm. In these plots, the size of the nucleus 
is expressed by N, the number of atoms contained. The concentration c_x 
represents the composition at the center of the nucleus. For a diffusion width of 3.0 nm, 
the Gibbs free energy to form a nucleus still increases monotonically with its size. 
Thus, nucleation is forbidden. At a width of 3.5 nm the situation has already 
changed slightly. A weak local minimum appears in the energy landscape. For a 
width of 4.0 nm, this minimum has become more pronounced and its value 
becomes negative, which means that a particle of the product phase may now be 
formed with a gain of energy. A nucleation barrier of 25 kT is determined from 
the energy landscape, which is quite a realistic value to obtain a reasonable 
nucleation rate. If the critical diffusion width is defined as that thickness for which 
a particle of the new phase becomes stable, this critical width must be slightly 
greater than 3.5 nm, in remarkably good agreement with the experimental 
observation by atom probe tomography. The same calculation was also performed for 
other nucleation mechanisms suggested. For example, the calculation for the 
so-called transversal mode (Desré and Yavari 1990) yielded a critical diffusion width 
smaller than 1.0 nm, in obvious contradiction to the measurements. In 
consequence, we can conclude that the atom probe analysis yielded convincing 
evidence for an interdiffusion process taking place before nucleation of the first 
product, although the zone of mixing ranges down to only a few nanometers. In 
this way it was possible to determine the critical diffusion depth with an accuracy 
better than 1 nm, which is important to distinguish between different nucleation 
modes. The transversal mode, which has been discussed for more than a decade, 
can be excluded for the Al/Co reaction couple. 

Fig. 13.57 Surface ΔG(N, c_x) for the polymorphic transformation of a 
cubic volume into the Al₉Co₂ phase inside diffusion fields of width: 
(a) 3.0 nm, (b) 3.5 nm, (c) 4.0 nm. Thermodynamic functions 
were considered for a temperature of 573 K. (After Pasichnyy et al. 
2005.) 


13.2.3.7.3 Diffusion in Nanocrystalline Thin Films 

In nanocrystalline matter with grain sizes down to about 10 nm, the volume 
fraction attributed to grain boundaries (GBs) can easily exceed 50%. Since there is 
vast experimental evidence that the diffusion along GBs is much faster than bulk 
diffusion (Kaur 1989), atomic transport in nanocrystalline thin-film materials is 
expected to be accelerated significantly by the presence of GBs. With further 
decreasing grain size, an additional necessary topological feature of the boundary 
arrangement, the so-called triple line, may affect atomic transport. Along a 
triple line, three GBs meet, forming a one-dimensional defect. The structure of a 
triple junction is expected to differ considerably from that of an ordinary GB. 
Presumably, it will be much more disordered, so that an even faster transport rate 
can be assumed. However, up to now only one measurement of triple line 
diffusion by analytical electron microscopy has been reported (Bokstein et al. 2001). 
The authors interpret their results in terms of a much faster diffusion rate along 
the triple line. The difficulty of the measurement stems from the small effective 
cross-section of the defect, which requires chemical analysis of the highest 
possible resolution. In the 2D projection of a TEM sample the triple line is 
always overlapped by grain volume, so the measured level of segregation is much 
reduced. 

In recent atom probe experiments (Schmitz et al. 2006), Au (15 nm in thick- 
ness) and Cu layers (25 nm in thickness) were deposited on top of tungsten tips. 
In order to slow down the intermixing of the soluble metals Cu and Au, a thin Co 
barrier (6 nm thick) was inserted in between. Tips were annealed in a UHV fur- 
nace at 295 °C for 30 min. Due to the strong substrate curvature, thin films de- 
posited on substrate tips tend to be very fine-grained, with grain sizes down to 5 
nm (Lang and Schmitz 2003). Thus, these specimens are ideal candidates in 
which to observe the transport along topological singularities of the GB arrange- 
ment. A typical field ion micrograph of the upper Cu layer is shown in Fig. 
13.58(a). Discontinuities in the structure of concentric bright rings mark the pre- 
sence of grain boundaries. A few of them are marked by broken lines. Obviously, 
a polycrystalline structure with a grain size of about 15 nm has formed. Triple 
junctions are also seen that are aligned approximately parallel to the tip axis, so 
that they can be analyzed in the main measurement direction; “T” marks a par- 
ticularly clear one. 

Fig. 13.58 (a) Field ion micrograph of a sputter-deposited Cu layer. 
Grain boundaries are marked by broken lines, a triple junction by “T”. 
(b) Two-dimensional composition map in gray scale representation 
determined as a cross-section through a triple line joining three Cu 
grains with their boundaries A, B, and C. (After Schmitz et al. 2006.) 

By selecting such areas for analysis, the local concentration field around the 
junction could be measured. The geometry of the three GBs and an example of 
a 2D composition map are shown in Fig. 13.58(b). Since the atom probe delivers 
real three-dimensional data, 2D composition maps can be determined in any 
arbitrary direction after the measurement. In the example shown, the 
map is aligned perpendicular to a triple line. It is seen quite clearly that the line 
is locally enriched in Au and also that the three GBs are distinguished by a 
measurable Au content, though at a somewhat lower level than inside the junction. 
The measured concentration fields around the triple line were evaluated by 
means of the approximate solution of Klinger et al. (1997), which represents an 
extension of the established Fisher model of GB diffusion (see Chapter 5). In the 
case of the triple line, the atomic transport may be understood as a three-level 
cascade process instead of only two levels in the case of ordinary GB diffusion. 
First, the material is transported along the triple line. Second, leakage into the 
three related boundaries takes place, and finally, atoms are drained from the GB 
into the bulk grain volume. Quantitatively the concentration field is described by 
Eq. (78). 








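$$c(x, y, z, t) = c_0 \cdot \exp\!\left(-\frac{\sqrt{3}\,\sqrt[4]{\delta D_{GB}}\;\sqrt[8]{4D_B/\pi t}}{\sqrt{D_{TJ}\,s}}\; z\right) \times \exp\!\left(-\frac{\sqrt[4]{4D_B/\pi t}}{\sqrt{\delta D_{GB}}}\; y\right) \times \left[1 - \operatorname{erf}\!\left(\frac{x}{2\sqrt{D_B t}}\right)\right] \qquad (78)$$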





In this equation the x, y, and z coordinates are directed perpendicular to a 
representative GB, along a grain boundary perpendicular to the triple line, and along 
the triple line, respectively (see the axes in Fig. 13.58b). The meaning of the 
indices of the three diffusion constants D_i is self-explanatory. The effective 
thickness δ of the GB may be estimated as about 1 nm and the effective cross-section 
s of the triple junction is approximated by s = δ². Conveniently, the right-hand 
side of Eq. (78) separates into three independent factors, one for each space coordinate. 

Since the Au content within the grain volume falls short of the noise level of 
the atom probe method, the authors used the known bulk diffusion coefficient 
to interpret their data. Evaluating the compositional slope along the y axis (see 
Fig. 13.59a), the GB coefficient D_GB can be determined using the second term 
on the RHS of Eq. (78). Then, by means of the first term, the slope along the z 
axis (see Fig. 13.59b) yields the diffusion coefficient within the triple junction. 
From the measurements on Cu/Au films the following numerical results were 
obtained: 

Fig. 13.59 Au diffusion in Cu at 295 °C: Normalized concentration 
profiles determined along grain boundary (a) and triple junction (b) as 
determined by atom probe tomography. (After Schmitz et al. 2006.) For 
details see the text. 

Thus, at the diffusion temperature of 295 °C, the diffusion coefficient of the triple 
line is indeed about 5600 times larger than the GB diffusion coefficient. At present, 
atom probe tomography is the only method allowing such detailed measurements 
in complex nanocrystalline microstructures. 
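
The two-step evaluation described above (first the GB coefficient from the decay along y, then the triple-junction coefficient from the decay along z) can be sketched as follows. The sketch assumes exponential decays of the Fisher type, with decay constant (4D_B/πt)^(1/4)/(δ D_GB)^(1/2) along the GB and √3 (δ D_GB)^(1/4) (4D_B/πt)^(1/8)/(s D_TJ)^(1/2) along the triple line. The function names and all input numbers are hypothetical and are not the results of Schmitz et al. (2006).

```python
import math

def gb_coefficient(slope_y, d_bulk, t, delta):
    """D_GB from the decay constant slope_y = -d(ln c)/dy along a GB."""
    q = (4.0 * d_bulk / (math.pi * t)) ** 0.25
    return (q / slope_y) ** 2 / delta

def tj_coefficient(slope_z, d_gb, d_bulk, t, delta, s):
    """D_TJ from the decay constant slope_z = -d(ln c)/dz along the triple line."""
    q = (4.0 * d_bulk / (math.pi * t)) ** 0.25
    return 3.0 * math.sqrt(delta * d_gb) * q / (s * slope_z ** 2)

# Hypothetical inputs in nm and s (not the measured values).
t, delta = 1800.0, 1.0            # 30 min anneal, GB width of about 1 nm
s = delta ** 2                    # triple-junction cross-section
d_bulk, d_gb, d_tj = 1e-8, 1e-2, 56.0

# Forward step: decay constants that these coefficients would produce.
q = (4.0 * d_bulk / (math.pi * t)) ** 0.25
slope_y = q / math.sqrt(delta * d_gb)
slope_z = math.sqrt(3.0 * math.sqrt(delta * d_gb) * q / (s * d_tj))

# Backward step: recover the coefficients from the two slopes.
d_gb_fit = gb_coefficient(slope_y, d_bulk, t, delta)
d_tj_fit = tj_coefficient(slope_z, d_gb_fit, d_bulk, t, delta, s)
print(f"D_GB = {d_gb_fit:.3g} nm^2/s, D_TJ = {d_tj_fit:.3g} nm^2/s")
```

The order of the two steps matters: the triple-junction coefficient depends on the GB coefficient, so an error in the y slope propagates into the result for the triple line.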


13.2.3.7.4 Thermal Stability of GMR Sensor Layers 

In recent years reading heads based on the giant magnetoresistance effect (GMR) 
have made possible a dramatic increase in magnetic recording density (see 
Chapter 14, Section 14.4). Since the period of the required multilayers ranges down to 
a few nanometers, spatially resolved analysis is a challenge for atom probe 
tomography, too. Co/Cu and Cu/Ni₇₉Fe₂₁ are two of the metallic systems most often 
used. The soft magnetic alloy Ni₇₉Fe₂₁ (Permalloy, Py) is especially suited for 
position or orientation sensors, since the effect of hysteresis is very low. For many 
potential applications, such as angular sensors in motor vehicles, the thermal 
stability of the device is an important issue. It is known that the Cu/Py system is 
much more sensitive to thermal load than Cu/Co. With the former, the GMR 
amplitude already degrades at temperatures of 150 °C, while for the latter the 
amplitude remains stable up to 400 °C (Hecker et al. 2002). Different 
mechanisms have been proposed to be responsible for GMR degradation: van Loyen 
et al. (2000) argue that at least two effects should contribute, namely grain 
boundary diffusion and inter- or demixing at the interface. Hecker et al. (2002) conclude 
that the alloying tendency of Ni and Cu above 250 °C controls the decay of the 
GMR in the Py systems. 

Fig. 13.60 Composition profiles of a Cu/NiFe multilayer determined 
perpendicular to the layer interfaces. (a) as-prepared state, (b) state 
after 20 min of annealing at 350 °C. (After Ene et al. 2005.) 

In view of potential technical applications it is important to identify the 
mechanisms of thermal reaction. In an experimental study (Ene et al. 2005) 
Cu(2.5 nm)/Py(2.5 nm) multilayers were deposited onto substrate tips and annealed in a UHV 
furnace. A typical volume reconstruction of an as-prepared state is shown in Fig. 
13.45, which we have already discussed above. The depth resolution is sufficient to 
distinguish individual (111) planes of Cu, which are used to calibrate the length 
scale of the reconstruction. Concentration profiles determined along the tip axis 
direction are presented in Fig. 13.60 for the as-prepared state (a) and the 350 °C/ 
20 min annealed state (b). The resolution of analysis, which reproduces sharp 
transitions at the interfaces from almost 100 to 0 at.% Cu, is noteworthy. It would 
not be possible to achieve such selectivity with analytical TEM. If the profiles 
of the as-prepared state are compared to those after the annealing at 350 °C, 
no difference is seen at first sight. In particular, the integrity of the thin films 
is preserved and no grain boundary effects are detected. This is all the more 
striking, since the magnetoresistivity has already vanished completely at lower 
temperatures. 

Fig. 13.61 (a) Temperature dependence of the concentration slope at 
the interfaces (left-hand scale) and the Ni content of the Cu layers 
(right-hand scale), both after 20 min of isochronal annealing. The 
broken lines are merely guides to the eye. (b) Arrhenius representation of the 
interdiffusion coefficients determined. (After Ene et al. 2005.) 

Fig. 13.62 Grain boundary diffusion in nanocrystalline Ni/Py multilayers 
after 30 min of annealing at 400 °C. (After Ene et al. 2005.) 

However, due to the outstanding sensitivity of TAP, it is possible to determine 
even the smallest modifications of the interfacial chemistry. In this case, the slope 
of the concentration profile at the interfaces was used as a characteristic 
parameter (for a definition, see the insert in Fig. 13.61a). Clearly, during the annealing 
the slope decreases, starting with its initial maximum value of 1.5 nm⁻¹ (Fig. 
13.61a). At the temperature of GMR breakdown (250 °C) it has already fallen to 
1 nm⁻¹. Such a variation is not negligible, because in view of the small layer 
thickness a considerable zone at the interface has alloyed, which is obviously 
already sufficient to degrade the GMR effect. This observation by TAP has been 
found to be in complete agreement with measurements of electrical resistivity 
(Hecker et al. 2002), which had shown that the base resistance of the multilayer, 
indicating the complete alloying of the layers, increases significantly only at 
temperatures higher than 250 °C, at which the magnetoresistance has already 
vanished. In addition, the decrease in the interfacial slope has also been analyzed in 
terms of a volume diffusion coefficient. Measurements at different temperatures 
yielded the Arrhenius plot presented in Fig. 13.61(b), where it is compared with 
literature diffusion coefficients for Cu, Ni, and GB diffusion in Py. The activation 
energy is remarkably small, even smaller than that of GB diffusion. This suggests 
that the observed short-range diffusion at the interfaces is induced by 
nonequilibrium point defects. 
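
How an interfacial slope can be turned into a diffusion coefficient, and a set of coefficients into an activation energy, may be sketched as follows. The conversion assumes error-function profiles, for which the inverse squared slopes before and after annealing differ by 4πDt; this is an assumption of the sketch, not necessarily the evaluation used by Ene et al. (2005). The Arrhenius data are synthetic, and the function names are hypothetical.

```python
import math

K_B = 8.617333e-5  # Boltzmann constant in eV/K

def diffusivity_from_slopes(slope_0, slope_t, t):
    """Interdiffusion coefficient from the flattening of an interface,
    assuming error-function profiles: 1/s_t**2 - 1/s_0**2 = 4*pi*D*t."""
    return (1.0 / slope_t ** 2 - 1.0 / slope_0 ** 2) / (4.0 * math.pi * t)

def arrhenius_fit(temperatures, diffusivities):
    """Least-squares fit of ln D = ln D0 - Q / (K_B * T); returns (D0, Q)."""
    xs = [1.0 / (K_B * T) for T in temperatures]
    ys = [math.log(D) for D in diffusivities]
    n = len(xs)
    x_mean, y_mean = sum(xs) / n, sum(ys) / n
    slope = (sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
             / sum((x - x_mean) ** 2 for x in xs))
    return math.exp(y_mean - slope * x_mean), -slope

# Slope falling from 1.5 to 1.0 nm^-1 during 20 min (values quoted in the text).
d_example = diffusivity_from_slopes(1.5, 1.0, 20 * 60.0)
print(f"D is roughly {d_example:.2e} nm^2/s")

# Synthetic Arrhenius data (assumed D0 and Q, for illustration only).
temps_k = [423.0, 473.0, 523.0, 573.0, 623.0]
d_synth = [1e-3 * math.exp(-0.4 / (K_B * T)) for T in temps_k]
d0_fit, q_fit = arrhenius_fit(temps_k, d_synth)
print(f"fitted activation energy = {q_fit:.2f} eV")
```

Under the stated assumption the quoted slope change corresponds to a coefficient of a few 10⁻⁵ nm² s⁻¹, which illustrates how short the diffusion length is that already suffices to flatten the interface noticeably.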

It must be pointed out that the thermal reaction described so far is not 
influenced by neighboring GBs; only volume diffusion effects are observed, though 
on a very short length scale. With the help of FIM images like that shown in 
Fig. 13.58(a), GBs may be selected for the analysis and investigated for 
segregation, which would point to GB transport. Indeed, significant GB transport could 
be observed as shown in Fig. 13.62, but only at temperatures above 300 °C. Ni 
diffuses into the grain boundaries of Cu, and in turn Cu into the grain 
boundaries of Py. After annealing for 30 min at 400 °C, the Ni segregation level in Cu 
amounts to 25 at.% and that of Cu in Py to about 55 at.%. While the former is far 
too low to introduce a ferromagnetic short circuit across the Cu spacer layer, the latter 
would be sufficient to fragment the ferromagnetic layers into isolated domains. 
However, at realistic temperatures leading to the degradation of the amplitude 
of the GMR effect, no GB segregation was detected at all. Therefore, based on 
the TAP measurements, GB transport could be ruled out as a potential 
mechanism. 


13.2.4 Future Development and Outlook 


The purpose of this chapter has been to demonstrate how high-resolution 
microscopic imaging and analysis are used to understand the atomic structure and 
dynamics of defects in materials. Particular emphasis has been placed on 
interphase boundaries and surfaces, since these are important to an understanding 
of solid-state reactions and, by utilizing these reactions, to the development of the 
structure and properties of materials. With the methods presented it is possible 
to image, analyze, and even manipulate individual atoms, and furthermore to 
study their cooperative motion even in situ. Having reviewed so many impressive 
experimental examples, one may ask whether there is still room for any further 
development at all. However, a closer look at the severe restrictions with which 
each of the three methods is currently faced will help us to define important 
starting points for essential improvements. 

In the case of atom probe tomography, the volume investigated and thus the 
number of identified atoms is so small, let us say 10⁶, that statistics limits the 
accuracy of any composition measurement in general to about 0.1 at.%. The 
microstructure of materials is often complex, with at least one characteristic length 
scale in the micrometer range. The probability of finding a certain defect on this 
length scale within the analysis volume of 10 × 10 × 100 nm³ is negligible. Last 
but not least, all the examples presented dealt with metallic specimens, while 
modern applications often require functional materials such as ceramics or 
semiconductors. How can the method, which is a classical tool of physical metallurgy, 
overcome the barrier toward other classes of materials? 
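
The statistical limit mentioned above can be estimated from counting statistics alone. The sketch below assumes simple binomial sampling of the detected atoms and ignores all other error sources, such as detection efficiency or peak overlap in the mass spectrum; the function name and the chosen atom numbers are illustrative.

```python
import math

def composition_std(c, n_atoms):
    """Standard deviation of a measured atomic fraction c from counting
    statistics alone (binomial sampling of n_atoms detected atoms)."""
    return math.sqrt(c * (1.0 - c) / n_atoms)

for n_atoms in (10**4, 10**5, 10**6):
    err = composition_std(0.5, n_atoms)
    print(f"N = {n_atoms:>7d}: +/- {100 * err:.3f} at.%")
```

The error shrinks only with the square root of the number of atoms, so a tenfold gain in accuracy requires a hundredfold larger data set. This is one reason why larger analysis volumes and higher data rates are discussed as development goals.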

A possible way to achieve a larger analyzed volume would be to elevate the volt- 
age supplied to the tip. The radius of the tip scales with the voltage and so does 
the lateral dimension of the analyzed volume. However, this strategy finds its nat- 
ural limit for practical reasons, since there is no way to produce a voltage pulse 
sufficiently short and with appropriate frequencies for amplitudes larger than 
5 kV. Another possibility is to increase the aperture angle of the instrument, 
which can be achieved by reducing the flight distances. This in turn requires a 
very accurate time-of-flight measurement. Nevertheless, equipped with modern 
time-to-digital converters, very recent tomographic atom probes have been put 
into operation with an aperture angle of up to ±35°. This leads to a lateral size 
range of analysis of 40-80 nm, as reported at relevant conferences (Deconihout 
et al. 2006; Stender et al. 2006). Together with the volume, the total number of 
atoms to be evaluated increases dramatically. A typical data set will then consist 
of several million atoms. To measure such a large number of events in a reason- 
able period of time, pulse frequencies need to be increased significantly, which is 
difficult to achieve for the high voltage needed in a conventional atom probe. 

A very intelligent method of circumventing this technical problem is the 
so-called scanning atom probe, which was suggested by Nishikawa and Kimoto 
(1994). By placing a tiny extraction electrode of some 10 µm 
bore size close to the tip (see Fig. 13.63), the voltage required to obtain a given 
evaporation field strength is reduced by a factor of 2 to 4. In consequence, much 
lower voltage pulses, which can be produced with repetition rates higher than 
100 kHz, are sufficient. After related technical problems were solved, the concept 
has recently been put into operation. Meanwhile, these instruments are 
functioning well and are even available commercially (Kelly et al. 2004). Their efficiency of 
analysis is impressive. A data rate higher than 10 000 atoms s⁻¹ is obtainable and 
data sets with more than 10° atoms have been achieved routinely. Besides the 
large aperture angle and high data rate, this most recent branch of atom probe 
tomography has an important advantage regarding specimen preparation. As in- 
dicated by Fig. 13.63, instead of a single tip made from a supporting wire, an ar- 
ray of microtips may be used. Since the requirement for a large aspect ratio is 
considerably relaxed by the electrode geometry, these microtips may be produced 
conveniently by sputtering through a suitable mask. Very recently, InAs nano- 
wires, grown naturally by using nanopatterned Au catalysts, could be analyzed 
by a tomographic atom probe equipped with such a microelectrode (Perea et al. 
2006). 

Fig. 13.63 Principle of a scanning atom probe: an extraction electrode 
with an open diameter of about 10 µm is placed close to a microtip to 
concentrate the electric field. By post-acceleration between the 
ring-shaped electrode and the detector entrance, mass resolution is 
improved. (See Kelly et al. 2004.) 

Currently several groups are experimenting with pulsed laser irradiation to 
extend APT to nonmetallic materials with a conductivity below 10² Ω⁻¹ cm⁻¹, 
which has been the important limit for high-voltage pulsed evaporation (Melmed 
et al. 1981). The idea dates back to early work by Tsong and coworkers (Kellogg 
and Tsong 1980). Presumably, it was due to the primitive state of early 
laser systems that this idea was not explored to its full extent and has 
seen a renaissance only recently, after the emergence of the femtosecond laser 
technique. Owing to the very short pulse width of these lasers, the evaporation 
may even be assisted by a direct field produced by the light wave instead of the 
mere thermal pulsing postulated in the early laser work. Evidence for this has 
been collected by the atom probe group in Rouen (Gault et al. 2005; Vella et al. 
2006). Other authors opposed this interpretation (Cerezo et al. 2006). Be that as 
it may, several very recent reports have documented that very “difficult” 
materials like oxide ceramics can be analyzed reliably by laser-assisted atom probe 
tomography (Oberdorfer et al. 2006; Thompson et al. 2006; Vurpillot et al. 2006). 
Indeed, this signals an important breakthrough, which will certainly help to bring 
atom probe analysis toward a broader acceptance in materials science. 

The purpose of the section on HRTEM was to demonstrate how static and in- 
situ HRTEM are used to understand the atomic structure and dynamics of de- 
fects in materials. It was seen that in-situ HRTEM is able to reveal the projected 
atomic structure of defects and their mechanisms of motion involving collective 
atom processes, but not individual atoms or jumps.1) Since most dynamic
processes at interphase boundaries appear to involve collective atom motion, e.g.,
fluctuations at order-disorder interfaces and growth of precipitate interfaces by a
TLK mechanism, and HRTEM is generally able to resolve these features, this is
not a great limitation. Hence, in-situ HRTEM is able to provide a wealth of
information about detailed atomic processes that occur in materials. This situation will
only continue to improve, as rapid progress is being made in increasing the
spatial, temporal, and chemical resolution of HRTEMs, the capability of imaging
devices, storage and processing, and precise computer control of the microscope.

1) Of course, HRTEM is able to resolve individual atoms and their motion on
surfaces in projection.

More generally in the field of electron microscopy, recent developments include 
the introduction of dedicated corrector optics, in order to eliminate the spherical 
and chromatic aberrations of the objective lens, and thereby to improve the reso- 
lution of the instrument significantly. This allows straightforward interpretation 
of high-resolution images. In analytical electron microscopy, both the introduc- 
tion of monochromators to narrow the energy spread of the electron beam, and 
that of in-column energy filters, have been important recent developments. With 
conventional optics, the spatial resolution of energy-filtered images is limited 
to about 1 nm owing to the nature of inelastic scattering and the chromatic 
aberration of the objective lens. Thus, the combination of chromatic aberration 
correction and energy filtering is an appealing approach to the ultimate goal of 
chemical mapping with atomic resolution in a TEM, while monochromation and 
spherical aberration correction can provide atomic-level imaging and chemistry in 
an STEM. 

Another important development is taking place in electron tomography. By ap- 
plying stereographic methods and powerful image reconstruction, the former re- 
striction to two-dimensional image projections, from which electron microscopy 
has suffered in the past, can be overcome. Tomography is already an established 
practice in the characterization of complex biological molecules with electron 
microscopy. However, inorganic materials are crystalline in most cases, so that 
strong diffraction contrast effects are present, which complicate easy application 
of stereographic methods. Overcoming this problem by a combination of Z- 
contrast or energy filtering imaging and stereography is quite promising and 
excellent examples are beginning to appear in the literature. It is reasonable to 
expect that it will be possible to image the 3D structures of materials with atomic 
resolution in the near future. 

In scanning probe microscopy, a remarkable breakthrough is not so much
expected in terms of improving the ultimate resolution limit. Instead, important
progress is currently being made in the development of new dedicated
nanoprobes which realize the idea of a “lab on top of the tip” in many different fields,
such as electrochemistry, magnetism, thermometry, or tribology. Continuously
improved methods of microfabrication will produce even smaller cantilevers and
also faster piezoscanner systems, which allow higher resonance frequencies
and much faster scanning rates. In this way, in-situ measurements with the
AFM may become a viable option for studying time-dependent processes in
biological objects or nanostructured materials, similarly to what we have discussed
for HRTEM in this section.
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14.1 
Soft and Hard Magnets 
Roland Grössinger 


14.1.1 
What do “Soft” and “Hard” Magnetic Mean? 


A simple classification of ferromagnetic materials can be made on the basis of
their coercivity. Soft magnetic materials are characterized by a small area of the
hysteresis loop and a low coercivity (Hc < 1000 A m⁻¹) whereas hard magnetic
materials (permanent magnets) show a high coercivity (Hc > 30 kA m⁻¹). Figure
14.1 shows typical hysteresis loops of soft and hard magnetic materials.

Applications for soft magnetic materials include electromagnets, motors,
transformers and relays, and electromagnetic shielding. Applications for hard
magnetic materials are relays, stepper motors, generators, etc. Semihard materials are
used for data storage applications.
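
The coercivity-based classification can be written down as a small helper. The following is only a minimal sketch: the two thresholds are the ones quoted in the text, while the function name and the choice to label the whole intermediate range "semihard" are assumptions made for illustration.

```python
SOFT_LIMIT_A_PER_M = 1_000.0    # Hc below this value: soft (threshold from the text)
HARD_LIMIT_A_PER_M = 30_000.0   # Hc above this value: hard (threshold from the text)


def classify_by_coercivity(hc_a_per_m: float) -> str:
    """Classify a ferromagnetic material by its coercivity Hc given in A/m."""
    if hc_a_per_m < 0:
        raise ValueError("coercivity must be non-negative")
    if hc_a_per_m < SOFT_LIMIT_A_PER_M:
        return "soft"
    if hc_a_per_m > HARD_LIMIT_A_PER_M:
        return "hard"
    # Assumption: everything between the two thresholds is labeled semihard.
    return "semihard"


if __name__ == "__main__":
    # 5 000 and 800 000 A/m are purely illustrative values.
    for hc in (0.4, 25.0, 5_000.0, 800_000.0):
        print(f"Hc = {hc:>9.1f} A/m -> {classify_by_coercivity(hc)}")
```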

The demand for saving energy in electrical engineering and the continuing
increase in operating frequencies in electric circuits require a constant
improvement of the quality of soft and hard magnetic materials. In recent years, in
addition to the standard materials (soft: Fe-Si, soft magnetic ferrites), amorphous and
nanocrystalline ferromagnetic ribbons and wires have been considered very
promising for this purpose (see, e.g., Luborsky 1983; Vazquez 1994; Herzer
1997). Besides the standard hard magnetic materials (Alnico, ferrites),
rare-earth-based materials (Sm-Co, Nd-Fe-B) and also nanocrystalline hard
magnetic materials are used nowadays (for a survey see Schultz and Müller 1998;
Grössinger et al. 2003, 2004).

Fig. 14.1 Hysteresis loops of ferromagnetic soft and hard magnetic materials.

Fig. 14.2 Overview of soft and hard magnetic alloys: saturation
polarization achieved as a function of the coercivity. Note that the
coercivities of the different materials range over more than seven orders of
magnitude.

Fig. 14.3 Permeability of soft magnetic materials as a function of the saturation polarization.

As an overview of the magnetic properties of soft and hard magnetic alloys, Fig. 
14.2 shows the saturation polarization achieved as a function of the coercivity. 
Note that the coercivity of the different materials ranges over more than seven or- 
ders of magnitude. Figure 14.3 shows the permeability of soft magnetic materials 
as a function of the saturation polarization. 


14.1.1.1 Intrinsic Properties Determining the Hysteresis Loop (Anisotropy, 
Magnetostriction) 


14.1.1.1.1 Anisotropy 
The magnetic properties of a ferromagnetic material are strongly influenced by 
various contributions of anisotropy. The magnetic anisotropy is the dependence 
of the internal energy (dF = HdM) on the direction of the spontaneous magneti- 
zation. This energy term is called the magnetic anisotropy energy. Different kinds 
of anisotropy exist. 
• Shape anisotropy is macroscopically important because it describes the
  macroscopic stray field: $H_s = -D M_s$ ($D$ = demagnetizing factor); it also
  plays a role inside a crystalline material where the local stray field acts
  between the grains, influencing the coercivity. In the case of magnetic
  powders the local shape anisotropy of the grains has to be considered. The
  shape anisotropy also determines the working points in all kinds of magnet
  circuits.

• The origin of the crystalline anisotropy is the spin-orbit coupling. The
  coupling is given by the lowest energy condition of the direction of the spin
  moment and the field produced by the orbital contribution of the electron
  wavefunction. For a crystalline hexagonal system such as cobalt, the free
  energy can be expressed by expanding F in a series of powers of
  $\sin^2\theta$ [Eq. (1)].

  $F = K_0 + K_1 \sin^2\theta + K_2 \sin^4\theta + \cdots$   (1)

  For hard magnetic materials especially, this is the most important
  contribution. For cubic crystals such as iron and nickel it is more convenient
  to express F in terms of the direction cosines of the internal magnetization
  with respect to the three cube edges. Thus the expression is given by Eq. (2).

  $F = K_0 + K_1 (\cos^2\alpha_1 \cos^2\alpha_2 + \cos^2\alpha_2 \cos^2\alpha_3 + \cos^2\alpha_3 \cos^2\alpha_1) + K_2 \cos^2\alpha_1 \cos^2\alpha_2 \cos^2\alpha_3 + \cdots$   (2)

• Induced anisotropy is important for stress- or field-annealed soft magnetic
  materials; in this case $K_u$ depends on the microstructure but also on the
  elastic constants (Kraus and Duhaj 1990; Herzer 1994). Through the
  magnetoelastic energy $\tfrac{3}{2}\lambda_s\sigma$ it influences the shape of
  the hysteresis loop, which is important for sensor applications.
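
The two series expansions for the anisotropy energy, Eqs. (1) and (2), are easy to evaluate numerically. The sketch below is illustrative only: the function names are hypothetical, the series are truncated after the second-order constant, and the normalized constants in the usage lines are not material data.

```python
import math


def uniaxial_energy(theta: float, k0: float = 0.0, k1: float = 0.0, k2: float = 0.0) -> float:
    """Eq. (1): F = K0 + K1 sin^2(theta) + K2 sin^4(theta), theta in radians."""
    s2 = math.sin(theta) ** 2
    return k0 + k1 * s2 + k2 * s2 ** 2


def cubic_energy(direction, k0: float = 0.0, k1: float = 0.0, k2: float = 0.0) -> float:
    """Eq. (2) for a magnetization direction given as a vector along the cube edges."""
    x, y, z = direction
    norm = math.sqrt(x * x + y * y + z * z)
    if norm == 0.0:
        raise ValueError("direction must be a non-zero vector")
    # Squared direction cosines with respect to the three cube edges.
    a1, a2, a3 = (x / norm) ** 2, (y / norm) ** 2, (z / norm) ** 2
    return k0 + k1 * (a1 * a2 + a2 * a3 + a3 * a1) + k2 * a1 * a2 * a3


if __name__ == "__main__":
    # Normalized constants (K1 = 1, K2 = 0): compare the three principal cubic directions.
    for name, vec in (("[100]", (1, 0, 0)), ("[110]", (1, 1, 0)), ("[111]", (1, 1, 1))):
        print(name, cubic_energy(vec, k1=1.0))
```

With K2 neglected, the cubic expression gives 0, K1/4 and K1/3 along [100], [110] and [111], so a positive K1 makes the cube edges the easy directions.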


14.1.1.1.2 Magnetostriction 

Generally magnetostriction is either the change in the shape or the change in the
volume of a magnetically ordered material when a magnetic field is applied.
Therefore this is a general effect when a material becomes magnetized. It appears
in all solids, in metals as well as in insulators. Due to Hooke’s law the
deformation suffered is proportional to the sample size, say $\ell$, and therefore it is
convenient to express the deformation by the relative variation
$\lambda = \Delta\ell/\ell$, which is called the linear magnetostriction. In a crystal
$\Delta\ell$ is doubly anisotropic: it depends
on the crystallographic direction of measurement and on the direction along
which the magnetization, M, is oriented by the applied magnetic field, H.
Because the saturation magnetostriction $\lambda_s$ depends on the crystalline directions of
$\Delta\ell$ and M ($\beta$ and $\alpha$, respectively) we have to specify the magnetostriction as
$\lambda(\alpha, \beta)$. But also, in a polycrystalline material formed by small randomly
distributed crystallites, $\lambda$ measured along H, called the parallel magnetostriction
$\lambda_\parallel$, can be different from that measured along the perpendicular direction,
$\lambda_\perp$. It can happen that $\lambda_\perp \approx -\lambda_\parallel/2$ (isotropic, no volume
change) but usually this is not the case and then the volume of the solid changes
by the amount $\Delta V/V = \omega = \lambda_\parallel + 2\lambda_\perp$,
called the volume magnetostriction. The volume magnetostriction is of interest
from a scientific point of view; it allows general conclusions about the origin of
the magnetostriction. Usually the volume magnetostriction is about 10-100 times
smaller than the shape magnetostriction.

On the other hand, the shape (or form of the unit cell) of the material is
modified, and a measure of this effect is the difference
$\lambda_t = \lambda_\parallel - \lambda_\perp$, called the shape
magnetostriction. The shape magnetostriction is determined by the
magnetization process and is of technical relevance for sensor and actuator applications.
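
As a worked illustration of these definitions, the volume and shape magnetostriction can be computed from the strains measured parallel and perpendicular to the field. This is a minimal sketch; the function names are hypothetical and the numerical values are invented for the example, not measured data.

```python
def linear_magnetostriction(delta_l: float, length: float) -> float:
    """Relative length change lambda = delta_l / l of a sample of length l."""
    if length <= 0:
        raise ValueError("sample length must be positive")
    return delta_l / length


def volume_magnetostriction(lam_par: float, lam_perp: float) -> float:
    """omega = lambda_parallel + 2 * lambda_perpendicular (relative volume change)."""
    return lam_par + 2.0 * lam_perp


def shape_magnetostriction(lam_par: float, lam_perp: float) -> float:
    """Difference lambda_parallel - lambda_perpendicular (shape change)."""
    return lam_par - lam_perp


if __name__ == "__main__":
    # Illustrative isotropic case: lambda_perp = -lambda_par / 2, so the volume is conserved.
    lam_par, lam_perp = 20e-6, -10e-6
    print("volume magnetostriction:", volume_magnetostriction(lam_par, lam_perp))
    print("shape magnetostriction: ", shape_magnetostriction(lam_par, lam_perp))
```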


14.1.1.2 Extrinsic Properties — Microstructure 

A technically relevant magnetic material generally consists of more than one 
phase. The favorable properties of a technically usable material are based on the 
existence of interplay between different phases. Therefore the microstructure is 
important. It is determined by: grain size and grain shape; grain boundaries; 
and phases with different physical (magnetic) properties. It depends on the pro- 
duction process. 

For investigating the microstructure, one can use optical methods or, better 
still, scanning electron microscopy (SEM), transmission electron microscopy 
(TEM), or atomic force microscopy (AFM); compare Chapter 13, Section 13.2. 
The different crystallographic phases are characterized by diffraction experiments 
which can also be applied locally. 
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complete development 


Fig. 14.4 Typical microstructure of a 2/17-type magnet taken by TEM. 
Left: completely developed lamellar structure; right: incompletely 
developed microstructure (Matthias et al. 2002). 


In soft magnetic materials coercivity, permeability, and magnetic losses are 
determined by the alloy microstructure. These properties are therefore called 
“extrinsic.” 

In hard magnetic materials the microstructure determines the coercivity and
the magnetization process. A high coercivity is achievable only with a certain
microstructure. In modern Sm-Co magnets the pinning of narrow domain walls
occurs in phases with different crystal structures and consequently different
magnetic properties (especially anisotropy) (Kronmüller et al. 1984). As an
example, Fig. 14.4 shows the microstructure of a 2/17-type permanent magnet
(Matthias et al. 2002): the typical lamellar structure consists of 2/17-type material
inside and 1/5 Sm(Co,Cu)₅ material between the grains (see Section 14.1.3.3).


14.1.2 
Soft Magnetic Materials 


The ranges of permeability and saturation polarization of most soft magnetic
alloys are shown in Fig. 14.3. The most important conventional and new soft
magnetic materials are as follows: Fe-Ni (e.g., Permalloy, Mumetal), Fe-Si, Fe-Al
(Sendust, i.e., FeSiAl), Fe-Co, soft ferrites (Mn-Zn ferrites etc.), amorphous
alloys (Fe- or Co-based), and nanocrystalline alloys.
A soft magnetic material which is well suited for technical applications should 
exhibit the following properties. 
• High saturation magnetization at room temperature: The saturation
  magnetization determines the necessary volume in transformers for the
  transmission of a certain power. The higher the saturation magnetization of
  the material, the lower the necessary volume. At room temperature Fe has the
  highest saturation magnetization, but it has to be very pure in order to
  achieve good soft magnetic properties.
• High Curie temperature: Therefore mainly materials with Fe, Co or Ni (3d
  elements) are used for technical applications.
• Low coercivity: A low coercivity is generally accompanied by a high
  permeability (these properties are reciprocal).
• High permeability: A high permeability is of great importance, especially
  for shielding applications.
• Low conductivity: This is important for ac applications.
• Low losses: The losses consist of the hysteresis losses and the eddy current
  losses (normal and anomalous).


These properties (coercivity, permeability, losses) are correlated with the
magnetization process, where the interaction between domain walls and the
microstructure (grain size, phases, grain boundaries, local stress centers, etc.) is of
importance.

The introduction of various rapid quenching techniques (such as planar flow 
casting, atomic vaporization, and others) allowed the production of new types of 
materials. By rapid quenching three types of interesting soft magnetic materials 
can be produced: 

• crystalline materials like Fe-Si and Fe-Al-Si (Sendust alloy)
• amorphous materials
• nanocrystalline materials.


Rapid solidification from molten material leads to a new state which can be amor- 
phous, nanocrystalline, or just crystalline. Applying a high quenching rate to a 
molten alloy can result in a material which presents generally new and interest- 
ing physical properties. Such materials are said to be “rapidly solidified.” 

Amorphous metallic ribbons can be fabricated by bringing a stream of molten 
alloy in contact with a rapidly moving substrate surface. The most common sub- 
strate surfaces described in the literature are the insides of drums or metallic 
wheels, the outsides of wheels, between twin rollers, and on belts. 

Very often the so-called planar flow-casting technique is used, in which the
nozzle is situated very close to the moving substrate surface (at a distance around
0.3 mm). The rapidly quenched material obtained is a thin (about 20-50 µm) and
usually narrow (up to 10 mm) ribbon (see Fig. 14.5). In industrial production
lines, much wider ribbons are possible.

Annealing of amorphous alloys is a very important technique for reducing local 
stresses, achieving chemical homogenization and improvement of magnetic 
properties (adjustment of the shape of the hysteresis loop) of the samples. An- 
nealing can cause stress relaxation, controlled development of induced aniso- 
tropy, adjustment of a well-defined domain structure, controlled microstructural 
changes, and nanocrystallization. 


Fig. 14.5 Melt spinning technique. Labeled components: heating coil, nozzle,
amorphous ribbon, rotating Cu wheel.


14.1.2.1 Pure Fe and Fe-Si 

Iron and iron-silicon alloys are the most important soft magnetic materials in 
use today. Improvements in magnetic properties have been achieved primarily 
by minimizing chemical impurities and controlling crystal orientation. The term 
“pure” iron generally refers to iron of 99.9% purity. Nonmetallic impurity ele- 
ments such as C, O, S, or N which enter the lattice interstitially reduce the soft 
magnetic properties. A maximum permeability for commercial iron of 10 000 to
20 000 can be increased to 100 000 and more by annealing in H₂ and in vacuum.
This additionally causes a decrease in the coercivity Hc; Table 14.1 gives a short 
summary of the effect of purification of Fe. Small additions of C or N (of the 
order of 100 ppm and more) cause a drastic increase in the coercivity. For special 
applications, cores produced from Fe powder are used. This is favorable because 
of the low losses at frequencies up to 1 MHz. 

The so-called silicon steel had already been introduced by the beginning of the
20th century and became very successful for electric transformers. The magnetic
properties of the Fe-Si system can be summarized as follows: the saturation
magnetization, the Curie temperature, and the magnetocrystalline anisotropy
decrease due to Si substitution, whereas the components of the magnetostriction
constants $\lambda_{100}$ and $\lambda_{111}$ behave differently. However, close to
6 wt.% Si $\lambda_{100}$ approaches zero. This composition, on the other hand, is not
favorable because at 6% Si the material becomes too brittle for technical
applications. The electrical resistivity increases strongly due to Si substitution,
which is very important, especially for ac applications.

Table 14.1 Room-temperature values of differently treated “pure” Fe materials.

Material               μ0Ms [T]   Hc [A m⁻¹]   Permeability (max.)
Vacuum-melted Fe       2.15       25           20 000
Electrolytic Fe        2.15       16           up to 40 000
Pure Fe (H₂-treated)   2.16       4            up to 100 000

For transformer applications all kinds of losses have to be considered: for
example, hysteresis losses, eddy current losses, and anomalous eddy current losses.
Whereas the hysteresis loss per magnetization cycle is independent of frequency,
the eddy current losses and the anomalous eddy current losses (the latter being
due to domain wall movements) are frequency-dependent.
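
The different frequency dependence of the loss contributions can be made explicit with a short calculation. The sketch below is not taken from this chapter: it uses the standard textbook expression for the classical eddy current loss of a thin lamination under sinusoidal flux, P = (π B d f)² / (6 ρ) per unit volume, and treats the hysteresis energy per cycle as a given constant. The anomalous contribution is left out, and all numerical values are illustrative assumptions.

```python
import math


def hysteresis_power(energy_per_cycle: float, freq: float) -> float:
    """Hysteresis power density in W/m^3 for a fixed loop area (J/m^3 per cycle)."""
    return energy_per_cycle * freq


def classical_eddy_power(b_peak: float, thickness: float, freq: float, resistivity: float) -> float:
    """Classical eddy current power density in W/m^3 for a thin sheet.

    Assumes sinusoidal induction of amplitude b_peak (T), sheet thickness in m,
    frequency in Hz, resistivity in ohm m, and complete flux penetration.
    """
    return (math.pi * b_peak * thickness * freq) ** 2 / (6.0 * resistivity)


if __name__ == "__main__":
    # Illustrative numbers only: 0.3 mm sheet, 1.5 T peak induction,
    # resistivity 4.8e-7 ohm m, hysteresis energy 100 J/m^3 per cycle.
    for f in (50.0, 100.0, 400.0):
        p_h = hysteresis_power(100.0, f)
        p_e = classical_eddy_power(1.5, 0.3e-3, f, 4.8e-7)
        print(f"f = {f:5.0f} Hz: hysteresis {p_h:9.1f} W/m^3, classical eddy {p_e:9.1f} W/m^3")
```

Doubling the frequency doubles the hysteresis power but quadruples the classical eddy current power, which is why a high resistivity and thin laminations matter most for ac applications.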

The core losses can be affected by: composition (silicon content), impurities, 
grain orientation (texture), applied stress, grain size, thickness, and surface con- 
ditions. As early as 1934 (Goss 1934) it was shown that the development of a tex- 
ture leads to a reduction of core losses. 

In transformer applications the magnetic flux lies predominantly in the length 
of the laminations. It is therefore desirable to enhance the permeability in this 
direction. This can be achieved by various hot- and cold-rolling stages which lead 
to textured sheets. Good results were achieved with grain-oriented silicon-steel, 
with the [001] direction in the length of the lamination. The [001]-type crystal 
directions are the easy directions of magnetization which lead to the greatest per- 
meability. Grain orientation of silicon-iron can be performed by a rolling proce- 
dure as shown in Fig. 14.6. 

A very detailed description of the properties of silicon-iron can be found in 
Chin and Wernick (1980). Besides Fe-Si, Fe-Al and Fe-Al-Si soft magnetic 
alloys are also of technical importance. These materials are characterized by a 
high electrical resistivity, high hardness, high permeability, and low losses. Fa- 
mous in this respect is the so-called Sendust family of Fe-Al-Si alloys (4-7%Al, 
7-13%Si). This composition is defined by the lowest magnetostriction value as 
well as a low crystalline anisotropy in the ternary phase diagram. 


14.1.2.2 Ni-Fe Alloys 
Special Ni-Fe alloys (trade name: Permalloy), defined by a zero or low
magnetostriction and zero or low magnetic anisotropy, are also known under the name
“mu-metal,” which is produced by a careful heat treatment and minor additions
of Cu and Cr. These alloys exist over a wide range of compositions, from
30 to 80 wt.% Ni, with varying magnetic properties. We distinguish between alloys
with a high Ni content, which are characterized by a high permeability; alloys
with a Ni content of about 50 wt.%, which exhibit a high saturation; and those
with a low Ni content, which have a rather high electrical resistivity.

Fig. 14.6 Two possible ways to produce grain orientation in Fe-Si by a rolling
procedure. Panel labels: (011) plane and (001) plane, each marked with the [100]
direction and the rolling direction.

Ni-Fe alloys with low anisotropy and low magnetostriction have extremely high 
permeability, up to 300 000, and an intrinsic coercivity as low as 0.4 A m⁻¹. The 
high permeability makes mu-metal very effective as a screening material against 
static or low-frequency magnetic fields, which cannot be attenuated by any other 
method. Mu-metal requires special heat treatment — annealing in a hydrogen 
atmosphere, which increases the magnetic permeability about 40-fold. The 
annealing alters the crystal structure, aligning the grains and removing some 
impurities, especially carbon. Mechanical treatment may disrupt the grain align- 
ment, leading to a drop in permeability in the affected areas, which can be re- 
stored by repeating the hydrogen annealing step. 


14.1.2.3 Soft Magnetic Ferrites 

A ferrite is a ferrimagnetic oxidic compound which contains magnetically ordered
iron and which is derived from magnetite (Fe₃O₄, more exactly Fe²⁺O·Fe₂³⁺O₃)
by substituting divalent metal ions for Fe²⁺. These materials exhibit generally
good soft magnetic properties combined with an excellent high-frequency
behavior due to the fact that they are insulators. Trivalent metal ions substitute for Fe³⁺
and ions with other valences (1+, 4+, 5+) also can be incorporated by
considering charge compensation. Mn, Fe, and Co are generally the most common
divalent metals, but Cu, Zn, Mg and others are also used. Ferrites exhibit a
close-packed cubic spinel (MgAl₂O₄) structure, where the divalent ions replace Mg
and the trivalent ions replace Al. The origin of magnetism in ferrites is due to
(a) unpaired 3d electrons, (b) superexchange between adjacent metal ions, and
(c) nonequivalence in number of A and B sites. Since the common ferrite ions
(Mn²⁺, Fe²⁺, Ni²⁺, Co²⁺) have five or more 3d electrons, the magnetic
moments are aligned antiparallel between A and B sites. This results in a
ferrimagnetic type of order where, due to the occupation number, the B sites dominate.
Most of the ferrites of the formula MFe₂O₄ are inverse spinel structures
(Fe³⁺[M²⁺Fe³⁺]O₄) and the moment of the compound is given by that of M²⁺.

An advantage of ferrites is their high electrical resistivity due to the fact that the 
material is an insulator. A large saturation magnetization is desired but, due to 
the ferrimagnetic structure, the total magnetization is generally low in soft mag- 
netic ferrites. Zn substitution causes an increase in the saturation magnetization 
(especially at low temperatures), but a decrease in ordering temperature. 
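
The statement that the moment of an inverse spinel is given by that of M²⁺, and the increase in magnetization on Zn substitution, can both be checked with a simple count of spins. The sketch below is an idealized collinear two-sublattice estimate and not the author's calculation: it assumes spin-only moments, complete inversion, and nonmagnetic Zn²⁺ entering the A sites as (Zn_x Fe_1-x)[M_1-x Fe_1+x]O₄. All function names are hypothetical.

```python
# Number of 3d electrons of the ions considered (Mg2+ has none).
D_ELECTRONS = {"Mn2+": 5, "Fe2+": 6, "Co2+": 7, "Ni2+": 8, "Cu2+": 9,
               "Zn2+": 10, "Mg2+": 0, "Fe3+": 5}


def spin_only_moment(ion: str) -> int:
    """Spin-only moment in Bohr magnetons: the number of unpaired 3d electrons."""
    n = D_ELECTRONS[ion]
    return n if n <= 5 else 10 - n


def inverse_spinel_moment(m_ion: str) -> float:
    """Net moment per formula unit of Fe3+[M2+ Fe3+]O4 with antiparallel A and B sites."""
    a_site = spin_only_moment("Fe3+")
    b_site = spin_only_moment(m_ion) + spin_only_moment("Fe3+")
    return float(b_site - a_site)


def zn_substituted_moment(m_ion: str, x: float) -> float:
    """Net moment of (Zn_x Fe_1-x)[M_1-x Fe_1+x]O4, assuming strictly collinear spins."""
    if not 0.0 <= x <= 1.0:
        raise ValueError("x must lie between 0 and 1")
    fe3 = spin_only_moment("Fe3+")
    a_site = (1.0 - x) * fe3                       # Zn2+ carries no moment
    b_site = (1.0 - x) * spin_only_moment(m_ion) + (1.0 + x) * fe3
    return b_site - a_site


if __name__ == "__main__":
    for ion in ("Mn2+", "Fe2+", "Co2+", "Ni2+", "Cu2+"):
        print(ion, inverse_spinel_moment(ion))
```

The estimates 5, 4, 3, 2 and 1 Bohr magnetons for Mn, Fe, Co, Ni and Cu ferrite can be compared with the ion moments listed in Table 14.2; the deviations are commonly attributed to orbital contributions and incomplete inversion. The collinear picture is only reasonable for small Zn contents.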

For soft magnetic applications the following properties are important. 
• High permeability: This depends on spin rotation in the domains, which is
  defined by intrinsic parameters ($\mu_R = 1 + 2\pi M_s^2/K$; $\mu_R$ = rotational
  part of permeability), but also on wall displacements, which are dominated by
  microstructural parameters ($\mu_W = 1 + 0.75\pi M_s^2 D/\gamma$; $\mu_W$ = wall
  displacement part of permeability). The best candidates for high
  permeability are Mn and Ni ferrites.
• Low coercivity: This needs the lowest value of magnetocrystalline anisotropy
  and a low magnetostriction.

Magnetic properties at T = 20 °C of ferromagnetic spinels are summarized in
Tables 14.2 and 14.3.

Table 14.2 Magnetic properties at T = 20 °C of ferromagnetic spinels.

Ferrite        Js = μ0Ms [T]   Ion moment [μB]   Curie temperature Tc [°C]   Lattice parameter [nm]   Density [10³ kg m⁻³]
MnFe₂O₄        0.50            5.0               300                         0.850                    5.00
FeFe₂O₄        0.60            4.1               585                         0.839                    5.24
CoFe₂O₄        0.53            3.7               520                         0.838                    5.29
NiFe₂O₄        0.34            2.3               585                         0.834                    5.38
CuFe₂O₄        0.17            1.3               455                         0.8445                   5.42
MgFe₂O₄        0.14            1.1               440                         0.836                    4.52
Li₀.₅Fe₂.₅O₄   0.39            2.6               670                         0.833                    4.75
γ-Fe₂O₃        0.52            2.3               575                         0.834                    4.89
ZnFe₂O₄        0               -                 -                           0.840                    5.40
CdFe₂O₄        0               -                 -                           0.873                    5.76


Note that the first-order anisotropy constant K1 of Mn-Zn ferrite is exceptionally
low (Table 14.3). Therefore Mn-Zn ferrites are considered as high-permeability
materials. The most important commercial soft ferrites are Mn-Zn and Ni-Zn
ferrites (see Table 14.4). Mn-Zn ferrites are favorable for applications up to 1
MHz, whereas Ni-Zn ferrites are suited to higher frequencies. The range of
properties depends very much on preparation conditions and material parameters
such as composition, grain size, porosity, etc.

Table 14.3 Typical values of the magnetocrystalline anisotropy coefficient K1
[10³ J m⁻³] and saturation magnetostriction λs [×10⁶] at room temperature for
various ferrites. Ferrites listed: MnFe₂O₄, FeFe₂O₄, CoFe₂O₄, NiFe₂O₄, MgFe₂O₄,
γ-Fe₂O₃, Mn₀.₆₂Zn₀.₄₁Fe₁.₉₇O₄. Legible entries: 40; −110.

Table 14.4 Properties of Mn-Zn and Ni-Zn ferrites.

Property                         Mn-Zn       Ni-Zn
Initial permeability μi          500-20000   10-2000
Saturation polarization Js [T]   0.3-0.5     0.1-0.36
Curie temperature Tc [°C]        100-250     100-500
Coercivity Hc [A m⁻¹]            4-100       16-1600
Resistivity ρ [Ω m]              0.02-20     10-10⁷
Density [10³ kg m⁻³]             4.6-4.8     4.8-4.9
Total power loss [W m⁻³]         50-200      very low


14.1.2.4 Amorphous Materials 
The production of amorphous 3d-metal (Fe,Co,Ni)-based materials yields a great 
variety of compounds with all kinds of magnetic properties. Amorphous materi- 
als are of interest here because the magnetic properties can be changed systemat- 
ically by substituting different elements without changing any kind of “struc- 
ture.” It should be mentioned that at the beginning it was very surprising that 
magnetic order is possible without the periodicity of a crystal structure. This 
problem was solved by realizing that the short-range order of the atoms causes
a local density of states which is very similar to that of crystalline materials.
The technical breakthrough was achieved by the invention of the so-called
melt-spinning process (e.g., the single roller technique; cf. Fig. 14.5), which allows the
easy and fast production of large amounts of thin ribbons (typically between 10
and 50 µm thick) of amorphous materials. For magnetic applications two
interesting families of amorphous 3d-metal-based materials exist:
• 3d metal-metalloid (metalloid = B, Si, C, etc.)
• early transition element-late transition element. Here the Fe-Zr and Co-Zr
  systems are famous; both systems show an unusual magnetization behavior
  which can be explained by invar-type behavior (see Section 14.2).


Great progress in developing new soft magnetic materials has been achieved 
since the mid-1980s (see, e.g., Pfeifer and Radeloff 1980; Luborsky 1983). The 
amorphous alloys are produced with the general formula (3d metal)₈₀(metalloid)₂₀
(3d metal = Fe, Co, Ni; metalloid = B, Si, Al, C, etc.). The metalloid is necessary
for lowering the melting point of the alloy, thus stabilizing the amorphous state.
Amorphous materials are generally obtained as thin ribbons (thickness 20-50
µm) by the single roller technique (Hasegawa 1982; Luborsky 1983). The absence
of grains and grain boundaries leads to excellent soft magnetic properties (see,
e.g., Egami et al. 1974). The easy substitution over a broad concentration range
makes these materials interesting from a basic point of view, but the excellent
magnetic properties are also interesting for many applications (Boll and Hinz
1985). The most promising systems here are Fe-Co-based amorphous alloys.
Therefore many studies were performed on these materials, describing the basic
magnetic properties such as the concentration and temperature dependence of
the magnetization, the Curie temperatures (O’Handley et al. 1976; O’Handley
and Boudreaux 1978), the crystallization behavior (Köster 1984), the
magnetostriction in these systems (O’Handley et al. 1976, 1977; Grössinger 1990), the induced
anisotropy (Vazquez et al. 1977), etc.


14.1.2.5 Nanocrystalline Materials 

Nanocrystalline materials form the second group of new soft magnetic
compounds. Amorphous ribbons with the approximate composition
Fe₇₃.₅Cu₁Nb₃Si₁₃.₅B₉ are heat-treated (550 °C for 1 h), thus achieving a
nanocrystalline state. The composition is mainly Fe₇₃.₅(Si,B)₂₂.₅ where the addition of Cu
and Nb hinders the growth of the grains, thus leading to crystals of about 10-15
nm (Köster et al. 1991). This material was developed by Hitachi Metals and is on
the market under the name Finemet (Yoshizawa et al. 1988). In the
nanocrystalline state the material consists of small α-Fe-Si grains which are embedded in
an amorphous matrix. An important factor is that the ordering temperature of
the amorphous matrix (about 300 °C) is lower than that of the Fe-Si grains (about
550 °C). The excellent soft magnetic properties occur because the grain size is
smaller than the magnetic exchange length. This leads to an averaging of the
magnetic anisotropy of the crystalline Fe-Si and consequently to a low value
of the effective anisotropy (Herzer 1990). A similar reduction occurs for the
magnetostriction, where the positive contribution of the amorphous matrix is
reduced by the negative contribution of the Fe-Si (Herzer 1995; Sato Turtelli
et al. 2000).
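
The averaging of the anisotropy over exchange-coupled grains is usually quantified with the random anisotropy model. The sketch below uses its common textbook form, with exchange length L_ex = sqrt(A/K1) and effective anisotropy K1⁴D⁶/A³ for grain sizes D below L_ex. The formula and the numerical values (K1 = 8 kJ m⁻³, A = 10⁻¹¹ J m⁻¹, typical order-of-magnitude figures for Fe-Si) are assumptions brought in for illustration, not data from this chapter.

```python
import math


def exchange_length(a_ex: float, k1: float) -> float:
    """Basic exchange length sqrt(A / K1) in m (A in J/m, K1 in J/m^3)."""
    return math.sqrt(a_ex / k1)


def effective_anisotropy(k1: float, grain_size: float, a_ex: float) -> float:
    """Averaged anisotropy in J/m^3 according to the random anisotropy model.

    For grains at least as large as the exchange length no averaging takes place
    and the local K1 is returned.
    """
    if grain_size >= exchange_length(a_ex, k1):
        return k1
    return k1 ** 4 * grain_size ** 6 / a_ex ** 3


if __name__ == "__main__":
    K1 = 8.0e3       # J/m^3, assumed value for the Fe-Si grains
    A_EX = 1.0e-11   # J/m, assumed exchange stiffness
    print(f"exchange length: {exchange_length(A_EX, K1) * 1e9:.1f} nm")
    for d_nm in (10.0, 15.0, 30.0, 100.0):
        k_eff = effective_anisotropy(K1, d_nm * 1e-9, A_EX)
        print(f"D = {d_nm:5.1f} nm -> <K> = {k_eff:10.3f} J/m^3")
```

The sixth-power dependence on the grain size is what makes grains of 10-15 nm so effective: the averaged anisotropy falls by orders of magnitude relative to the local K1.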

The magnetic behavior is strongly determined by the ratio of crystalline to
amorphous phases. This becomes especially important at elevated temperatures
(above the ordering temperature of the amorphous matrix), where the coupling
between the grains is reduced, which leads to a
magnetic hardening (Grössinger et al. 1995; Hernando et al. 1994).

The second nanocrystalline system of importance is based on a Fe-Zr-Cu-B
amorphous alloy which also transforms into the nanocrystalline state after a heat
treatment at about 600 °C for 1 h (Suzuki et al. 1994; Dahlgren et al. 1996a;
Estevez Rams et al. 1996). In this material nanocrystals of α-Fe are embedded in
an amorphous matrix. In both cases the exchange coupling between the
nanocrystals leads to excellent soft magnetic properties at room temperature. The
Fe-Zr-Cu-B system has the technical disadvantage of difficult production: it
needs a protective gas because of the highly reactive Zr. However, it exhibits a
higher saturation magnetization in the nanocrystalline state and from an
experimental point of view it is a simpler two-phase system. As mentioned already, α-Fe
appears there as nanocrystals. This is much easier to detect, e.g., by Mössbauer
spectroscopy. Additionally, the ordering temperature of the amorphous phase in
this case lies between 50 and 100 °C, which is technically disadvantageous.


14.1.3 
Hard Magnetic Materials 


The development of hard magnetic materials started with the discovery of Fe-C
steels. During the 20th century the technical performance was dramatically
improved. A key number for permanent magnets is the energy density (described
in kJ m⁻³) stored at the optimum working point of a magnetic material. The
stored energy density (BH)max could be improved from less than 40 kJ m⁻³ in
about 1900 to more than 400 kJ m⁻³, which is available nowadays. Figure
14.7 shows the improvement of (BH)max within the last 100 years. The dramatic
enlargement of the coercivity became possible by the discovery of materials,
such as the rare-earth 3d intermetallics, with a high intrinsic magnetocrystalline
anisotropy.

One has to distinguish between the intrinsic and extrinsic properties which are 
important for a high quality permanent magnet. The intrinsic properties com- 
bined with the actual microstructure (grain size and shapes, phases, structures, 
etc.) determine the extrinsic properties. 

The following intrinsic properties are important for permanent magnets.

• High saturation magnetization at room temperature: Therefore
  3d metals such as Fe or Co are used. A high saturation
  magnetization is also necessary to achieve a high energy
  product.

• High Curie temperature, Tc: Here again the 3d metals are
  necessary, because the direct exchange between the 3d
  electrons causes a high ordering temperature. A high Tc and
  consequently a high operating temperature are necessary for
  all industrial applications.

• High anisotropy: The use of 4f metals is favorable because
  they generally exhibit an orbital moment L which causes a
  high magnetocrystalline anisotropy. The anisotropy can be
  described either by the so-called anisotropy field HA or by the
  anisotropy constants K1, K2, K3, .... The anisotropy constants
  generally determine the nucleation field as well as the pinning
  field, which are important parameters for the mobility of the
  domain walls and consequently for the magnetization process.
  A high uniaxial anisotropy is a necessary condition for
  achieving a high coercivity.

Fig. 14.7 Development of the energy density achieved in commercial
permanent magnets over the last 100 years (time axis 1920 to 2000; curve
labels include magnetic steel and Sm(Co,Cu,Fe,Zr)-type magnets).


The following extrinsic properties are important for permanent magnets.

• High energy product (B.H)max, which scales with Br²: Therefore
  a high saturation magnetization determines, together with the
  degree of alignment, the maximum possible stored energy.
  For an isotropic, uniaxial material Br = Bs/2, and therefore
  (B.H)max of an isotropic material is in principle much
  smaller (by roughly a factor of four for the same Bs).
  Generally, a large energy density is necessary for
  applications with a high degree of miniaturization.
  Polycrystalline, isotropic hard magnetic materials are
  important for applications where a magnetic code is to be
  written.

• High coercivity: Besides the magnetocrystalline anisotropy, a
  well-adjusted microstructure is the main factor determining the
  coercivity (see Fig. 14.4).

• Thermal stability: This is given by the temperature
  dependence of the magnetization of all phases involved.

• Mechanical properties: Here again the microstructure plays a
  leading role.

• Corrosion stability: This depends on the chemical composition
  of the phases involved as well as on the microstructure.
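The Br² scaling of the energy product listed above can be made concrete with the textbook upper bound (B.H)max = Br²/(4 μ0), which holds for an ideal magnet with a straight demagnetizing line of slope μ0. The Python sketch below evaluates this bound and converts between the CGS unit MGOe used in Table 14.5 and kJ m⁻³; the function names are illustrative, and the ideal-magnet assumption means real materials lie below the bound.

```python
import math

MU0 = 4e-7 * math.pi  # vacuum permeability in T m / A (classical value)

# 1 MGOe = 1e6 * (1e-4 T) * (1e3 / (4*pi) A/m) = 1e5 / (4*pi) J/m^3
KJ_PER_M3_PER_MGOE = 100.0 / (4.0 * math.pi)


def mgoe_to_kj_per_m3(value_mgoe):
    """Convert an energy product from MGOe to kJ/m^3."""
    return value_mgoe * KJ_PER_M3_PER_MGOE


def ideal_energy_product(Br):
    """Upper bound Br**2 / (4*mu0) in J/m^3 for a remanence Br in tesla.

    Assumes an ideal magnet whose B(H) curve in the second quadrant is a
    straight line of slope mu0 and whose coercivity exceeds Br / (2*mu0).
    """
    return Br**2 / (4.0 * MU0)
```

For example, `mgoe_to_kj_per_m3(56)` is close to 446 kJ m⁻³, and halving the remanence, as for an isotropic uniaxial material with Br = Bs/2, lowers `ideal_energy_product` by a factor of four.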


Figure 14.8 shows the demagnetizing curve B as a function of applied field H in 
the second quadrant for different hard magnetic materials. It also demonstrates 
how the working point is found. The arrow indicates a working point, which is 
given by the geometry of the sample (the slope is the reciprocal of the demagne- 
tizing factor) (Dahlgren 1998). 
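How a working point follows from the sample geometry can be sketched numerically. The Python example below assumes an idealized magnet with a linear demagnetizing curve B = Br + μ0 μr H and a load line B = -μ0 (1 - N)/N · H set by the demagnetizing factor N (for small N this slope approaches -μ0/N, i.e., the reciprocal of N). Both the linear curve and the function name are assumptions made for illustration, not data from Fig. 14.8.

```python
import math

MU0 = 4e-7 * math.pi  # vacuum permeability in T m / A


def working_point(Br, N, mu_r=1.0):
    """Return (H, B) where the load line crosses a linear demagnetizing curve.

    Br:   remanence in tesla
    N:    demagnetizing factor of the sample, 0 < N < 1
    mu_r: recoil permeability of the assumed linear curve
          B = Br + mu0 * mu_r * H (second quadrant, H < 0)
    """
    if not 0.0 < N < 1.0:
        raise ValueError("demagnetizing factor must lie between 0 and 1")
    slope = (1.0 - N) / N  # load line: B = -mu0 * slope * H
    H = -Br / (MU0 * (mu_r + slope))
    B = -MU0 * slope * H
    return H, B
```

For an ideal magnet (mu_r = 1) with N = 0.5 the working point lies at B = Br/2, which is the point of maximum energy product Br²/(4 μ0).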
Nowadays there are four important industrial permanent magnet families:

• AlNiCo
• hard magnetic ferrites
• Sm-Co magnets: 1/5-based materials and 2/17-based compounds, which
  differ significantly in their magnetization behavior
• Nd-Fe-B magnets.

Fig. 14.8 Demagnetizing curve in the second quadrant for different permanent
magnets (axes: applied field H in 10⁵ A m⁻¹; flux density B in T).


Table 14.5 Comparison of permanent magnet materials.

Material             Cost index [%]   (B.H)max [MGOe]   Coercivity iHc [kOe]   Max. operating temp. [°C]   Machinability
Nd-Fe-B (sintered)   65               45                up to 30               130                         fair
Nd-Fe-B (bonded)     50               10                up to 11               100                         good
Sm-Co (sintered)     100              30                up to 50               300                         difficult
Sm-Co (bonded)*      85               12                up to 10               120                         fair
Alnico               30               10                up to 2                550                         difficult
Hard ferrite         5                (not given)       up to 3                300                         fair
Flexible*            2                (not given)       up to 3                100                         excellent

* Bonded or flexible magnets are plastic-bonded magnets, which are
useful for industrial applications that do not need the highest
magnetic properties but do need every kind of shape.
Fig. 14.9 Remanence versus coercive field for different permanent
magnet materials (Dahlgren 1998).


The properties of industrial permanent magnets are shown in Table 14.5. The relative
prices are normalized to that of a sintered Sm-Co magnet (100%) (see the
cost index column). The last column shows that all kinds of bonded materials are
superior with respect to machinability. However, these materials exhibit a lower
working temperature and naturally a lower energy density.

In Fig. 14.9 the remanence Mr of various permanent magnet materials is
plotted versus the coercivity. One can see very clearly that the sintered Nd-Fe-B-based
materials exhibit the highest values of remanence and coercivity at room
temperature.


14.1.3.1 AlNiCo

Alnico magnets are made up of an alloy of iron, aluminum, nickel, and cobalt 
with small amounts of other elements added to enhance the properties of the 
magnet. Alnico magnets have a high corrosion resistance, a good temperature 
stability (see Table 14.5), and good shock resistance, but are easily demagnetized 
due to the rather low coercivity field. Alnico magnets are produced by two typical 
methods, casting or sintering. Sintering offers superior mechanical characteris- 
tics, whereas casting delivers higher-energy products (up to 5.5 MGOe) and 
allows the design of intricate shapes. Anisotropic Alnico, which provides a pre- 
ferred direction of magnetic orientation, can also be produced. Its coercivity is 
generally based on the shape anisotropy of the Fe-Co rich phase. 
14.1.3.2 Ferrites

As a result of their favorable performance/price ratio, ceramic anisotropic ferrite 
magnets are located in the center of the permanent magnet market, where re- 
quirements with respect to performance and/or allowable magnet volume are de- 
manding but not extreme. The main application for high quality ferrite magnets 
is as segments in various DC motors for the automotive industry. The manufac- 
ture of ceramic ferrite magnets consists of fine milling of the prefired material 
followed by several pressing and sintering steps which have been optimized in 
the last 50 years. 

The intrinsic properties derive from the M-type crystal structure (space group
P6₃/mmc) and, notably, from the five distinct Fe sublattices. Two intrinsic
properties are crucial: the saturation magnetization (Js) for Br and the anisotropy
field strength (HA) for Hc. The five Fe sublattices are coupled by superexchange,
allowing only parallel (up) or antiparallel (down) orientation. Their mutual orientation
is given by the Gorter model (Went et al. 1952; Stuijts et al. 1955): 2a (up), 4f1
(down), 12k (up), 4f2 (down), 2b (up). Taking into account that the magnetic moment
of Fe³⁺ amounts to 5 μB, the total moment per formula unit of AFe12O19 at 0 K
amounts to 20 μB, in agreement with the observed saturation magnetization for
the pure compound (Js). The temperature dependence of Js is shown in Fig.
14.10, implying Js(300 K) = 478 mT. It is remarkable that the Js-T curve is almost
linear in a broad T region (dJs/dT = -0.9 mT K⁻¹). Mössbauer analysis revealed
that the latter stems from the 12k sublattice (Stuijts et al. 1959). The magnetization
is strongly bound to the hexagonal c axis. The anisotropy field derives from
K1 and Js: HA = 2K1/Js. The temperature dependence of K1 is analogous to that
of Js, but its increase at decreasing temperature is somewhat less, resulting in a
flat HA-T curve, having a maximum around 300 K (Fig. 14.10). There is not yet a
clear-cut model for the magnetocrystalline anisotropy. The contribution of dipole-dipole
interaction has been calculated to be relatively small. So, the spin-orbit
coupling of the Fe³⁺ ions must play the main role, in spite of the fact that (free)
Fe³⁺ has no orbital momentum. Mostly, the contribution to the overall spin-orbit
coupling is associated with the 2b site, but the joint 12k sites also play a significant
role (Stuijts et al. 1959; Lotgering 1974; Kools 1994).

Fig. 14.10 Temperature dependence of saturation magnetization Js,
anisotropy constant K1 and the anisotropy field HA for BaFe12O19.
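The moment count of the Gorter model and the relation HA = 2K1/Js can be checked with a few lines of Python. In this sketch the number of Fe ions per formula unit on each sublattice is taken as half the Wyckoff multiplicity (the hexagonal cell holds two formula units), and the K1 value in the usage note is an illustrative assumption rather than a number quoted in the text.

```python
# Fe3+ ions per formula unit AFe12O19 and spin direction (+1 up, -1 down)
# for the five sublattices of the Gorter model.
GORTER_SITES = {
    "2a": (1, +1),
    "4f1": (2, -1),
    "12k": (6, +1),
    "4f2": (2, -1),
    "2b": (1, +1),
}


def net_moment_per_formula_unit(mu_fe=5.0):
    """Net moment in Bohr magnetons, given mu_fe Bohr magnetons per Fe3+ ion."""
    return mu_fe * sum(count * sign for count, sign in GORTER_SITES.values())


def anisotropy_field(K1, Js):
    """Anisotropy field HA = 2*K1/Js in A/m (K1 in J/m^3, Js in tesla)."""
    return 2.0 * K1 / Js
```

`net_moment_per_formula_unit()` returns 20.0, i.e., eight up and four down Fe³⁺ moments leave four uncompensated ions. With Js = 0.478 T and an assumed K1 of 3.3 × 10⁵ J m⁻³, `anisotropy_field` gives a field of roughly 1.4 MA m⁻¹.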

The hard magnetic properties of M-type ferrites originate in the magnetocrystalline
anisotropy of Fe³⁺ in the magnetoplumbite structure (space group P6₃/mmc).
M-type ferrites were the first permanent magnets where the coercivity was
determined by the magnetocrystalline anisotropy.

The invention of La- and Co-substituted hard magnetic ferrites with improved 
properties renewed the interest in these materials (Kools et al. 2002). 


14.1.3.3 Sm-Co 

Since 1966 a new family of magnet materials has been evolving that is known as
the “rare-earth permanent magnets” (REPMs). For the application of REPMs, a
sufficiently high Curie temperature, a high magnetization, and a high magnetocrystalline
anisotropy are important. The first two of these properties are provided
mainly by the sublattice of the 3d element, whereas the rare-earth sublattice is
mainly responsible for the last. The magnetic anisotropy of the rare-earth sublattice
is a single-ion, crystal-field-induced anisotropy. A strong magnetic coupling
between the rare-earth and the 3d sublattices is required in order to extend the
anisotropy to the whole lattice and to preserve this anisotropy at elevated
temperatures. The so-called Sm-Co magnets are multiphase metallurgical systems
with complex microstructures, and they always contain more than two elements. The
crystalline structures of the phases from 1:3 to 2:17 stoichiometry are closely related.

Fig. 14.11 Phase diagram of the Sm-Co system together with the unit
cells of the 1/5 and 2/17 structures.

Samarium-cobalt-based rare-earth magnet materials have a much higher magnetic
strength (energy product) than AlNiCo or ceramic ferrite materials. Introduced
to the market in the 1970s, samarium-cobalt magnets continue to be
used today. They are divided into two main groups: SmCo5 and Sm2Co17 (commonly
referred to as 1/5 and 2/17). Figure 14.11 shows the Sm-Co phase diagram
and the hexagonal structures of the 1/5 and the 2/17 unit cells.

In order to achieve the optimum microstructure a well-designed heat treatment 
is very important. In recent years special Sm-Co based magnets which are stable 
at high temperatures and can be used up to 500 °C were developed (Liu et al. 
1999). 


14.1.3.4 Nd-Fe-B 

Nd-Fe-B-based magnets were invented about 20 years ago (Sagawa et al. 1984;
Croat et al. 1984). The excellent hard magnetic properties are based on the high
saturation polarization of the tetragonal Nd2Fe14B compound together with the
high anisotropy field at room temperature (HA about 70 kOe). Figure 14.12(a)
shows the tetragonal structure of this material, and Fig. 14.12(b) the complex
phase diagram of Nd-Fe-B. This third generation of rare-earth magnets contains
the most powerful and advanced commercialized permanent magnets today.
Since they are made from neodymium, one of the most plentiful rare-earth elements,
and inexpensive iron, Nd-Fe-B magnets offer the best value in cost and
performance. Rare-earth iron-boron magnets became commercially available in
the mid-1980s and have been growing increasingly popular ever since.


Fig. 14.12 (a) Tetragonal structure of Nd2Fe14B (unit cell dimensions as
labelled in the figure: a = 0.8 nm, c = 1.2 nm); (b) phase diagram of Nd-Fe-B.


Fig. 14.13 (a) Hysteresis loop measured parallel (grey) and 
perpendicular (black) to the preferential orientation (Rodewald et al. 
2002); (b) microstructure of a world record Nd-Fe-B-based permanent 
magnet (Gutfleisch 2001). 


Nd-Fe-B magnets are available in both sintered and bonded forms. Sintered
Nd-Fe-B offers the highest magnetic properties (28-50 MGOe), whereas bonded
Nd-Fe-B offers lower energy products in exchange for easy machinability.
Although bonded magnets do not possess magnetic properties as advanced as
those of sintered magnets, they can be made in shapes and sizes that are difficult
to achieve with sintering. A variety of coatings can be applied to the magnets’
surface to overcome the principal drawback of neodymium-based magnets, namely
their tendency to corrode easily.

In order to demonstrate the excellent properties of this kind of material, Fig.
14.13 shows the hysteresis loop as well as the microstructure of a world record
Nd-Fe-B-based permanent magnet with (B.H)max = 56 MGOe (Rodewald et al.
2002).


14.1.3.5 Nanocrystalline Materials 

Some years ago nanocrystalline hard magnetic materials were invented and
studied (Coehoorn et al. 1988; Jha et al. 1989; Liou et al. 2004). For grains below
approximately 30 nm the grain size becomes comparable to the magnetic exchange
length. This leads to a remanence enhancement, which consequently
causes a higher stored energy (B.H)max. The remanence enhancement was first
observed in Nd-Fe-B-Si (Keem et al. 1987) and later also in many other compounds.
It was shown that nanocrystalline hard magnetic materials exhibit
magnetic properties which are different from those of microcrystalline magnetic
materials (Dahlgren et al. 1996b, 1997a,b). For applications the most interesting
feature is that, due to the enhanced remanence, the energy product is increased
even for isotropic grains with a uniaxial easy axis. Here the ratio Mr/Ms is higher
than the expected 0.5 for noninteracting uniaxial grains according to Stoner and
Wohlfarth (1948). Additionally, they exhibit a single-phase type of magnetic hysteresis
loop instead of the expected two-phase type of hysteresis loop that is measured
for microcrystalline two-phase magnets. Also, the shape of the recoil curves
is characteristic of so-called spring type magnets (McCormick et al. 1996).
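The reference value Mr/Ms = 0.5 for noninteracting uniaxial grains follows from averaging the projection of each grain's easy axis onto the previously applied field direction over a hemisphere of random orientations. The short Python sketch below evaluates this average numerically with a midpoint rule; the function name and the number of integration steps are illustrative choices.

```python
import math


def isotropic_remanence_ratio(steps=10000):
    """Average of cos(theta) over a hemisphere of easy-axis directions.

    After saturation, the moment of each noninteracting uniaxial grain relaxes
    to the easy-axis direction closest to the field, so its contribution to
    the remanence is Ms * cos(theta) with theta between 0 and pi/2. The
    orientation weight for randomly oriented axes is sin(theta).
    """
    h = (math.pi / 2.0) / steps
    numerator = 0.0
    denominator = 0.0
    for i in range(steps):
        theta = (i + 0.5) * h
        weight = math.sin(theta)
        numerator += math.cos(theta) * weight * h
        denominator += weight * h
    return numerator / denominator
```

The numerical average agrees with the analytic value 1/2 to within the integration error, so any measured Mr/Ms clearly above 0.5 in an isotropic sample points to intergrain coupling.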

In Fig. 14.9, the remanence as a function of the coercivity of different Sm-Co-
and Nd-Fe-B-based materials is compared. The special position of the nanocrystalline
hard magnetic materials is obvious. By varying the actual composition as
well as the heat treatment, the remanence and the coercivity of these materials can
be changed dramatically. A comparison of the actual microstructure with the
macroscopic data (hysteresis loop, magnetization, anisotropy) can be used as input
for any kind of modeling (Hauser and Grössinger 2001).

Most investigations of nanomagnets were performed on RE-Fe-B-based samples.
These isotropic nanocrystalline materials can be classified into three different
composition ranges (Dahlgren et al. 1997b).

• Low RE content: In this case nanocrystalline RE2Fe14B grains
  are mixed with nanocrystalline soft magnetic α-Fe grains or,
  if the amount of B is high enough, with soft magnetic Fe3B.
  Here, an even higher remanence enhancement than for
  stoichiometric RE-Fe-B occurs, due to polarization of the
  α-Fe with its higher magnetic moment. These materials are called
  nanocomposite magnets.

• Stoichiometric RE-Fe-B: Here the pure exchange coupling
  between the nanocrystalline RE2Fe14B grains can be studied.

• High RE content: The nanocrystalline RE2Fe14B grains are
  isolated from one another by an RE-rich phase, which at room
  temperature is nonmagnetic. This material is technically irrelevant.


Davies and his group (Davies et al. 1993), as shown in Fig. 14.14, carefully 
investigated the effect of varying the Nd content on the magnetic properties of 
Nd-Fe-B. 

The development of new production methods combined with the discovery of
the nanocrystalline state opened new horizons for magnetic materials research,
and revolutionized the world of permanent magnets. Since then, many hard magnetic
systems starting from Nd-Fe-B but also including the structure types 1/12,
2/17, 3/29, 1/5, etc. have been made with nanosize microstructures and with large
values of coercivity (Pinkerton and Winterton 1989; Ding et al. 1993; Kuhrt et al.
1993; Shen et al. 1993). Besides melt spinning (Dahlgren et al. 1997b) and splat
cooling, other rapid solidification techniques have been used, such as vapor deposition
(Cadieu 1992, 1995; Sellmyer 1992), atomization (Narasimhan et al. 1986)
and mechanical alloying (McCormick et al. 1998; Schultz 1990) for the fabrication
of nanophase magnets. Recently, sputtering techniques have also been used
to prepare nanosize Sm-Co alloys (Lambeth 1996; Sellmyer 1996) and CoPt
and FePt alloys for high-density recording media (Liu et al. 1998; Stavroyiannis
et al. 1998).

Fig. 14.14 Dependence of the coercivity iHc, the remanence Jr and
the energy product (B.H)max on the Nd content in a Nd-Fe-B alloy
(after Davies et al. 1993).

Mechanical alloying, as mentioned above, is always followed by a heat treatment
(Ding et al. 1994). In order to produce nanocrystalline materials in the larger
amounts that are of industrial interest, severe plastic deformation (SPD) was
suggested (Giguère et al. 2002).


14.1.3.6 Industrial Nanocrystalline Hard Magnetic Materials 

Whereas soft magnetic nanocrystalline materials are produced widely and used
by industry, the situation for hard magnetic nanocrystalline materials is different.
Only Magnequench really offers commercial powder in which nanocrystalline
Nd-Fe-B is used. MQP™ powders are based upon Magnequench’s patented
RE-Fe-B alloy compositions, which are rapidly solidified from the molten state
at extremely high cooling rates, on the order of 10⁶ K s⁻¹, by melt spinning. This
rapid solidification results in a material which has an extremely fine (typically
30-50 nm) metallurgical grain structure. Because the resulting grain size is
smaller than the critical size for a single magnetic domain, these materials are
magnetically isotropic. Further, in contrast to the fine, anisotropic powders that
are used to manufacture sintered RE-Fe-B magnets, MQP™ powder is relatively
stable against oxidation-induced demagnetization. These characteristics make
MQP™ powders ideally suited for the production of bonded permanent magnets
(see: http://www.magnequench.com/).


14.1.4 
Outlook 


The newest developments concerning hard magnetic materials are application-driven.
Nd-Fe-B-based bulk magnets with high coercivity but without the
expensive Dy or Tb are a target of research. Magnets based on Sm-Co which are
stable at high temperatures are also under development. Another trend goes in
the direction of producing high-quality and stable hard magnetic films, mainly
for micromachines, as well as for high-density recording applications. Here the
study (experimental as well as by modeling) of dynamic magnetization processes
is a challenging task.

In soft magnetic materials amorphous micro- and nanowires are being studied 
because they are of interest for sensor applications. This is related to their mag- 
netic behavior, characterized by a square hysteresis loop. This phenomenon is 
very useful for a large number of technological applications. The study of highly 
ordered arrays of magnetic wires with diameters typically in the range of tens to a 
hundred nanometers is also a topic of growing interest for storage applications. 
The ordering by self-assembly which occurs between the nanowires during their 
production, together with the magnetic nature of nanowires, gives rise to out- 
standing cooperative properties different from bulk and even from thin-film sys- 
tems; this is of fundamental and technological interest. 
By the name “Invar alloys” one nowadays understands a class of magnetic materials
which show a pronounced departure in their thermal expansion from the
normal Grüneisen behavior (for normal metals the linear thermal expansion coefficient
α at room temperature is of the order of (10-20) × 10⁻⁶ K⁻¹). The
name “Invar” was coined by the Swiss physicist Charles Édouard Guillaume
who, in his search for a cheaper material for the meter standard to replace Pt-Ir,
discovered that face-centered cubic (fcc) alloys of iron (Fe) and nickel (Ni) with a
composition of about Fe65Ni35 exhibit almost vanishing thermal expansion in a
broad temperature range around room temperature (Guillaume 1897).

Figure 14.15 shows this concentration dependence, whereas Fig. 14.16 depicts
schematically the temperature dependence of the thermal expansion coefficient α
for Fe65Ni35. Because of these outstanding properties the material was given the
name Invar, alluding to its temperature-invariant volume. Guillaume’s discovery
immediately found widespread application in the construction of calibrated,
high-precision mechanical instruments.

Fig. 14.15 Concentration dependence of the linear coefficient of
thermal expansion α for the Invar alloy Fe-Ni. (After Wassermann 1990).

Applications are also found in electronic devices such as shadow masks in TV
and computer screens, and resonant cavities of microwave and laser instruments.
In addition, Invar alloys are used as structural components ranging from liquid
gas containers to crude oil vessels and as core wires of long-distance power
cables. Owing to their importance for technological applications many different
Invar alloys have been developed, among them many ternary and quaternary alloys. A
material with a thermal expansion coefficient (TEC) of almost zero (<10⁻⁷ K⁻¹),
the so-called super Invar, consists of Fe-32 wt.% Ni-4 wt.% Co. A material which,
besides its small TEC, also shows resistance to chemical corrosion is stainless
Invar, Fe-54 wt.% Co-9 wt.% Cr. As structural materials exhibiting a tensile
strength of >1 GPa, the work-hardening alloy Fe-Ni-Mo-C and the precipitation-hardening
alloy Fe-Ni-Co-Ti are in use. A further important alloy is Fe-36
wt.% Ni-12 wt.% Cr, the so-called Elinvar, which shows temperature-invariant
elastic properties and for decades was used as a material for springs in mechanical
watches. It should be mentioned that for the discovery of Elinvar, C. E. Guillaume
was awarded the Nobel Prize in Physics in 1920.

Fig. 14.16 Schematic temperature dependence of the thermal
expansion coefficient (full line) in Fe65Ni35, which consists of a
non-magnetic part α_nm (phonons) and a magnetic contribution
α_m (magnetostriction). (After Wassermann 1990).

Since all Invar materials are ferro- or antiferromagnetic (e.g., Cr-Fe-Mn) and the
Invar-related properties occur only below the Curie/Néel temperature, it became
evident very early that magnetism is inevitably entangled with the effect. This
assumption was also the basis of the earliest attempts to explain the Invar behavior.
Most prominent among these early models is the 2γ-state model (Weiss 1963). There
it was argued that in fcc Fe-Ni Invar there should exist two magnetically ordered
localized states, a ferromagnetic one with a larger volume γ1 and an antiferromagnetic
one with a smaller volume γ2, where the latter can become thermally
excited and thus compensate for the thermal expansion caused by the
phonons. At first modern electronic structure theory seemed to confirm this
main idea, when first-principles calculations of ferromagnetic γ-iron and ordered
Fe3Ni showed the existence of two magnetic states, a low-spin low-volume state
and a high-spin high-volume state (Williams et al. 1982; Moruzzi 1990). Close to
the Invar concentration the binding energy curves in these systems were found to
consist of two distinct branches with a small energy difference between their
respective minima, which has been viewed as a success of the 2γ-state model
(Wassermann 1990), although the calculations grossly overestimate the volume
decrease as a function of concentration.

On the other hand, there are several experimental observations that contradict the
2γ-state model. For example, the phase transition from the high-spin to the low-spin
state as a function of concentration should be first order (Akai et al. 1993;
Abrikosov et al. 1995), while experimentally it is not (see, e.g., the specific heat
results of Bendick and Pepperhof 1979). In addition, the two-minimum shape of the
binding energy curve should lead to discontinuities in the pressure dependence of
several physical properties which, however, have not been seen experimentally. It
must be said clearly that there is no experimental evidence whatsoever for the
existence of two separate magnetic phases. Also, the earlier theoretical results
(Williams et al. 1982; Moruzzi 1990), which originally seemed to support the
2γ-state model, turned out to be an artifact of the constraints applied during the
calculations. Unfortunately, and most probably because of its simple setup, the
2γ-state model is still employed to explain experimental results. Although it gives a
phenomenological explanation for the macroscopic properties of Invar alloys, it does
not make any direct connection with electronic structure theory.

The present microscopic understanding is based on the assumption of an inter- 
play between local magnetic moments, which gradually become disordered with 
increasing temperature. These local moments couple with the conduction elec- 
tron system, which itself becomes polarized and forms an itinerant component 
of the total magnetic moment, being responsible for the strong magnetovolume
coupling. The success of rather complex types of interactions like this has been
demonstrated by van Schilfgaarde et al. (1999), who in their calculations also al- 
lowed for noncollinear magnetic order. They find that at large volumes (corre- 
sponding to low temperature) the ground state is indeed given by parallel (ferro- 
magnetic) spin alignment. When they reduce the volume the spins gradually 
depart from parallel alignment so that the spin directions become increasingly 
disordered. Since spin disorder is normally caused by finite temperatures, their 
result shows that, as far as the magnetic part of the problem is concerned, in- 
creased temperature leads to a smaller equilibrium volume. For Invar alloys this 
increasing disorder leads to a decreasing volume which again compensates the 
vibrational thermal expansion in the desired way. 


14.2.2 
Spontaneous Volume Magnetostriction 


The central property which distinguishes Invar alloys from normal magnetic alloys
is the anomalously large spontaneous volume magnetostriction ω_s0. Figure
14.17 shows this behavior schematically. Below the magnetic ordering temperature
the fractional volume change departs strongly from the usual “nonmagnetic”
behavior of the lattice ω_lat (broken line).





Fig. 14.17 Schematic temperature dependence of the volume change (axes:
fractional volume change ω and magnetic moment versus temperature T).
The difference between the experimental volume and the hypothetical
nonmagnetic volume (broken line) at T = 0 K defines the spontaneous
volume magnetostriction ω_s0. For temperatures above T_c the normal
thermal expansion behavior due to the anharmonicity of the lattice
vibrations (phonons) is regained (Grüneisen behavior).
The difference between the actual volume and a hypothetical ω_lat defines the
spontaneous volume magnetostriction ω_s0 at T = 0 K. For temperatures above
T_c the normal thermal expansion behavior due to the anharmonicity of the lattice
vibrations (phonons) is regained (Grüneisen behavior). It is exactly this volume
anomaly which in Invar systems compensates for the thermal expansion due to
the phonons. Any theory which attempts to explain the Invar effect has thus to
account for the magnetic part of the volume expansion ω_m.


14.2.3 
The Modeling of Invar Properties 


As already stated earlier (Wassermann 1990), the name Invar is actually some- 
what misleading, since in Invar alloys one observes a multitude of related anoma- 
lies. According to Wassermann, a better name would be “moment-volume insta- 
bilities in 3d-element-rich systems”. In his review (Wassermann 1990) he also 
gives a list of Invar-related properties, which is partly repeated here (Table 14.6). 

An analytic treatment of the magneto-volume effects within the Landau theory
of phase transitions becomes relatively easy and straightforward if one describes
the magnetic and mechanical properties in the lowest order by assuming the
polynomial representation of Eq. (3), where the coefficients are related to physically
easily accessible quantities. $E_0$ is an (arbitrary) energy zero point, which can
also be taken as zero.

$$E(M, V) = E_0 + \frac{A}{2} M^2 + \frac{B}{4} M^4 + \beta V + \gamma V^2 + \delta M^2 V \qquad (3)$$


The equilibrium magnetic moment Mo and volume Vp at T = 0 K are given by 
Eqs. (4) and (5). 


Table 14.6 Invar-related properties.

Property                                          Variable
Thermal expansion                                 (Δl/l)(T), α(T)
Lattice constant                                  a(T)
Spontaneous volume magnetostriction               ω_s(T), ω_s(T = 0) = ω_s0
Forced volume magnetostriction                    (dω/dH)(T)
High-field susceptibility                         χ_hf(T)
Pressure dependence of the magnetization          -(dM/dP)
Pressure dependence of the Curie temperature      -(dT_c/dP)
Young’s and bulk modulus                          E(T), B(T)
Elastic constants                                 c_L(T), c_44(T), c'(T)
Magnetization                                     M(T)
$$M_0^2 = -\frac{A + 2\delta V_0}{B} \qquad (4)$$

$$V_0 = -\frac{\beta + \delta M_0^2}{2\gamma} \qquad (5)$$


The coefficients $A$ and $B$ are related to the susceptibility $\chi_0$ at T = 0 K
via Eqs. (6) and (7).

$$A = -\frac{1}{2\chi_0} - 2\delta V_0 \qquad (6)$$

$$B = \frac{1}{2\chi_0 M_0^2} \qquad (7)$$
$\gamma$ describes the harmonic part of the binding energy, whereas $\beta$, which depends
on the isotropic bulk modulus $B_0$, is responsible for the anharmonicity of the
binding energy [Eq. (8)].

$$B_0 = 2\gamma V_0 = -(\beta + \delta M_0^2) \qquad (8)$$


Finally, and most important for the Invar effect, there is the magneto-volume
coupling constant $\delta$, which is related to the critical pressure for the
disappearance of magnetism $P_c$. Thus all quantities are related to ground-state
properties of the respective alloy.
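The ground state of this Landau model can be obtained in closed form by minimizing the energy E = E0 + (A/2)M² + (B/4)M⁴ + βV + γV² + δM²V with respect to M and V simultaneously. The Python sketch below does this; the coefficient values used in the usage note are dimensionless illustrative assumptions (with δ < 0, so that the magnetic moment expands the lattice) and are not fitted to any alloy.

```python
def ground_state(A, B, beta, gamma, delta):
    """Return (M0_sq, V0) minimizing
    E = A/2*M^2 + B/4*M^4 + beta*V + gamma*V^2 + delta*M^2*V.

    The stationarity conditions are
        A + B*M0^2 + 2*delta*V0 = 0      (dE/dM = 0, M0 != 0)
        beta + 2*gamma*V0 + delta*M0^2 = 0   (dE/dV = 0)
    A stable minimum requires gamma > 0 and gamma*B - delta**2 > 0.
    """
    stability = gamma * B - delta**2
    if gamma <= 0.0 or stability <= 0.0:
        raise ValueError("coefficients do not describe a stable minimum")
    M0_sq = (delta * beta - gamma * A) / stability
    if M0_sq <= 0.0:
        # No spontaneous moment: only the lattice terms fix the volume.
        return 0.0, -beta / (2.0 * gamma)
    V0 = -(beta + delta * M0_sq) / (2.0 * gamma)
    return M0_sq, V0
```

For the assumed values A = -1, B = 1, β = -2, γ = 1, δ = -0.2 the magnetic volume comes out about 15% larger than the nonmagnetic volume -β/(2γ) = 1; in this toy model that difference plays the role of the spontaneous volume magnetostriction.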

The next and crucial step is to introduce effects of finite temperature. As long 
as we are only interested in the magnetic part of the thermal expansion, we can 
restrict ourselves to magnetic thermal excitations and add the phonon contribu- 
tion according to the Grüneisen relationship, which assumes that the thermal ex- 
pansion due to the phonons is proportional to the Debye law for the specific heat. 
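The phonon contribution mentioned above can be sketched by evaluating the Debye specific heat numerically and scaling it to an expansion coefficient. In the Python example below the Debye temperature and the high-temperature limit of the phonon expansion coefficient are free inputs (hypothetical parameters, not values from this chapter), and the proportionality constant of the Grüneisen relation is absorbed into that high-temperature limit.

```python
import math


def debye_heat_capacity_ratio(T, theta_D, steps=4000):
    """Debye specific heat C_V divided by its classical limit 3*N*k_B.

    Evaluates 3*(T/theta_D)**3 * integral_0^(theta_D/T) x^4 e^x/(e^x-1)^2 dx
    with a midpoint rule; the integrand is written with exp(-x) so that it
    stays finite for large x.
    """
    if T <= 0.0:
        return 0.0
    x_max = theta_D / T
    h = x_max / steps
    integral = 0.0
    for i in range(steps):
        x = (i + 0.5) * h
        one_minus_e = -math.expm1(-x)  # 1 - exp(-x), accurate for small x
        integral += x**4 * math.exp(-x) / one_minus_e**2
    return 3.0 * (T / theta_D) ** 3 * integral * h


def phonon_expansion(T, theta_D, alpha_high_T):
    """Phonon thermal expansion assumed proportional to the Debye C_V."""
    return alpha_high_T * debye_heat_capacity_ratio(T, theta_D)
```

The ratio tends to 1 well above the Debye temperature and follows the T³ law (4π⁴/5)(T/Θ_D)³ at low temperature, so the phonon expansion saturates at high T and vanishes as T approaches zero.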

An elegant way to introduce effects of finite temperature is to assume thermally 
induced fluctuations of the magnetic moment. One replaces the squared bulk 
magnetization $M^2$ by a new quantity that also contains the spin fluctuations 
$\langle m_\parallel^2 \rangle$ and $\langle m_\perp^2 \rangle$ parallel and perpendicular to the direction of M. 
The idea behind including magnetic fluctuations is that they mimic the effects 
of collective excitations. It should be noted that in our formulation magnetic 
fluctuations are treated classically, which allows only a high-temperature 
description of the magnetic properties; this, however, is appropriate for the 
present problem. A detailed derivation is given elsewhere (Mohn 2005). Including 
these magnetic fluctuations, the free energy expansion now reads as Eq. (9). 








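$$\begin{aligned}
F = E_0 &+ \frac{A}{2}\left(M^2 + \langle m_\parallel^2\rangle + 2\langle m_\perp^2\rangle\right) \\
&+ \frac{B}{4}\left[M^4 + M^2\left(6\langle m_\parallel^2\rangle + 4\langle m_\perp^2\rangle\right) + 8\langle m_\perp^2\rangle^2 + 3\langle m_\parallel^2\rangle^2 + 4\langle m_\parallel^2\rangle\langle m_\perp^2\rangle\right] \\
&+ \beta V + \gamma V^2 + \delta V\left(M^2 + \langle m_\parallel^2\rangle + 2\langle m_\perp^2\rangle\right)
\end{aligned} \qquad (9)$$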
The equations of state for the magnetic field H and the pressure P become Eqs. 
(10) and (11). 

$$H = AM + BM^3 + \frac{B}{2} M \left(6\langle m_\parallel^2\rangle + 4\langle m_\perp^2\rangle\right) + 2\delta V M \qquad (10)$$

$$-P = \beta + 2\gamma V + \delta\left(M^2 + \langle m_\parallel^2\rangle + 2\langle m_\perp^2\rangle\right) \qquad (11)$$


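At the Curie temperature $T_c$ the bulk magnetization M vanishes and the parallel 
and perpendicular magnetic fluctuations $\langle m_\parallel^2\rangle$ and $\langle m_\perp^2\rangle$ become equal. Under 
these conditions, the mean square of the fluctuating magnetic moment, $\langle m_c^2\rangle$, 
at $T_c$ is given by Eq. (12). 

$$\langle m_c^2\rangle = \frac{\gamma A - \delta(\beta + P)}{3\delta^2 - 5\gamma B} \qquad (12)$$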


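Since numerical evaluations have shown that, to a good approximation, $\langle m_\parallel^2\rangle$ 
varies linearly with temperature and that $\langle m_\parallel^2\rangle$ and $\langle m_\perp^2\rangle$ are essentially 
equal for an isotropic system, one can simplify Eq. (9) considerably by assuming 
Eq. (13) to hold. 

$$\langle m_\parallel^2\rangle = \langle m_\perp^2\rangle = \langle m_c^2\rangle(P = 0)\,\frac{T}{T_c(P = 0)} = \langle m_c^2\rangle(P)\,\frac{T}{T_c(P)} \qquad (13)$$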


We are now able to derive a number of important quantities for the description 
of Invar alloys [Eqs. (14)-(23)]. 

• The magnetic contribution $\alpha_m$ to the thermal expansion coefficient: 

$$\alpha_m = \frac{1}{3V}\frac{dV}{dT} = \frac{1}{3T_c}\,\frac{2B\delta\langle m_c^2\rangle}{A\delta - B(\beta + P)} \qquad (14)$$

• The critical pressure $P_c$ for the disappearance of magnetism: 

$$P_c = \frac{\gamma A - \delta\beta}{\delta} \qquad (15)$$

• The pressure dependence of the Curie temperature, which follows from the 
assumed pressure independence of $\langle m_c^2\rangle/T_c$ [Eq. (13)]: 

$$\frac{dT_c}{dP} = -\frac{T_c}{P_c} \qquad (16)$$
• The high-field susceptibility $\chi_{hf}$ taken at constant pressure: 

$$\chi_{hf} = \chi_P = \eta\,\chi_V \qquad (17)$$

$$\frac{1}{\chi_P} = -\frac{2\delta}{\gamma}\,P_c\left(1 - \frac{T}{T_c}\right) \qquad (18)$$

$$\frac{1}{\chi_V} = 2BM^2 \qquad (19)$$

$$\eta = \left(1 - \frac{\delta^2}{B\gamma}\right)^{-1} \qquad (20)$$

$\chi_V$ is the susceptibility at constant volume, and $\eta$ is a 
temperature-independent magneto-mechanical enhancement factor. It has been 
shown that for the Fe-Ni system, for example, $\eta$ becomes anomalously large in 
the vicinity of the Invar compositions (Shimizu 1981). 

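• The forced magnetostriction h: 

$$h = \frac{d\omega}{dH} = \frac{1}{V}\frac{dV}{dH} = \frac{1}{V}\frac{dV}{dM}\left.\frac{dM}{dH}\right|_P = \frac{d\omega}{dM}\left.\frac{dM}{dH}\right|_P = -\frac{\delta M}{\gamma V}\,\chi_P \qquad (21)$$

According to Eq. (18), for T = P = H = 0 Eq. (21) reduces to Eq. (22). 

$$h_0 = \frac{M_0}{2V_0}\,\frac{1}{P_c} \qquad (22)$$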


• The spontaneous volume magnetostriction $\omega_{s0}$, as described in Fig. 14.17, 
is then given by Eq. (23). 

$$\omega_{s0} = \omega_m(T = 0) - \omega_{nm}(T = 0) = \frac{\delta M_0^2}{\mathcal{B}_0} \qquad (23)$$

One finds that the figure of merit, $\omega_{s0}$, depends directly on the 
magneto-volume coupling constant $\delta$ and the equilibrium magnetic moment 
$M_0$. Since the bulk modulus $\mathcal{B}_0$ also enters Eq. (23), the desired large 
values of $\omega_{s0}$ require lattices with a reasonably small bulk modulus. 


It should be noted that all quantities depend essentially on ground-state proper- 
ties and that a few physically intuitive parameters are sufficient to allow a phe- 
nomenological description of the magneto—volume effects. 
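
To make the chain from the equations of state to the magnetic thermal expansion concrete, the following minimal sketch evaluates the model numerically. It assumes the reading of Eqs. (10)-(14) in which H = 0 and P = 0, the fluctuations obey the linear law of Eq. (13), and the Landau coefficients are hypothetical numbers in arbitrary units (they are not fitted to any alloy). The closed-form expressions inside `state` follow from eliminating $M^2$ between the two equations of state; they are an illustration, not the author's implementation.

```python
# Hypothetical Landau coefficients in arbitrary units (not fitted to any alloy).
A, B = -1.0, 1.0          # magnetic part: (A/2) M^2 + (B/4) M^4
BETA, GAMMA = -2.0, 1.0   # elastic part: beta V + gamma V^2
DELTA = -0.1              # magneto-volume coupling: delta M^2 V
T_C = 500.0               # assumed Curie temperature at P = 0, in K

DET = GAMMA * B - DELTA**2


def m2_curie():
    """Mean-square fluctuation <m_c^2> at T_c for P = 0 (Eq. 12)."""
    return (GAMMA * A - DELTA * BETA) / (3 * DELTA**2 - 5 * GAMMA * B)


def state(T):
    """Equilibrium (M^2, V) at H = P = 0 for 0 <= T <= T_C."""
    m2 = m2_curie() * T / T_C                 # linear fluctuation law (Eq. 13)
    V = (DELTA * A - B * BETA + 2 * B * DELTA * m2) / (2 * DET)
    M2 = -(A + 2 * DELTA * V) / B - 5 * m2    # from H = 0 in Eq. (10)
    return M2, V


def alpha_m():
    """Magnetic thermal-expansion coefficient in closed form (Eq. 14, P = 0)."""
    return 2 * B * DELTA * m2_curie() / (3 * T_C * (A * DELTA - B * BETA))


M2_0, V_0 = state(0.0)
M2_c, V_c = state(T_C)
alpha_fd = (V_c - V_0) / (3 * V_0 * T_C)      # finite-difference estimate

print(f"M0^2 = {M2_0:.4f}, V0 = {V_0:.4f}")
print(f"M^2 at T_c = {M2_c:.2e}")
print(f"alpha_m (closed form) = {alpha_m():.3e} per K")
print(f"alpha_m (finite diff) = {alpha_fd:.3e} per K")
```

With these illustrative numbers the magnetization vanishes at the assumed Curie temperature and the magnetic contribution to the expansion coefficient is negative, which is the sign needed to offset the ordinary phonon expansion.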
14.2.4 
A Microscopic Model 


When calculating $\omega_{s0}$ according to Eq. (23), it has to be assumed that the 
nonmagnetic reference volume is that of a fictitious nonmagnetic state. 
Nonmagnetic in this context means that there are no magnetic moments present. 
It is of course well accepted that the paramagnetic state (the state above the 
Curie temperature) in general can be seen as a state where the magnetic moments 
are completely disordered, rather than a state where all magnetic moments have 
vanished. A microscopic modeling of the Invar properties must therefore take 
this magnetically disordered state into account. An elegant way to simulate this 
magnetic state is provided by DLM (disordered local moment) calculations, which 
make it possible to model magnetic disorder on the basis of a CPA (coherent 
potential approximation) calculation (Cyrot 1970; Hubbard 1981; Gyorffy et al. 
1985; Khmelevskyi et al. 2003). The results from such a calculation are shown in 
the upper panel of Fig. 14.18, where the spontaneous volume magnetostriction 
$\omega_{s0}$ is plotted as a function of the average valence electron number of the 
binary alloys Fe-Co, Fe-Pt, and Fe-Pd. It is immediately seen that for all alloys 
there exists a common maximum of $\omega_{s0}$ around 8.4 electrons/atom. The lower 
panel of Fig. 14.18 depicts the respective ferromagnetic (FM) ground-state 
moments and the DLMs (magnetic moments in the fully disordered state). 

Fig. 14.18 Calculated volume magnetostriction $\omega_{s0}$ (upper panel) and 
local atomic moments of Fe alloys (lower panel). For the local moments 
full symbols refer to the ferromagnetic and open symbols to the 
disordered local moment state, respectively. 

Fig. 14.19 Slater-Pauling curve for ferromagnetic transition metal alloys. 

The maximum in $\omega_{s0}$ correlates with the maximum in the difference between 
the FM moment and the DLM, which resembles the result given in Eq. (23). This 
behavior is also in agreement with the Slater-Pauling curve (Fig. 14.19), where 
one plots the magnetic moment of an alloy as a function of the valence electron 
concentration. The general behavior again shows a maximum around 8.4 
electrons/atom, where the magnetic behavior changes from weakly to strongly 
ferromagnetic. 

One can thus conclude that Invar behavior always occurs if an alloy is on the 
verge of this change in the magnetic behavior and at the same time exhibits a 
large enough magnetic ground-state moment. In this sense Invar is not a singu- 
lar feature of some alloys, but rather a general property, which, if all prerequisites 
are fulfilled, leads to the observed temperature independence of the thermal ex- 
pansion coefficient. 


14.2.5 
Outlook 


Although it has been ongoing for more than a century, Invar research is still a 
highly active field. In particular, modern preparation methods allow the 
production of tailor-made properties and applications, e.g., as temperature-driven 
actuators in micromachinery. A rather new field is that of the anti-Invar systems, 
which exhibit an exceptionally large thermal expansion coefficient, an effect 
which often occurs together with antiferromagnetic order (e.g., in fcc Fe). 
14.3 


Magnetic Media 
Laurent Ranno 


14.3.1 
Data Storage 


14.3.1.1 Information Storage 

In our technological age, a huge amount of information needs to be stored in 
digital form. The trend is exponential and shows no sign of slowing down. This 
is the driving force in the field of data storage and makes it a fast-evolving 
industry. In microelectronics, Moore's law is the well-known self-replicating 
trend which predicts that the number of transistors per unit area will double 
every 18 months. In the last decade the growth of hard disk drive (HDD) bit 
density has been faster than that of microelectronics and has reached a 100% 
increase in bit density per year. A density of 100 gigabits/in$^2$ (Gb in$^{-2}$), 
originally thought to be a dream value, was reached in laboratories in 2002 and 
is already available in commercial HDDs. The new target is 1 terabit/in$^2$ 
(Tb in$^{-2}$; 1 in = 2.54 cm). Data storage is a large industry and many 
technologies compete. The life cycle of technologies is only a few years long and 
innovation is constant. Before discussing the field of high-density magnetic 
recording, let us detail some of the criteria that govern storage technologies. 


Data The quantity of bits to store is a first parameter. A microprocessor will deal 
with 32 or 64 bits at a time and a few megabits (Mb) will be stored in cache 
memory. One hour of music (i.e., a CD-ROM) will be equivalent to a few hundred 
megabytes (MB) while one hour of video (i.e., a DVD-ROM) will need a few gigabytes 
of storage. The typical capacity of a personal computer hard disk drive has now 
reached 100 GB. A small archiving system will deal with terabytes ($10^{12}$ bytes) 
and a large archiving system will store petabytes ($10^{15}$ bytes). The total 
capacity of hard disk drives produced per year is already a few exabytes 
($10^{18}$ bytes). 


Data access Not only do data need to be stored but they must also be accessed. 
Sequential access (e.g., tapes) and random access (e.g., random access memory, 
RAM) are two possibilities, the choice depending on the use. Sequential access 
is suitable for archiving or to watch a video, whereas accessing files on a hard 
disk needs random access, as any bit can be required at any moment. The access 
time and the access rate also matter. The timescale can span orders of magnitude 
from minutes to access a tape to nanoseconds to read an individual RAM cell, 
and the required data flow can reach gigabytes per second. 


Volatility Data storage should show a certain lack of volatility. However, nonvola- 
tility will be designed taking into consideration the relevant time scale. This can 
span orders of magnitude, from years in the archiving business down to micro- 
seconds in computing. 


14.3.1.2 Competing Physical Effects 


Bistable state A digital memory requires two well-defined and stable states to 
store a bit. Many physical effects offer such configurations. Bits can be stored us- 
ing two physical levels by mechanically engraving the storage medium: bumps on 
a CD-ROM or DVD-ROM, tracks on old-fashioned LPs (analogue storage). Optical 
properties of the media can be modified to create reflecting/nonreflecting bits 
(recordable (CD-R) and erasable (CD-RW)) or bits with opposite magneto-optical 
properties (magneto-optical storage). The data can also be materialized by the 
presence or absence of an electrical charge as in semiconductor memories 
(SRAM, DRAM, flash), the presence or absence of electrical polarization like fer- 
roelectric RAM (FeRAM), the crystallized or amorphous nature of an alloy as in 
phase change media (PCRAM memory) discussed in Section 14.5, and of course 
the direction of a magnetic moment (magnetic tapes, hard disk media, magnetic 
RAM), which is the subject of this section of Chapter 14. 


Reading technique Depending on the way data have been stored and on data 
density (bit size), the appropriate reading technique is required. Mechanical bits 
(LPs) were read by using a mechanical contact; current CD-ROMs are read by using 
a focused infrared laser reflected from bumps in polycarbonate (bump height 
is a quarter of the wavelength in polycarbonate) so that the laser beam undergoes 
destructive interference when reflected from the edge of a bump. Optical bits are 
also read by measuring a reflected laser beam. Electrically stored bits can be read 
because they control (open or closed) the channel of a transistor, by monitoring 
the discharge of a capacitor or by sensing the electrical resistance of the bit 
(phase change RAM). Magnetic bits are read by sensing the stray magnetic field 
above them, or more directly by monitoring changes of the polarization of a laser 
(magneto-optical effects) or by measuring their resistance (magnetoresistive ef- 
fect). Unexpected physical effects can be used in innovative systems such as the 
thermal dissipation change used in IBM’s Millipede program. 
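
The quarter-wavelength condition for CD bumps quoted above can be checked with a short calculation. The sketch below assumes a refractive index of about 1.55 for polycarbonate (a typical value that is not given in the text) and the 780 nm infrared wavelength used for CDs; the function names are illustrative only.

```python
import math

N_POLYCARBONATE = 1.55   # assumed refractive index, not stated in the text
CD_WAVELENGTH_NM = 780.0 # infrared CD laser, vacuum wavelength


def quarter_wave_height(vacuum_wavelength_nm, refractive_index):
    """Bump height equal to a quarter of the wavelength inside the medium."""
    return vacuum_wavelength_nm / (4.0 * refractive_index)


def round_trip_phase(height_nm, vacuum_wavelength_nm, refractive_index):
    """Phase lag (rad) of light reflected from the land relative to the bump top.

    The extra path is twice the bump height, travelled inside the medium.
    """
    optical_path = 2.0 * height_nm * refractive_index
    return 2.0 * math.pi * optical_path / vacuum_wavelength_nm


height = quarter_wave_height(CD_WAVELENGTH_NM, N_POLYCARBONATE)
phase = round_trip_phase(height, CD_WAVELENGTH_NM, N_POLYCARBONATE)

print(f"bump height = {height:.0f} nm")
print(f"phase difference = {phase / math.pi:.2f} pi")
```

A phase difference of pi between the two reflected parts of the beam is what produces the destructive interference at a bump edge.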


Characteristics of magnetic recording A ferromagnetic material can possess a well- 
defined axis (the easy axis) along which its magnetic moments will align prefer- 
entially. The two possible directions will define two stable states. When the easy 
axis is well defined (large remanence of the magnetization along the easy axis in 
the absence of an applied field) and if the field necessary to reverse the magnetic 
moment (the coercive field) is large enough, the material will be called a perma- 
nent magnet and can be used to store information. The minimum size of a stable 
magnetic particle is less than 10 nm (depending on the material), which trans- 
lates directly into high-density storage, and the intrinsic switching time of a parti- 
cle magnetization is in the nanosecond range, which leads to short data read and 
write times. 


14.3.1.3 Magnetic Storage 
14.3.1.3.1 Aspects of magnetism for storage 


Magnetization Ferromagnetic materials carry a spontaneous magnetic moment 
m (in A m$^2$). Normalized by the sample volume it is called the magnetization M 
(in A m$^{-1}$), which is also written as $J = \mu_0 M$ (in T). The value of magnetization 
depends on the nature of the magnetic elements. When an atom carries a 
magnetic moment, partially filled electronic shells must exist. Two series of 
elements with partially filled shells are at the origin of magnetism in most 
magnetic materials: 3d transition metals and 4f rare earths. Possessing magnetic 
atoms is not the only condition for observing ferromagnetism. Interaction between 
magnetic moments must exist and parallel coupling must dominate. Antiparallel 
coupling will induce antiferromagnetic order resulting in no net magnetic moment, 
i.e., no magnetization. Only a few elements are ferromagnetic at room temperature 
(3d transition metals Ni, Co, and Fe), iron being the most magnetic of them 
($\mu_0 M$ = 2.2 T). Of course the list of ferromagnetic materials is much longer and 
numerous alloys and compounds are ferromagnetic (Table 14.7). In ferromagnetic 
materials, magnetization is reduced to 0.5 T for oxides (oxygen does not 
carry any magnetic moment) and may increase to 2.4-2.5 T for FeCo alloy or 
Fe nitride. The magnetic state of the elements does not always give clues about 
the magnetic state of a compound. Cr is antiferromagnetic but CrO$_2$ is a 
ferromagnetic oxide used in tape recording. FeMn and NiO are antiferromagnetic. In 




Table 14.7 Curie temperature and magnetization of selected materials. 

Material  Fe  Co  Ni  Fe$_{50}$Pt$_{50}$ 
Magnetization $\mu_0 M$ [T] at 300 K  2.16  1.72  0.61  1.43 
Curie temperature [K]  1043  1394  631  750 

Material  Fe$_{50}$Co$_{50}$  BaFe$_{12}$O$_{19}$  Co$_{68}$Cr$_{20}$Pt$_{12}$  Ni$_{80}$Fe$_{20}$ 
Magnetization $\mu_0 M$ [T] at 300 K  2.5  0.48  0.63  1.04 
Curie temperature [K]  1253  723  595 





recording, high magnetization is not a necessary requirement. When stray fields 
are being read, the signal is proportional to magnetization, but for magneto- 
optical media, magneto-optical properties of the material will matter and they do 
not scale as magnetization. In both cases large magnetizations induce large 
demagnetizing effects, which have to be addressed for long-term bit stability. 


Curie temperature Spontaneous magnetization $M_s$ is the value of magnetization 
in a magnetic domain in the absence of an applied field. It is the value which 
quantifies the magnetic order in zero field. $M_s$ decreases with increasing 
temperature and disappears at the Curie temperature $T_c$ (Fig. 14.20). $T_c$ 
characterizes the phase transition between long-range magnetic order and disorder, 
i.e., the competition between the ferromagnetic exchange energy, characterized by 
the exchange constant A (in pJ m$^{-1}$), and the thermal energy kT. The exchange 
energy represents the cost of non-uniform magnetization, $E_{exchange}$ [Eq. (24)]. 

$$E_{\mathrm{exchange}} = \int_V A \left( \nabla \frac{M_i}{M_s} \right)^2 dV \quad \text{and} \quad i = x, y, z \qquad (24)$$

$T_c$ should be well above room temperature (i.e., >450-500 K) for most 
applications, since not only magnetization but also anisotropy and coercive field 
go to zero at $T_c$. The nonmagnetic elements (Ta, Cr, B) which are added to some 
alloys to control their microstructure can cause a decrease of both $T_c$ and 
magnetization. 


Fig. 14.20 Temperature dependence of spontaneous magnetization. (Axes: 
spontaneous magnetization versus temperature, with $T_c$ marked on the 
temperature axis.) 
Anisotropy – magnetic easy axis To be considered as a permanent magnet with two 
stable antiparallel configurations, the material should develop a magnetic easy 
axis. The required anisotropy has several possible origins: shape anisotropy, 
crystalline anisotropy, or induced anisotropy. An elongated shape favors 
the alignment of magnetization along the long axis, e.g., in the needle-like 
particles used for low-density storage (tapes). Tetragonal or hexagonal (e.g., 
cobalt) crystallographic structures can induce a strong easy axis along the c axis. 
Such a strong magnetocrystalline anisotropy cannot exist in a cubic structure, 
which is too symmetrical. Deposition or thermal annealing under magnetic field may 
slightly order a disordered alloy and induce an easy axis (Ni$_{80}$Fe$_{20}$). When an 
easy axis exists, the uniaxial anisotropy is characterized by the anisotropy 
constant K (in J m$^{-3}$) according to Eq. (25), $\theta$ being the angle between the 
magnetization and the easy axis. 

$$E_K = K \sin^2\theta \qquad (25)$$

K > 0 leads to an easy axis and K < 0 to a hard axis (easy plane), which is not 
relevant for storage media. 


Shape – demagnetizing effects When a particle or a sample is uniformly 
magnetized, the exchange energy is a minimum. However, the stray magnetic field 
around the sample represents a large magnetic energy stored in space. To evaluate 
the stray field, a mathematical analogy with electrostatics can be used, where 
the magnetization is replaced by surface pseudo-charges (density $\sigma = \mathbf{M} \cdot \mathbf{n}$) 
and volume pseudo-charges (density $\rho = -\mathrm{div}\,\mathbf{M}$). The energy balance for a 
large sample usually gives the stable state of a ferromagnetic sample as the 
magnetic configuration with the minimum stray field, i.e., the demagnetized state. 
To get zero net magnetic moment without paying too much exchange energy, magnetic 
domains with opposite magnetizations are created. Between domains, the region 
where the magnetization rotates is called a domain wall. The width $\delta$ and the 
energy $\gamma$ of a domain wall are mainly determined by the exchange energy A, which 
favors smooth transitions, and the anisotropy energy K, which favors sharp 
transitions [Eqs. (26)]. 

$$\delta\,(\mathrm{m}) = \pi \sqrt{\frac{A}{K}} \quad \text{and} \quad \gamma\,(\mathrm{J\,m^{-2}}) = 4\sqrt{AK} \qquad (26)$$

Depending on the domain wall geometry, both prefactors in Eqs. (26) may vary 
(here both prefactors, $\pi$ and 4, correspond to a 180° Bloch wall). A typical value 
for A is 10 pJ m$^{-1}$. For cobalt (K = 0.5 MJ m$^{-3}$) it gives rise to domain walls of 
width 14 nm and wall energy 9 mJ m$^{-2}$. 
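
The cobalt numbers quoted above can be reproduced directly from the two wall formulas. The following sketch uses only the values given in the text (A = 10 pJ/m, K = 0.5 MJ/m^3) and the prefactors for a 180° Bloch wall; the function name is illustrative.

```python
import math


def bloch_wall(exchange_A, anisotropy_K):
    """Width (m) and energy (J/m^2) of a 180-degree Bloch wall."""
    width = math.pi * math.sqrt(exchange_A / anisotropy_K)
    energy = 4.0 * math.sqrt(exchange_A * anisotropy_K)
    return width, energy


A_CO = 10e-12   # exchange constant, J/m (typical value quoted in the text)
K_CO = 0.5e6    # anisotropy constant of cobalt, J/m^3

width, energy = bloch_wall(A_CO, K_CO)
print(f"wall width  = {width * 1e9:.0f} nm")
print(f"wall energy = {energy * 1e3:.0f} mJ/m^2")
```

Because the width scales as the square root of A/K and the energy as the square root of AK, a harder material (larger K) has narrower but more costly walls.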

A direct consequence of demagnetizing effects is that at zero field, the 
measured magnetization is usually lower than the spontaneous magnetization (Fig. 
14.21). It is called $M_r$, the remanent magnetization, and this is the 
magnetization which is sensed by the read head. The field needed to reverse half 
the magnetization (i.e., to reach zero net magnetization) is the coercive field 
$H_c$ (in A m$^{-1}$). 

Fig. 14.21 Magnetization as a function of magnetic field for a ferromagnet. 

The art of permanent magnet design is to prevent demagnetizing fields from 
demagnetizing the magnet. Limiting domain nucleation or pinning domain walls 
to impede domain wall propagation are possible strategies. 


Magnetic states As far as recording is concerned, only two magnetic states per 
particle are required. Once an easy axis has been established, the particle size 
should be reduced below a critical size: the single domain size. This critical size 
represents the limit below which a multidomain magnetic configuration becomes 
unfavorable. Since the domain wall energy scales as a surface and the 
demagnetizing energy scales as a volume, below the critical size the cost of 
introducing a domain wall will not be balanced by the gain in demagnetizing 
energy. For a spherical particle (radius r), assuming that introducing one domain 
wall (cost $\gamma \pi r^2$) reduces the demagnetizing energy by 50% (gain 
$\frac{1}{12}\mu_0 M_s^2 V$), one gets the critical radius $r_c$ for a spherical 
particle from Eq. (27) (compare Table 14.8). 

$$r_c = \frac{9\gamma}{\mu_0 M_s^2} \qquad (27)$$
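
As an illustration of the single-domain estimate, the sketch below evaluates the balance between wall cost and demagnetizing gain for cobalt. It assumes the critical-radius expression $r_c = 9\gamma/(\mu_0 M_s^2)$, the wall energy of 9 mJ/m^2 quoted earlier for cobalt, and the room-temperature value $\mu_0 M$ = 1.72 T from Table 14.7; the result is an order-of-magnitude estimate only.

```python
import math

MU0 = 4e-7 * math.pi   # vacuum permeability, T m/A


def single_domain_radius(wall_energy, mu0_Ms):
    """Critical radius (m) below which a sphere stays single-domain.

    wall_energy : domain wall energy in J/m^2
    mu0_Ms      : saturation polarization mu0*Ms in tesla
    """
    Ms = mu0_Ms / MU0                      # magnetization in A/m
    return 9.0 * wall_energy / (MU0 * Ms**2)


def energies(radius, wall_energy, mu0_Ms):
    """Wall cost and demagnetizing gain (J) for a sphere of given radius."""
    Ms = mu0_Ms / MU0
    volume = 4.0 / 3.0 * math.pi * radius**3
    cost = wall_energy * math.pi * radius**2
    gain = MU0 * Ms**2 * volume / 12.0     # half of (1/6) mu0 Ms^2 V
    return cost, gain


r_c = single_domain_radius(wall_energy=9e-3, mu0_Ms=1.72)
cost, gain = energies(r_c, 9e-3, 1.72)
print(f"critical radius for Co = {r_c * 1e9:.0f} nm")
```

At the critical radius the two energies are equal by construction, which is a quick way to confirm that the prefactor 9 follows from the stated cost and gain.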


Magnetization reversal Good recording media must possess properties quite sim- 
ilar to those of a permanent magnet. However, in most media, information is to 
be erased and rewritten. This property implies that an applied magnetic field 
larger than the coercive field must be produced by the write head. This limits 
the upper value of the coercive field of recording media. The coercive field of 
high-density media has recently reached $\mu_0 H_c$ = 0.5 T. 


Table 14.8 Single domain radii for selected ferromagnetic materials. 


Fig. 14.22 Write head geometry in the case of longitudinal and perpendicular 
recordings. (Panels: longitudinal geometry; perpendicular geometry.) 


Write head The principle of the write head is that of an electromagnet, which in- 
cludes a soft magnetic circuit, a gap, and windings (Fig. 14.22). For longitudinal 
recording the applied field must be parallel to the medium surface which is next 
to the gap. For perpendicular recording, the field must be perpendicular to this 
surface. The B flux is applied through a narrow gap to the medium. Since induc- 
tion lines are closed loops, the flux is guided through a soft underlayer (SUL) and 
comes back to the head through a second gap, which must be wide enough not to 
overwrite written bits. B flux is conserved, so a wide gap means a smaller field. In 
perpendicular recording, the magnetic bits are effectively in the gap of the write 
head, which allows larger fields to be applied. 


Reversal dynamics The dynamics of magnetic moment reversal is governed by 
Larmor precession. In an applied field the magnetic moment precesses around 
the field and, because of damping, gradually aligns with it. The Larmor 
precession frequency of an electron spin is about 28 GHz in a 1 T field. More 
quantitatively, the dynamics of magnetization follows the Landau-Lifshitz 
equation [Eq. (28)], which includes a precession term and a damping term. 

$$\frac{\partial \mathbf{M}}{\partial t} = \gamma\, \mathbf{M} \times \mathbf{H}_{\mathrm{eff}} + \alpha\, \mathbf{M} \times (\mathbf{M} \times \mathbf{H}_{\mathrm{eff}}) \qquad (28)$$

In Eq. (28), $\gamma$ is the gyromagnetic ratio, $\alpha$ is the damping constant, and 
$\mathbf{H}_{\mathrm{eff}}$ is the effective field, given by Eq. (29). 

$$\mathbf{H}_{\mathrm{eff}} = -\frac{1}{\mu_0} \frac{\partial E}{\partial \mathbf{M}} \qquad (29)$$

The effective field includes contributions from the applied field (Zeeman energy), 
the demagnetizing field (shape anisotropy), and magnetocrystalline and exchange 
energies. The demagnetizing field is not a local field, which makes micromag- 
netic calculations quite elaborate for objects too large to be considered as a single 
magnetic moment (macrospin approximation). 
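
The interplay of the precession and damping terms can be illustrated with a single macrospin in a static field. The sketch below is a hedged example, not the chapter's method: it integrates the normalized Landau-Lifshitz equation in one common sign convention (which may differ from that of Eq. (28)), takes the effective field to be just an applied induction B = 1 T along z, and uses a hypothetical damping constant of 0.1 with a simple midpoint integrator.

```python
import math

GAMMA_E = 1.760859e11   # electron gyromagnetic ratio, rad s^-1 T^-1


def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def ll_rhs(m, b_field, alpha):
    """dm/dt = -gamma m x B - alpha gamma m x (m x B) for a unit vector m."""
    mxb = cross(m, b_field)
    mxmxb = cross(m, mxb)
    return tuple(-GAMMA_E * (p + alpha * d) for p, d in zip(mxb, mxmxb))


def relax(m0, b_field, alpha, dt, steps):
    """Integrate with midpoint steps, renormalizing |m| = 1 after each step."""
    m = m0
    for _ in range(steps):
        k1 = ll_rhs(m, b_field, alpha)
        mid = tuple(mi + 0.5 * dt * ki for mi, ki in zip(m, k1))
        k2 = ll_rhs(mid, b_field, alpha)
        m = tuple(mi + dt * ki for mi, ki in zip(m, k2))
        norm = math.sqrt(sum(c * c for c in m))
        m = tuple(c / norm for c in m)
    return m


B_FIELD = (0.0, 0.0, 1.0)        # applied induction in tesla (assumed)
ALPHA = 0.1                      # hypothetical damping constant
m_final = relax((1.0, 0.0, 0.0), B_FIELD, ALPHA, dt=1e-13, steps=10_000)
print("m after 1 ns:", tuple(round(c, 4) for c in m_final))
```

Starting perpendicular to the field, the moment spirals toward the field direction; with this damping it is essentially aligned after 1 ns, consistent with nanosecond-scale switching of a single-domain particle.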

Subnanosecond reversals of submicronic magnetic elements have been ob- 
tained experimentally. The reversal mechanism for larger particles involves 
domain wall propagation, which is a much slower mechanism (an upper limit 
for domain wall velocity is the velocity of sound in the material). 


Magnetostriction When magnetization rotates, the material experiences 
distortions because of spin-orbit coupling, which couples spin moments and 
orbital moments (i.e., the electron distribution). $\lambda_\parallel$ and $\lambda_\perp$ are 
defined as the magnetostriction coefficients parallel and perpendicular to the 
field direction. High magnetostriction induces stress and can lead to mechanical 
failure. Repetitive magnetic reversals therefore require materials with low 
magnetostriction coefficients. 


14.3.1.3.2 Read Head 
One option for reading information stored as a magnetic bit is to sense the stray 
magnetic field. Any ferromagnetic material will create a magnetic field (called the 
demagnetizing field within the material and the stray field outside the material). 
An electrostatic analogy may be used. Magnetic pseudo-charges are created when 
the magnetization has a component perpendicular to a surface or when the diver- 
gence of magnetization is not zero. In longitudinal recording (Fig. 14.23), magne- 
tization lies in the plane of the tape or disk and no surface pseudo-charges exist, 
but pseudo-charges will be present in the transition region between two antipar- 
allel magnetizations (non-uniform M). 

In principle the read head will then act as a magnetic field sensor. One of two 
physical effects is exploited: induction or magnetoresistance. 


Induction When a relative movement between the head and the medium exists, 
an inductive head may be used. The signal is the induced voltage in a pick-up 
coil, given by Eq. (30). 

$$V = -\frac{d(\mathbf{B} \cdot \mathbf{S})}{dt} \qquad (30)$$

The main limitation of this sensing technique is miniaturization: the inductive 
signal scales as the surface of the pick-up coil. 

Fig. 14.23 Stray magnetic fields above a longitudinal recording medium. 


Magnetoresistance For high-density media (HDD), a magnetoresistive (MR) effect 
is preferred. The MR element can be downsized while keeping its resistance 
constant. The relative head-medium velocity no longer has a direct influence on 
the signal amplitude. Nevertheless, high velocity is still required in order to 
keep a fast access time. 

The reading resolution must be adapted to the bit size, i.e., thin film 
technology has to be used. The fly height of the sensing element also matters. 
For tapes, a clean and controlled environment is not possible. The preferred 
situation is to have a mechanical contact between the head and the medium (which 
limits velocity and requires a lubrication layer). In an HDD, the environment is 
controlled (no dust particles), and the head can fly over the disk without 
permanent mechanical contact, which results in faster movements (10 000 rpm and 
r = 3 cm gives 30 m s$^{-1}$). 
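
The quoted head-medium velocity is simple rotational kinematics. The sketch below reproduces it and, as an added illustration, estimates how long a single bit spends under the head, assuming the bit length of roughly 30 nm mentioned for 100 Gb per square inch media.

```python
import math


def linear_velocity(rpm, radius_m):
    """Track velocity (m/s) at a given radius for a disk spinning at rpm."""
    return 2.0 * math.pi * radius_m * rpm / 60.0


v = linear_velocity(rpm=10_000, radius_m=0.03)
bit_length = 30e-9                 # assumed bit length along the track, m
bit_time = bit_length / v          # time one bit spends under the head, s

print(f"velocity = {v:.1f} m/s")
print(f"time per bit = {bit_time * 1e9:.2f} ns")
```

A dwell time of about a nanosecond per bit shows why the intrinsic switching and sensing times of the magnetic elements must themselves be in the nanosecond range.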


Magneto-optical effect Magneto-optical effects may also be used to read magnetic 
bits. A ferromagnetic material does not obey time-reversal symmetry. Its dielectric 
tensor has nonzero off-diagonal elements which are the source of magneto-optical 
effects. When a linearly polarized laser beam is reflected perpendicularly 
to the magnetic surface (polar Kerr geometry), the plane of polarization of the 
reflected beam rotates. The rotation angle $\theta_K$ is $Q M_\perp$, with Q the Verdet 
constant and $M_\perp$ the perpendicular component of magnetization. Q depends on 
the material and on the wavelength. Shorter wavelengths will allow smaller bits to 
be read. However, since magneto-optical (M-O) properties strongly depend on 
wavelength, a change of laser requires a change in materials. Commercial M-O disks 
use lasers similar to those used for CDs and DVDs. So the trend is to go from 
infrared (780 nm) to red (650 nm) and now blue lasers (405 nm). 

The application of the polar Kerr effect to recording necessitates the use of 
magnetic media with perpendicular anisotropy, and the large magnetization crite- 
rion will be replaced by a large Kerr angle criterion. More precisely the figure of 
merit is the intensity of the rotated polarization, which includes reflectivity and 
Kerr angle (a large Kerr angle with small reflectivity gives a poor signal/noise 
ratio). 


Stability: superparamagnetism A reduction in the size of magnetic bits requires a 
decrease in the size of crystallographic grains. For uniaxial materials, the 
anisotropy energy varies as $KV \sin^2\theta$, V being the volume of the grain 
(Fig. 14.24). Keeping the anisotropy of the crystal constant means that the energy 
barrier KV which separates the two stable states ($\theta = 0$ and $\theta = \pi$) becomes 
smaller as the grain volume decreases. 

When a ferromagnetic particle is small enough, it may behave as a paramag- 
netic particle, i.e., as if it had no permanent magnetic moment. This behavior is 
called superparamagnetism. Experimentally it is observed when the characteristic 
time for magnetic moment measurement is longer than the characteristic time 
Fig. 14.24 Energy stability as a function of the magnetization 
orientation of a single domain particle. (Labels in the figure: stable, top of 
energy barrier, stable.) 


Table 14.9 Critical grain diameter for a ten-year stability versus thermal reversal at 300 K. 

Material                  Fe    Co    PtFe    Ni
Critical diameter [nm]    22    10    4       50





for thermally induced magnetic reversal. For recording it means that the 
information is lost before it is read. Superparamagnetism occurs when the thermal 
energy kT is sufficiently large compared to KV for the magnetization reversal 
probability to be non-negligible in the absence of any applied field. The 
characteristic time $\tau$ for a reversal follows an Arrhenius law, Eq. (31), where 
$1/\tau_0$ is the attempt frequency ($10^9$-$10^{10}$ Hz). 

$$\tau = \tau_0\, e^{KV/kT} \qquad (31)$$

The critical grain size for recording is calculated using a ten-year limit for 
temperature-induced reversal of a grain's magnetization at 300 K (Table 14.9). 
There are typically 100 grains per magnetic bit, so that the reversal of a few 
grains does not destroy the information. In such conditions, the minimum volume of 
a grain is given by KV = 60 kT. 

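
The stability criterion can be turned into a grain size with a few lines of arithmetic. The sketch below assumes spherical grains (a simplification not stated in the text), an attempt time of 1 ns, and the cobalt anisotropy constant K = 0.5 MJ/m^3 quoted earlier; the function names are illustrative.

```python
import math

K_B = 1.380649e-23   # Boltzmann constant, J/K


def retention_time(K, volume, T, tau0=1e-9):
    """Arrhenius reversal time (s) for an energy barrier K*V."""
    return tau0 * math.exp(K * volume / (K_B * T))


def critical_diameter(K, T=300.0, barrier_ratio=60.0):
    """Diameter (m) of a spherical grain with K*V = barrier_ratio * k*T."""
    volume = barrier_ratio * K_B * T / K
    return (6.0 * volume / math.pi) ** (1.0 / 3.0)


TEN_YEARS = 10 * 365.25 * 24 * 3600.0            # seconds
ratio_ten_years = math.log(TEN_YEARS / 1e-9)     # K*V/kT for one grain, tau0 = 1 ns

d_co = critical_diameter(K=0.5e6)
print(f"K*V/kT for a ten-year reversal time: {ratio_ten_years:.1f}")
print(f"critical diameter for Co at 300 K:   {d_co * 1e9:.1f} nm")
```

For cobalt this gives a diameter close to the 10 nm listed in Table 14.9. A single grain with a 1 ns attempt time would need a ratio of only about 40 for a ten-year reversal time, so the criterion of 60 kT leaves a margin.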
Media noise The write head does not control the position of the written magnetic 
bits with respect to the crystallographic grains. The transition width between two 
bits will be at least as large as the grain size (Fig. 14.25). A distribution in 
the orientation of the anisotropy axes will also contribute to media noise. On the 
one hand, narrow transition regions are required to have well-defined magnetic 
bits and clear signals to read. On the other hand, sharp transitions lead to large 
volume pseudo-charges and thus to large demagnetizing fields. If one assumes 
that magnetization in the transition region can be written as Eq. (32), with a the 
transition width, then the demagnetizing field $H_d$ created by the magnetic 
pseudo-charges associated with this non-uniform magnetization is given by Eq. 
(33), t being the thickness of the longitudinal medium. Since the demagnetizing 
field must be smaller than the coercive field, the transition width has to be kept 
larger than $M_r t/H_c$. Small $M_r t$ and large $H_c$ are required to achieve 
high-density recording. 

$$M(x) = \frac{2M_r}{\pi} \arctan\left(\frac{x}{a}\right) \qquad (32)$$

$$H_d = -\frac{2M_r t}{\pi a}\,\frac{x/a}{1 + x^2/a^2} \qquad (33)$$

Fig. 14.25 Media noise associated with bit transition width. 

14.3.2 
Magnetic Recording Media 


14.3.2.1 Particulate Media 

A magnetic storage medium consists of ferromagnetic particles. The position of 
the magnetic bit transition is not predetermined (the write head does not aim at 
magnetic grains), so neighboring grains should be magnetically decoupled to be 
able to locate the bit transition anywhere. The first magnetic media used were 
magnetic powders screen-printed onto a substrate. This method is still in use for 
low-density applications (floppy disks, audio and video tapes). The particle size is 
submicronic. The origin of the particle anisotropy gives two families of particles. 
For materials with small magnetocrystalline anisotropy, shape anisotropy is 
required and particles are needle-like (e.g., CrO$_2$, $\gamma$-Fe$_2$O$_3$, Fe). For systems 
with large magnetocrystalline anisotropy, particles can be spherical (e.g., 
BaFe$_{12}$O$_{19}$ hexagonal ferrites). Particulate media can be screen-printed onto large areas and 
onto flexible media (tapes). However, the volume fraction of magnetic materials is 
limited to about 60%. To improve this fraction, continuous film materials have 
been developed. 

Floppy disks and audio tapes have been around for many years (1.44 MB 3.5 in 
floppy, IBM, 1987). Recent improvements have led to 100-250 MB floppy disks 
and high-density digital tapes. The typical characteristics of present-day tapes 
are as follows. A polymer binder 100 nm thick (charged with elongated iron-based 
particles 60 nm long) is coated onto a polyethylene substrate (5-10 µm thick). A 
prototype 1 TB tape makes it possible to write bits 76 nm long and to reach 1 
Gb in$^{-2}$ storage density. Positioning the head on a floppy medium is less accurate 
than on an HDD, which leads to wider tracks. However, the linear character of a 
tape means that multiple tracks can be written and read in parallel. 
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14.3.2.2 Continuous Media — Film Media 

Particulate media (particles, screen-printed to a tape) allow low density recording. 
Higher density requires the use of continuous media (films). Crystallographic 
grains must be magnetically decoupled, which, in a continuous medium, will 
involve a nonmagnetic intergranular phase. The size of the magnetic bits should 
be as close as possible to that of the superparamagnetic limit, and therefore a 
granular structure with a narrow grain size distribution is required. Grains are 
roughly equiaxed, so the origin of anisotropy must be magnetocrystalline and al- 
loys based on hexagonal cobalt are widely used. Control of the crystallographic 
texture improves the remanence and thus the signal. For HDD media, the 
present magnetic bit density has reached 100 Gb in⁻², with the following 
characteristics (Fujitsu prototype, 2002). The size of one magnetic bit is roughly 30 nm along 
the track (7.5 × 10⁵ bits per inch: 750 kbpi) and the track is 180 nm wide 
(1.42 × 10⁵ tracks per inch: 142 ktpi). The medium grain size is less than 10 nm 
and has to be as uniform as possible. To achieve stable bits and narrow transition 
widths, μ0H_c reaches 0.4 T and M_r t is 0.37 emu cm⁻² (1 emu = 10⁻³ A m²). 
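
The figures quoted above can be cross-checked with a few lines of arithmetic. The sketch below converts the linear and track densities given in the text into an areal density, a bit length, and a track pitch. The grain count per bit is only an estimate that assumes square 10 nm grains, which is a simplification not made in the text.

```python
NM_PER_INCH = 25.4e6  # 1 in = 25.4 mm = 25.4e6 nm

def areal_density_gbit_per_in2(bits_per_inch, tracks_per_inch):
    """Areal density in Gb per square inch from linear and track densities."""
    return bits_per_inch * tracks_per_inch / 1e9

def pitch_nm(count_per_inch):
    """Length in nm occupied by one bit (or one track) at a given density per inch."""
    return NM_PER_INCH / count_per_inch

BPI = 7.5e5      # 750 kbpi, from the text
TPI = 1.42e5     # 142 ktpi, from the text
GRAIN_NM = 10.0  # upper bound on grain size from the text; square grains are an assumption

density = areal_density_gbit_per_in2(BPI, TPI)
bit_length = pitch_nm(BPI)
track_pitch = pitch_nm(TPI)
grains_per_bit = bit_length * track_pitch / GRAIN_NM ** 2

print(f"areal density  : {density:.1f} Gb/in^2")
print(f"bit length     : {bit_length:.1f} nm")
print(f"track pitch    : {track_pitch:.1f} nm")
print(f"grains per bit : at least {grains_per_bit:.0f}")
```

The result is about 106 Gb in⁻², with a bit length near 34 nm and a track pitch near 179 nm, consistent with the rounded values quoted above. With grains smaller than 10 nm, each bit holds at least about 60 grains.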


14.3.2.2.1 Cobalt-Based Alloys 


Structure Glass disks or Ni-P-coated Al-Mg (4 at.%) alloy substrates are used 
for HDD (the glass disks are harder and stiffer). A thick layer (10 μm) of 
amorphous Ni-P is electrolessly plated onto the Al-Mg disk. Ni-P can be polished 
mirror-like (roughness 5 nm) and a surface texture can be added. Ni grains are 
superparamagnetic in the Ni-P layer so they do not contribute any magnetic sig- 
nal. The magnetic storage layer will be isolated from the substrate by an under- 
layer which induces crystallographic texture and a seed layer which controls the 
grain size distribution. Its top surface is protected by depositing a 
mechanically hard capping layer (diamond-like carbon or dense CN_x) and finally a 
lubricating layer is added. 


Storage layer The magnetic storage layer is a CoCr-based alloy. Cr segregates, 
leaving magnetic Co-rich grains, separated by a Cr-rich, nonmagnetic, intergranu- 
lar phase which decouples the Co grains. The Co grains must retain the hcp 
crystal structure, which is at the origin of the magnetocrystalline anisotropy and 
the large coercive fields. Ta or B addition will favor Cr segregation to the grain 
boundaries. However, Cr addition to Co grains reduces the magnetocrystalline 
anisotropy. The role of Pt addition is then to increase the Co anisotropy, and such 
compositions are used to reach coercivities of 0.5 T. A typical composition is Co 
containing 15 at.% Cr, 10 at.% Pt, and a lower percentage of Ta or B. The CoCr 
alloy undergoes a transition from a highly anisotropic hcp structure to a weakly 
anisotropic fcc structure if too much Pt is substituted. 


Underlayer A bcc-Cr underlayer develops a (110) texture, which induces cobalt 
(10-10) texture (c axis slightly out-of-plane). To get the Co c axis in-plane, 
Cr(002) textured layers are used. NiAl B2 structure grows with (112) texture. 
When a quaternary CoCrPtTa storage layer is being used, a better lattice match 
and crystallographic texture may require the use of Cr alloys (CrV, CrTi) as 
underlayers. Extra in-plane texture can be induced by substrate scratching. Laser 
texturing of the substrate can induce circumferential anisotropy, while leaving the 
substrate sufficiently smooth. 


14.3.2.2.2 Laminated and Antiferromagnetically Coupled Media 

Media grain size should be reduced to keep the transition width narrow and to 
get a large number n of grains per magnetic bit (media noise scales as 1/√n). 
However, thermal stability requires new strategies, since reducing the grain 
size while keeping the material properties unchanged induces superparamagnetic behavior. To 
reduce grain size while maintaining stability, new media have been proposed. 
The idea is to keep M_r t, which is the figure of merit for media, low by using two 
antiferromagnetically coupled magnetic layers. One example is Co/Ru/Co. When 
the ruthenium spacer is 0.6-0.7 nm thick, antiparallel coupling between the fer- 
romagnetic layers is induced by the oscillating spin polarization of conduction 
electrons in the nonmagnetic (Ru) spacer (RKKY mechanism). The Co/Ru/Co 
multilayer is crystallographically coherent. When both magnetic layers have the 
same thickness, the structure has no net magnetic moment; it is then called a 
synthetic or artificial antiferromagnet. The strength of the antiparallel coupling 
can be tuned by controlling the Ru layer thickness. The total thickness of a grain can thus be 
larger than that of single-layer media, pushing back the superparamagnetic limit 
while keeping a low effective M_r t. Double and triple magnetic layers create laminated 
media. Noise measurements show that some structures behave as if one grain 
were replaced by two or three grains, reducing the noise (1/√n law). 
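
The 1/√n law can be made concrete with a minimal sketch. It assumes that each magnetic layer of a laminated medium contributes statistically independent grains, so that the effective grain count is the number of grains per bit multiplied by the number of layers. The function names and the grain count of 60 are illustrative choices, not values from the text.

```python
import math

def relative_media_noise(grains_per_bit, magnetic_layers=1):
    """Relative noise amplitude ~ 1/sqrt(n_eff), n_eff = grains per bit x independent layers."""
    n_eff = grains_per_bit * magnetic_layers
    return 1.0 / math.sqrt(n_eff)

def snr_gain_db(magnetic_layers):
    """Signal-to-noise gain in dB when the noise amplitude drops by sqrt(layers)."""
    return 10.0 * math.log10(magnetic_layers)

GRAINS = 60  # illustrative grain count per bit (assumption)
for layers in (1, 2, 3):
    noise = relative_media_noise(GRAINS, layers)
    print(f"{layers} layer(s): relative noise {noise:.3f}, SNR gain {snr_gain_db(layers):.1f} dB")
```

Under this assumption two laminated layers lower the noise amplitude by a factor √2 and three layers by √3, without reducing the lateral grain size.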


14.3.2.2.3 New Materials 

Further improvements may be made by shifting to L1_0 materials (FePd, FePt, 
CoPt, etc.) or rare earth-transition metal intermetallics. The L1_0 structure is a 
natural multilayer. Pt and Pd have large spin-orbit coefficients which translate into 
large magnetocrystalline anisotropies. The L1_0 structure is also quite interesting 
because of the tunable character of the atomic order. It is possible to reduce the 
anisotropy and thus to suppress the out-of-plane easy axis, by reverting to the 
disordered fcc alloy (A1 phase). 

Intermetallics relevant for recording are compounds with a well-defined easy c 
axis. The presence of rare earth atoms induces very large magnetocrystalline an- 
isotropies. Processing rare earth elements and dealing with huge coercive fields 
are two key points to progress in this direction. 


14.3.2.3 Perpendicular Recording 


14.3.2.3.1 Magneto-Optical Media 

The magneto-optical Kerr effect is exploited to read so-called magneto-optical 
(M-O) media. Since it is sensitive to the optical properties, the absolute value of 
magnetization is less relevant. However, the stability of magnetic bits means that 
demagnetizing effects are still important. Since polar geometry is used, perpendicular 
magnetization is required. 

Writing Writing on the medium requires a perpendicular applied field. A large 
coil is used to create a spatially homogeneous field and the submicronic bits are 
written by reducing the coercive field in a controlled manner, i.e., by local heating 
using the focused laser at higher power than for reading. Materials with perpen- 
dicular anisotropy and a large temperature dependence of coercive field are the 
best choices. Heating above the Curie temperature was the initial strategy in the 
1960s and MnBi the first alloy (Kerr rotation 2°). 


Ferrimagnetism Intermetallic alloys such as TbFeCo have a ferrimagnetic config- 
uration, i.e., the transition metal (TM) moments are antiparallel to the rare earth 
(RE) ones. The temperature dependences of both sublattice magnetizations are 
quite different and when the composition is close to RE0.25TM0.75, the two 
sublattice magnetic moments compensate each other at room temperature (RT). The 
net magnetization becomes zero at the compensation temperature and the coer- 
cive field diverges since Zeeman energy no longer exists. When these alloys are 
heated above RT (i.e., above the compensation temperature), the coercive field de- 
creases rapidly, allowing for writing. RE-TM alloys have a larger Kerr rotation for 
infrared wavelengths and were the first commercialized systems. Perpendicular 
anisotropy can be developed in amorphous layers, and thermal cycling does not 
degrade the material. However, terbium is easily oxidized and capping of the 
layer is necessary. Since Kerr rotation is only 0.2°, capping layers include an anti- 
reflection layer and internal reflections to eliminate the nonrotated component of 
light and to enhance the rotated component of the reflected beam. 

Multilayers such as Co/Pt or Co/Pd are also applicable as M-O media. Perpen- 
dicular anisotropies can be dominant when the individual layers are kept thin 
(a few atomic planes). Moreover, large Kerr effects have been measured in 
these multilayers at shorter wavelengths (relevant for blue laser diodes) and T_C 
can be tuned by alloying Co with Ni to get a strong temperature dependence 
of H_c. 

The lack of an industry standard for magneto-optical disks and the develop- 
ment of large-capacity Flash and HDD transportable media make M-O recording 
a niche market. The ultimate density is limited by diffraction-limited 
focusing, i.e., d = λ/(2NA), NA being the numerical aperture of the optical system, and 
by the angular resolving power d = 1.6λ/D, with D the lens diameter. 
Super-resolution strategies have been developed using amplification layers but they 
may not be enough to warrant the survival of this technology. 
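
To give a feeling for the diffraction limit, the sketch below evaluates d = λ/(2NA) for three laser wavelengths and converts the spot size into a storage density. The wavelengths, the numerical aperture, and the assumption of one bit per d × d square are illustrative choices and are not taken from the text.

```python
import math

NM_PER_INCH = 25.4e6  # 1 in = 25.4e6 nm

def spot_size_nm(wavelength_nm, numerical_aperture):
    """Diffraction-limited spot size d = lambda / (2 NA)."""
    return wavelength_nm / (2.0 * numerical_aperture)

def density_gbit_per_in2(spot_nm):
    """Areal density if each bit occupies a d x d square (an assumption)."""
    return (NM_PER_INCH / spot_nm) ** 2 / 1e9

# Illustrative laser wavelengths and numerical aperture (assumptions).
LASERS_NM = {"infrared": 780.0, "red": 650.0, "blue": 405.0}
NA = 0.6

for name, wavelength in LASERS_NM.items():
    d = spot_size_nm(wavelength, NA)
    print(f"{name:8s}: d = {d:6.1f} nm, about {density_gbit_per_in2(d):.1f} Gb/in^2")
```

Even with a blue laser diode, this estimate stays far below the 100 Gb in⁻² range of HDD media, which illustrates why super-resolution strategies are needed.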


14.3.2.3.2 High Density Perpendicular Recording 

To improve the stability of HDD media and to allow for further improvements in 
the storage density, a transition from longitudinal to perpendicular recording 
(magnetic moments are perpendicular to the magnetic film) is now taking place. 
Commercial products have been announced and have been available since the 
end of 2005. The main change that perpendicular recording brings is the modifi- 
cation of the bit aspect ratio. Magnetic grains become column-like. Furthermore, 
when the density is increased, the magnetic bits become more elongated, i.e., the 
demagnetizing field does not increase. The new geometry requires the design of a 
new write head because the applied field is perpendicular to the surface. The 
storage multilayer also requires a new design, which includes a soft underlayer (SUL) 
below the recording layer to guide the B flux (Fig. 14.22). 

As far as materials are concerned, out-of-plane anisotropy has to be induced. 
The distribution of easy axes can be better defined compared to longitudinal (in- 
plane) anisotropy, which is quasi-isotropic in-plane since the substrates are intrin- 
sically isotropic in-plane. The recording medium is effectively in the gap of the 
write head, so it experiences the maximum field from the head. Therefore, larger 
applied fields are available, films with larger coercive fields will be writable, and 
thus larger magnetocrystalline anisotropies may be used. 

The first perpendicular recording layers are based on CoCrPt alloy with an 
out-of-plane c axis. SiO2 may be used as a segregation agent to obtain a narrow grain 
size distribution. The crystallographic texture of the recording layer has to be 
developed on top of the SUL (200 nm thick) covered with a nonmagnetic inter- 
layer so that both magnetic layers are decoupled. The SUL should be thick 
enough to guide the B flux. A material with high magnetization is preferred and 
it must maintain soft properties (H_c smaller than 10 A m⁻¹) with a controlled 
magnetic state (induced radial or longitudinal anisotropy) and remain 
domain-free to eliminate noise in the read signal. To eliminate domains, the SUL can 
be biased by an antiferromagnetic layer such as Ir20Mn80 or a hard layer such 
as SmCo. Controlled anisotropy may require a specific heat treatment under a 
magnetic field to release deposition stress and induce a well-defined anisotropy 
direction. 

Soft underlayers previously studied include Fe19Ni81, amorphous CoNbZr, 
FeCoB, Fe95Ta3N2, Co92Ta3Zr5, FeAlSi, etc.; the nonmagnetic interlayer can be 
2-5 nm of Ag, MgO, or Cr, depending on miscibility and texture. 

To increase bit density, while reducing the superparamagnetic limit, 
higher-anisotropy materials are being studied. Fe50Pt50 ordered alloy (L1_0 phase), 
TbFeCo, and Co/Pd multilayers are being investigated as layers with a high 
perpendicular anisotropy. These are the same materials as the ones studied to push 
back the superparamagnetic limit in longitudinal recording. Their easy c axis is 
now perpendicular to the plane. 

This will require drastic changes from the well-established CoCr alloys. Keep- 
ing the coercive field low enough to allow writing may not be possible, so strat- 
egies using heating to decrease the coercive field during the writing process are 
currently being studied. 


14.3.3 
Outlook 


Perpendicular recording is being implemented and will be improved in the years 
to come toward the dream density of 1 Tb in⁻². In 2005, prototypes were already 
presented by Hitachi (230 Gb in⁻²) and Seagate (245 Gb in⁻²). 




The next improvement may be heat-assisted writing, to work around the limit 
on the maximum field a head can create (2.4 T). Reducing the coercive field while 
writing is already a strategy used in magneto-optical recording. 

The next revolution will be the so-called “patterned media.” In continuous 
films, one magnetic bit contains roughly a hundred crystallographic grains, the 
volume of each of them being close to the superparamagnetic limit. Moreover, 
these grains are randomly distributed on the surface. Patterned media rely on 
one bit-one grain storage. Each bit is then better defined: there is no longer a 
transition region between bits, nor the noise associated with this transition. 
However, the bit positions must be organized on the disk. 

To pattern a disk, several techniques are being developed in laboratories: 5-10 
nm magnetic particles formed using a chemical route can self-organize on the 
surface, with the help of pre-patterned features such as grooves to maintain 
long-range order. Nanoimprinting of a polymer, similar to CD-ROM imprinting, 
allows the production of sub-100 nm features on large surfaces, the master press 
being produced through conventional electron beam lithography. Patterning of 
discrete tracks 140 nm wide has been demonstrated on a 95 mm disk by Komag 
in 2005. Direct electron-beam lithography is too slow to be compatible with 
mass production. Since magnetic storage is a small market compared to 
microelectronics, the need for lithography means that storage depends on the 
microelectronics technology roadmap (Moore’s law) to produce the tools necessary for 
patterning at the relevant nanometric scale. This technological bottleneck may limit the 
growth of high-density storage to the pace of Moore’s law, whereas it has been faster 
than the microelectronics trend in recent years. 

Recording materials require specific properties which can be finely tuned by 
alloying. The most relevant property is the magnetocrystalline anisotropy, which 
requires anisotropic crystallographic structures, i.e., compounds or ordered alloys. 
At the same time, the structure must be controlled at the nanometer scale 
over wide areas, and modern media involve multilayers comprising nucleation 
layers, texture-controlling layers, magnetic hard and soft layers, mechanically 
hard layers, and lubricating layers. Each of these layers plays an important role 
and has to be optimized. 


Further Reading 

The transport properties of metallic and semiconducting materials have been 
studied and used for a long time. Present electronics is based on the transport 
of electrical charges in metallic conductors and semiconducting channels and 
makes use of the influence of electrical field on charge drift and charge density. 
For quite a long time the spin of electrons or holes was not considered when 
designing an electrical transport device, which can be understood since in non- 
magnetic conductors, the properties of electrons do not depend on spin. This, 
however, is no longer valid when considering ferromagnetic conducting mate- 
rials. Using the spin as an extra parameter or degree of freedom widens the 
field of electronics and is called spin electronics or spintronics. This is a fast- 
developing field, which was created at the end of the 1980s when giant magneto- 
resistance (GMR) was discovered and has already given fruitful applications in 
the field of sensing. Magnetoresistive sensors have rapidly overtaken inductive 
sensors in the field of high-density recording, and prototypes have appeared in 
the field of data storage: magnetic RAM (MRAM). 


14.4.1 
Electrical Transport in Conductors 


14.4.1.1 Conventional Transport 
The two main parameters used to define the electrical properties of conductors 
are the density of carriers and the mobility of these carriers. 


Carrier density When the Fermi level intersects bands, a finite density of states 
exists (Fig. 14.26). The material is then a metal and, in general, its density of 
states depends only weakly on temperature. When the Fermi level lies in a 
bandgap, the density of carriers is zero at low temperature (intrinsic semiconduc- 
tor) and then it increases exponentially when the temperature increases, which 
creates thermally excited carriers. Addition of impurities (i.e., controlled doping) 
may create energy levels close to the gap edges and may strongly enhance the 
carrier density. Small carrier densities may be modified by an electrical field (depletion 
layers); this effect gave birth to the field-effect transistor (FET). 


Fig. 14.26 Band structure of a nonmagnetic metal. [Figure labels: Fermi level, N↑(E), N↓(E).] 

Mobility A high density of carriers is not enough to induce low electrical 
resistivity ρ (i.e., high electrical conductivity σ = 1/ρ). Mobility of the carriers may vary 
by several orders of magnitude. The relevant scattering mechanisms which 
control mobility have to be considered. The quality of the crystalline state is a major 
parameter: the better the crystal (topological) order, the larger the conductivity σ. 
A perfectly ordered crystal would have infinite conductivity, since the electron 
wavefunctions would be unscattered Bloch waves. Several kinds of imperfections 
exist in real materials. First of all, quantum mechanics prevents atoms from 
being motionless (i.e., on a perfectly periodic lattice) and experimentally accessible 
temperatures are always above 0 K. Crystal defects (granular structure, atomic 
disorder in alloys, dislocations, point defects, dopants) do not depend on 
temperature and contribute a constant term to the resistivity. Lattice vibrations (phonons) can also 
be considered as defects but have a strong temperature dependence; they make 
the main scattering contribution in pure metals at room temperature. In 
addition, several minor scattering mechanisms can also take place, e.g., 
electron-electron scattering, electron-magnon scattering, etc. 


Conductivity In the relaxation-time approximation, the different scattering 
mechanisms are assumed to be independent. Each of them is characterized by a 
scattering time τ_i. Being independent, the probabilities of collisions (scattering) 
are simply additive. This means that the scattering rates add: 1/τ = 
1/τ_1 + 1/τ_2. Since resistivity is proportional to 1/τ, this finally leads to 
Matthiessen’s rule ρ(T) = ρ_1 + ρ_2. For example, resistivity can usually be considered to 
be the sum of a residual resistivity due to static defects (impurities, structural 
defects), which is the low-temperature limit of ρ(T), and a thermal resistivity due 
to phonons. For a conductor with carrier density n, carrier charge q, and carrier 
effective mass m*, the Drude formula relates the conductivity to these intrinsic 
parameters through the characteristic scattering time τ [Eq. (34)]. The mean free 
path λ is related to the characteristic scattering time τ through the Fermi velocity 
v_F. The carrier drift velocity v (which defines the current j) is proportional to the 
applied electrical field, the mobility μ being the proportionality factor [Eq. (35)]. 


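\[ \sigma = \frac{n q^2 \tau}{m^*} \quad \text{and} \quad \lambda = v_F \tau \tag{34} \]

\[ v = \mu E \quad \text{and} \quad \mu = \frac{q \tau}{m^*} \tag{35} \]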


The Drude relaxation time τ is 2 × 10⁻¹⁴ s for a very good metal such as silver 
and decreases to 1.4 × 10⁻¹⁵ s for a ferromagnetic metal such as iron. To improve 
the model, the real Fermi surface has to be taken into account and the k 
dependence of the scattering times should be included: realistic calculations of 
resistivities are quite elaborate! 
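
The Drude relations and Matthiessen's rule can be checked numerically. The sketch below uses the relaxation time quoted for silver, together with a free-electron carrier density and Fermi velocity for silver that are standard textbook values assumed here (they are not given in this chapter). The free-electron mass is used for m*, which is also an assumption.

```python
import math

E_CHARGE = 1.602176634e-19     # elementary charge, C
M_ELECTRON = 9.1093837015e-31  # free-electron mass, kg (used as m*, an assumption)

def drude_conductivity(n, tau, q=E_CHARGE, m_eff=M_ELECTRON):
    """Drude conductivity sigma = n q^2 tau / m* in S/m."""
    return n * q ** 2 * tau / m_eff

def mobility(tau, q=E_CHARGE, m_eff=M_ELECTRON):
    """Carrier mobility mu = q tau / m* in m^2/(V s)."""
    return q * tau / m_eff

def mean_free_path(v_fermi, tau):
    """Mean free path lambda = v_F tau in m."""
    return v_fermi * tau

def combined_tau(*taus):
    """Independent scattering mechanisms: the rates 1/tau_i add."""
    return 1.0 / sum(1.0 / t for t in taus)

N_AG = 5.86e28   # conduction electrons per m^3 in Ag (assumed textbook value)
VF_AG = 1.39e6   # Fermi velocity of Ag in m/s (assumed textbook value)
TAU_AG = 2e-14   # relaxation time quoted in the text, s

sigma = drude_conductivity(N_AG, TAU_AG)
rho_micro_ohm_cm = 1e8 / sigma  # 1 ohm m = 1e8 micro-ohm cm
print(f"rho(Ag)  ~ {rho_micro_ohm_cm:.1f} micro-ohm cm")
print(f"lambda   ~ {mean_free_path(VF_AG, TAU_AG) * 1e9:.0f} nm")
print(f"mobility ~ {mobility(TAU_AG) * 1e4:.0f} cm^2/(V s)")
```

With these inputs the resistivity comes out at about 3 μΩ cm, the same order of magnitude as the tabulated value for silver, and the mean free path is a few tens of nanometers. Because the rates add in `combined_tau`, the resistivities computed for each mechanism separately add up to the total, which is Matthiessen's rule.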

Table 14.10 Resistivity at room temperature of a few metallic conductors. 

Metallic material    Resistivity [μΩ cm] (300 K) 
Ag                   1.6 
Co                   5.8 (a axis), 10.3 (c axis) 
Ni80Fe20             15 
Pt                   10.4 
Ta                   13.1 
Bi                   156 
Fe + x% Si           10 + 7x 
Steel                75 

Fig. 14.27 Band structure of a strong ferromagnetic metal. [Figure labels: E, Fermi level, N↑(E), N↓(E).] 

Characteristic values of resistivity at 300 K (Table 14.10) are 1 μΩ cm for a good 
metallic element and 10 μΩ cm for a magnetic element. Pure element resistivities 
span two orders of magnitude. Two key parameters are the electron symmetry in 
the conduction band and the density of electrons. For example, Bi is a semi-metal 
with a small carrier density. The larger resistivity of magnetic elements can be 
interpreted by taking into account s-d scattering. In strong ferromagnetic metals 
(Ni, Co) the 3d↑ band is full and E_F lies in the 3d↓ band (Fe is a weak 
ferromagnet; the Fermi level passes through both 3d bands). Please note that strong/weak 
ferromagnetism refers to 3d band filling and does not correlate with properties 
such as magnetization, magnetocrystalline anisotropy, or coercive field. In 
transition metals a partially filled 4s band also exists at the Fermi level (Fig. 14.27). The 
s bands are not intrinsically spin-polarized and only acquire some polarization 
through hybridization with 3d bands. The 4s electrons have smaller effective 
masses (are lighter) than the 3d electrons, which are more localized. 
4s-like electrons carry most of the current. The direct consequence for mean free 
paths is that s↓-d↓ scattering occurs whereas s↑ electrons do not experience such 
events since there are no d↑ states at the Fermi level. 

The maximum value of resistivity for a metal is of the order of 200 μΩ cm. It 
corresponds to the limit where the electron mean free path becomes similar to 
the interatomic distance (Ioffe-Regel criterion). 


14.4.1.2 Role of Disorder 

In alloys, atomic disorder plays a leading role. When impurities are introduced in 
a normal metal, Nordheim's rule states that the increase in resistivity depends 
linearly on the concentration c of impurities and the Linde-Norbury rule adds that 
the increase varies as the square of the valence electron difference ΔZ between the 
impurity and the host matrix: Δρ ∝ c ΔZ². For example, iron resistivity (10 μΩ cm at 300 K) 
increases by 7 μΩ cm for each 1% of Si added. Si addition is used to reduce eddy 
currents in Fe-core transformers without diluting iron magnetization. A direct 
consequence of disorder is that the temperature dependence of resistivity is 
weak, so resistivity can be used to monitor atomic ordering during annealing (cf. 
Chapter 5). Disordered alloys have larger resistivities than pure elements and, as 
active spintronics materials, it is mainly alloys with small ΔZ which are 
considered, such as FeNi or FeCo. 


14.4.1.3 Transport in Magnetic Conductors 

The main difference in a ferromagnetic conductor is the spin-dependent density 
of states. The densities of spin-up and spin-down carriers are different, which 
defines a new parameter P: the spin polarization of the conduction band. It is 
now necessary to define two carrier populations: the majority carrier and minority 
carrier populations. The majority carriers do not necessarily have their spin paral- 
lel to the material magnetization — it depends on the filling of bands at the Fermi 
level. 

Traditional transition metals and alloys have spin polarization of the order of 
40-50% (Table 14.11). Since magnetoresistive effects are directly related to spin 
polarization, a search for higher polarization is a long-term trend. Ferromagnetic 
half-metallicity, i.e., the existence of a gap at E_F for one spin population while the 
other spin carriers are metallic, has been predicted to exist in several compounds. 
These are crystals with no s bands at E_F and narrow bands to allow for gaps. 
Some conducting oxides (CrO2, La0.7Sr0.3MnO3, Fe3O4, etc.) and some 
compounds with the semi-Heusler (C1_b) structure (NiMnSb, PtMnSb, etc.), CoS2, 
and some chalcogenides have fully spin-polarized conduction bands according to 
ab-initio calculations. Experimentally, spin polarizations in excess of 80% have 
been measured at low temperatures but 3d metals and alloys are still the best 
spin-polarized materials at room temperature. 


Table 14.11 Typical spin polarizations of ferromagnetic metals. 

Material                          Spin polarization 
Ni                                20-30% 
Co                                35-45% 
Ni80Fe20                          32-48% 
Fe                                40-44% 
Co50Fe50                          50% 
NiMnSb, CrO2, La0.7Sr0.3MnO3      > 50% at low temperature 

Table 14.12 Characteristic spin diffusion lengths. 

Material      Spin diffusion length [nm] 
Cu            350 (300 K) 
Ni80Fe20      4 (4 K) 
Co            59 (77 K), 38 (300 K) 

Spin-flip Spin-flip scattering allows both carrier populations to interact. It is a 
rare event compared to non-spin-flip mechanisms, so both populations are well 
defined and a two-fluid picture is quite commonly used, especially at low 
temperature. Nonbalanced spin populations can be induced by polarized light absorption 
or when an electric current passes through an interface between two materials 
with different spin polarizations. Equilibrium of both populations is restored by 
the relevant spin-flip scattering mechanisms. The spin-flip characteristic time 
is τ_sf. The length scale to restore spin equilibrium is the spin diffusion length l_sf 
[Eq. (36) and Table 14.12]. 


\[ l_{sf} = \sqrt{\frac{v_F \tau_{sf} \lambda}{3}} \tag{36} \]


Since a two-fluid picture is relevant, one introduces α = σ↑/σ↓ = ρ↓/ρ↑, the 
resistivity asymmetry coefficient; α > 1 for transition metals because s↓-d↓ 
scattering impedes σ↓. For Co, α is 2 to 3; for FeNi it increases up to α = 9. At high 
temperature a ρ↑↓ spin-mixing term can be introduced to take spin-flip into 
account. 
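
The two-fluid picture can be illustrated by splitting a measured resistivity into its two spin channels. The sketch below assumes that the channels conduct in parallel without spin mixing, with ρ↓/ρ↑ = α. The current polarization (σ↑ − σ↓)/(σ↑ + σ↓) is an extra quantity introduced here for illustration, and the input values are the Ni80Fe20 resistivity of Table 14.10 and the α values quoted above.

```python
import math

def spin_channel_resistivities(rho_total, alpha):
    """Return (rho_up, rho_down) with rho_down / rho_up = alpha and both channels in parallel."""
    rho_up = rho_total * (1.0 + alpha) / alpha
    rho_down = rho_total * (1.0 + alpha)
    return rho_up, rho_down

def parallel_resistivity(rho_up, rho_down):
    """Total resistivity of two independent spin channels conducting in parallel."""
    return rho_up * rho_down / (rho_up + rho_down)

def current_polarization(alpha):
    """(sigma_up - sigma_down) / (sigma_up + sigma_down) = (alpha - 1) / (alpha + 1)."""
    return (alpha - 1.0) / (alpha + 1.0)

RHO_NIFE = 15.0  # micro-ohm cm, Ni80Fe20 at 300 K (Table 14.10)
for alpha in (2.0, 3.0, 9.0):
    rho_up, rho_down = spin_channel_resistivities(RHO_NIFE, alpha)
    print(f"alpha = {alpha:.0f}: rho_up = {rho_up:.1f}, rho_down = {rho_down:.1f} micro-ohm cm, "
          f"current polarization = {current_polarization(alpha):.2f}")
```

For α = 9 the spin ↑ channel carries 90% of the current, which is why such alloys are attractive as spin-polarized current sources.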


Interfaces In nanostructures, interfaces between materials with different spin 
polarizations can be treated as spin-dependent interfacial resistances. This 
interfacial resistance is of the order of 1 × 10⁻¹⁵ Ω m². Its value changes by a factor of 5 
depending on spin in the case of the FeNi/Cu interface. Spin-dependent 
transport through an interface between two different spin-polarized materials creates 
a spin accumulation zone. To generalize and solve Ohm's law in a magnetic 
nanostructure, one uses the spin-dependent chemical potential. The 
spin-dependent currents are proportional to the spin-dependent chemical potential 
gradients. 


14.4.2 
Magnetoresistance 


Application of a magnetic field to a metal modifies the electron distribution in 
the material and leads to several magnetoresistive (MR) effects. Some MR effects 
are volume effects (cyclotron and anisotropic magnetoresistance) and have been 
known for a long time. Since the fabrication of nanostructured materials has 
been mastered, especially when in multilayer form, new effects related to spin 
transport through interfaces have been discovered (giant magnetoresistance and 
tunnel magnetoresistance, spin injection and spin torque effects). 

14.4.2.1 Cyclotron Magnetoresistance 

In any metal, even nonmagnetic ones, an applied magnetic field will disturb the 
electron trajectories. Longitudinal and transverse cyclotron magnetoresistances 
can be defined. The effect is small and it corresponds to an increase of resistance 
under the field. It is generally proportional to the square of the field. Most metals 
follow Kohler’s law, i.e., Δρ/ρ = f(B/ρ), with f being a material-dependent 
function which is close to a quadratic law. The lower the temperature (smaller ρ) and 
the higher the field, the higher is the cyclotron MR. The order of magnitude of 
the effect is 0.1% in a 1 T field. In fact, this MR at high field can be shown to be 
very sensitive to the electronic orbits on the Fermi surface (open orbits or closed 
orbits). By using single-crystalline metals, and varying the amplitude and the 
direction of the applied field relative to the crystallographic axes, Shubnikov-de 
Haas oscillations of the resistivity can be observed. These oscillations are 
periodic in 1/B and give insight into details of the Fermi surface. 


14.4.2.2 Anisotropic Magnetoresistance (AMR) 

AMR is also a volume effect. In ferromagnetic metals, s and d electrons are 
present in the conduction band. If the asymmetry of the d band is large, s-d 
scattering depends on the configuration between magnetization M and the carrier wave 
vector k. Two resistivities can be defined: ρ∥, where the current and the 
magnetization are parallel, and ρ⊥, where the current and the magnetization are 
perpendicular (Fig. 14.28). The angular dependence of AMR can be written as Eq. (37). 


\[ \rho = \rho_\perp + (\rho_\parallel - \rho_\perp) \cos^2(\mathbf{k}, \mathbf{M}) \tag{37} \]


The AMR can reach a few percent in nickel and nickel alloys (Ni80Fe20: 2% at 
room temperature, 20% at low temperature). (ρ∥ − ρ⊥) is positive for transition 
metal systems. This kind of MR was the first one used in sensors; for 
example, it is used in compasses (resolution of a millitesla for a DC field, a nanotesla at 
higher frequencies). It should be noted that the resolution of a SQUID sensor is better 
but it requires a cryogenic environment since it is based on a superconducting 
device. 
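
The angular law ρ = ρ⊥ + (ρ∥ − ρ⊥) cos²θ, with θ the angle between current and magnetization, is easy to tabulate. The sketch below uses the Ni80Fe20 resistivity of Table 14.10 as ρ⊥ and the 2% room-temperature AMR quoted above. Defining the AMR ratio as (ρ∥ − ρ⊥)/ρ⊥ is an assumption made here for the illustration.

```python
import math

def amr_resistivity(theta, rho_par, rho_perp):
    """Resistivity for an angle theta (radians) between current and magnetization."""
    return rho_perp + (rho_par - rho_perp) * math.cos(theta) ** 2

RHO_PERP = 15.0   # micro-ohm cm, Ni80Fe20 at 300 K (Table 14.10), taken as rho_perp
AMR_RATIO = 0.02  # 2% at room temperature; (rho_par - rho_perp) / rho_perp is an assumed definition
RHO_PAR = RHO_PERP * (1.0 + AMR_RATIO)

for degrees in (0, 30, 45, 60, 90):
    rho = amr_resistivity(math.radians(degrees), RHO_PAR, RHO_PERP)
    print(f"theta = {degrees:2d} deg  ->  rho = {rho:.3f} micro-ohm cm")
```

The response is steepest near 45°, where the cos² law is most sensitive to small rotations of the magnetization.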


14.4.2.3 Giant MR (GMR) and Tunnel MR (TMR) 

GMR and TMR are MR effects due to interfaces. The basic requirements are two 
ferromagnetic layers which are magnetically decoupled in order to be able to 
obtain parallel and antiparallel magnetization configurations (Fig. 14.29). The MR 
effect will be observed when comparing the resistance of the two configurations. 
In order to decouple two ferromagnetic electrodes, one needs a nonmagnetic 
spacer. If the spacer is a metal (e.g., Cu), then the GMR effect may be observed; if 
the spacer is an insulator (Al2O3, MgO), then a TMR may be measured. 

Fig. 14.28 AMR configurations to define the parallel ρ∥ and perpendicular ρ⊥ resistivities. 

Fig. 14.29 Parallel and antiparallel magnetic configurations in GMR and TMR junctions. 


GMR The mechanism for GMR is the spin-dependent scattering experienced
by electrons in ferromagnetic layers. A large difference in resistivity between
the two spin directions should exist. In a given layer, spin ↑ electrons can be
majority electrons or minority electrons, depending on the magnetization
configuration. If spin ↑ electrons experience low scattering in both electrodes,
then they will short-circuit the spin ↓ current. In this case the GMR junction
resistance will be low. In the antiparallel magnetization configuration, both
spin currents will experience low scattering in one electrode and large scattering
in the other one, giving a large net resistance to the junction.
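
The mechanism described above is often summarized by a two-current resistor picture. The following is a minimal sketch of that picture, not a quantitative model of a real multilayer: each spin channel is treated as two resistances in series (one per ferromagnetic layer), and the two spin channels conduct in parallel. The names `r_low` and `r_high` are hypothetical resistances for weakly and strongly scattered electrons, and spacer and interface contributions are neglected.

```python
def series(*resistances):
    """Total resistance of resistors connected in series."""
    return sum(resistances)


def parallel(a, b):
    """Total resistance of two resistors connected in parallel."""
    return a * b / (a + b)


def gmr_two_current(r_low, r_high):
    """Return (R_parallel, R_antiparallel, GMR ratio) for a trilayer.

    r_low  : resistance seen by a spin that is weakly scattered in a layer
    r_high : resistance seen by a spin that is strongly scattered in a layer
    """
    # Parallel magnetizations: one spin channel is weakly scattered in both
    # layers and short-circuits the other, strongly scattered channel.
    r_p = parallel(series(r_low, r_low), series(r_high, r_high))
    # Antiparallel magnetizations: each spin channel is weakly scattered in
    # one layer and strongly scattered in the other.
    r_ap = parallel(series(r_low, r_high), series(r_high, r_low))
    return r_p, r_ap, (r_ap - r_p) / r_p


# Illustrative values only (arbitrary units).
r_p, r_ap, ratio = gmr_two_current(r_low=1.0, r_high=3.0)
print(f"R_parallel = {r_p:.3f}, R_antiparallel = {r_ap:.3f}, ratio = {ratio:.3f}")
```

In this simplified picture the ratio equals (r_high − r_low)² / (4 r_low r_high), which illustrates why a large difference between the two spin resistivities is needed.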

Two electrical configurations exist. If the current is in the plane of the trilayer, 
we have the CIP (current-in-plane) GMR. If the current passes perpendicular to 
the interfaces it is the CPP-GMR (current-perpendicular-to-plane) (Fig. 14.30). In 
CIP the relevant length scale for the spacer thickness is the electron mean free 
path. Electrons should visit both ferromagnetic electrodes. The typical thickness 
of the spacer is a few nanometers. In CPP geometry, the spacer thickness can be 
larger (but small compared to the spin diffusion length in the spacer, which could 
be as large as 100 nm). 


TMR The TMR mechanism is different. Only current perpendicular to the
spacer is possible (otherwise there would be two independent currents on both
sides of the insulating barrier). Conduction occurs by tunneling through
the barrier. It depends on the densities of states at the Fermi energy E_F on
both sides. Tunneling electrons maintain their spin, so the tunnel probability
involves the filled states in the starting electrode and the empty states with the
same spin in the final electrode. A simple model from Jullière [Eq. (38)] relates
the TMR value to the spin polarizations P_i of both electrodes.


TMR = \frac{R_{antiparallel} - R_{parallel}}{R_{parallel}} = \frac{2 P_1 P_2}{1 - P_1 P_2}    (38)

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| / 
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Fig. 14.30 Electrical configurations for GMR: current-perpendicular-to- 
plane (CPP) and current-in-plane (CIP). 
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TMR values as large as 50% can be obtained using an amorphous alumina bar- 
rier. Julliere’s model has to be completed to really understand the tunnel junction. 
In particular, the evanescent wavefunction in the barrier has to be taken into ac- 
count. Depending on their symmetry, wavefunctions can have different penetra- 
tion depths in the barrier, leading to spin filtering. A full quantum mechanical 
treatment of the structure is then required. Recently, crystallized MgO barriers 
have been shown theoretically and then experimentally to spin-filter one spin 
population very efficiently. MgO barrier implementation has led to TMR ratios 
greater than 200%. 
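
To give a feeling for the numbers quoted above, the short sketch below evaluates Jullière's relation. It assumes that the TMR ratio is normalized to the parallel resistance, TMR = 2 P_1 P_2 / (1 − P_1 P_2), which is the convention that allows values above 100%; with the other common normalization (to the antiparallel resistance) the denominator becomes 1 + P_1 P_2. The function names are illustrative, and the "effective polarization" obtained for MgO barriers should be read only as a figure of merit, since the text notes that Jullière's model is incomplete for such junctions.

```python
import math


def julliere_tmr(p1, p2):
    """TMR = (R_AP - R_P) / R_P = 2 P1 P2 / (1 - P1 P2) (assumed convention)."""
    return 2.0 * p1 * p2 / (1.0 - p1 * p2)


def polarization_for_tmr(tmr):
    """Polarization P (identical electrodes) that yields a target TMR.

    Obtained by inverting TMR = 2 P^2 / (1 - P^2), i.e. P^2 = TMR / (2 + TMR).
    """
    return math.sqrt(tmr / (2.0 + tmr))


for target in (0.5, 2.0):  # 50% (alumina) and 200% (MgO) as quoted in the text
    p = polarization_for_tmr(target)
    print(f"TMR = {target:.0%} corresponds to an effective P of {p:.2f}")
```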

The relevant length scale for the junction is the thickness of the barrier (1-2 
nm). The electrodes play the role of resistances in series and their thickness is 
usually kept as thin as a few nanometers. Since only the spin polarization at the 
two barrier interfaces is relevant, high spin polarization CoFe is usually inserted 
on both sides of the barrier. 


14.4.2.4 Magnetic Field Sensors 

One main advantage of sensors based on a magnetoelectrical effect (Hall effect or
magnetoresistance) is that the sensor can be made smaller without reducing
the amplitude of the effect. Unlike inductive sensors, whose response is
proportional to the size of the coil and to the time derivative of the field,
magnetoelectrical sensors can work in static mode.

Hall sensors require high-quality semiconducting layers involving epitaxial de- 
position and controlled doping. Moreover, four electrical contacts are required. 
MR sensors can be deposited in a polycrystalline state onto Si wafers using the 
sputtering technique. Two current leads are enough to read the signal. 

To include a TMR or a GMR junction in a sensor, several criteria must be met. 
The magnetic configuration of the junction has to be single-valued and reproduc- 
ible. One of the magnetic layers, called the reference layer, is chosen so that its 
magnetization is fixed. Only the magnetization of the free layer will be allowed 
to rotate and such a junction is called a spin valve. 


Reference layer Blocking of the reference layer can be achieved by using a
high-coercivity material such as Co, but a preferred solution is to pin this layer
to an antiferromagnetic (AF) layer. An antiferromagnet responds only to very high
magnetic fields. The magnetic moment at the surface of the AF layer acts as a
bias field on the reference layer. The reference layer magnetization loop will be
shifted by the exchange bias field, and the coercive field may be enhanced by the
AF layer. One magnetization direction becomes more favorable (Fig. 14.31).
Direct exchange coupling between the pinned layer and the AF layer is obtained
by sequentially depositing a NiFe ferromagnetic electrode and an AF layer
(FeMn, IrMn, NiO, PtMn) or an artificial AF layer (Co/Ru/Co). Criteria for
choosing the AF layer include its Néel temperature (i.e., its transition
temperature) and the blocking temperature, i.e., the temperature above which the
exchange bias field disappears. The blocking temperature depends on the AF
thickness. When the AF is metallic, it also gives a non-MR contribution to the
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Fig. 14.31 Microscopic origin of the exchange bias between a 
ferromagnetic layer and an antiferromagnetic pinning layer. 


junction (short-circuit (CIP) or serial resistance (CPP)), so its thickness must be
kept small (<50 nm). Deposition in a magnetic field, or a suitable thermal
treatment under a magnetic field, will pin the reference layer in the chosen
direction. Such field processing may be carried out before the free-layer deposition.


Free layer The second ferromagnetic layer is the sensing electrode or free layer. A
soft material such as Fe20Ni80 is usually chosen. In the absence of a field its
magnetization direction must be defined. A uniaxial anisotropy may be induced by
applying a magnetic field or by patterning the free layer (shape anisotropy), or it
may be induced during growth. To get linear sensing, biasing of the junction may
be required. Indeed, GMR and TMR vary as cos θ, where θ is the angle between the
free-layer and reference-layer magnetizations. Linear sensing requires that the
working point be around θ = 90°. Either orthogonal anisotropies are created during
the deposition and annealing process or external biasing must be used to rotate
the free layer 90° away from the reference layer. Special care must be taken not
to modify the bottom-layer anisotropy when inducing anisotropy in the second
layer. This usually defines the deposition sequence.
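
The requirement of a working point near 90° can be illustrated with a short numerical sketch. It assumes the simple angular law R(θ) = R_P + (R_AP − R_P)(1 − cos θ)/2, which is one common way of writing the cos θ dependence; the resistance values and function names are hypothetical and serve only to show how small the deviation from linearity is close to 90°.

```python
import math


def spin_valve_resistance(theta, r_parallel, r_antiparallel):
    """Resistance for an angle theta (radians) between the two magnetizations."""
    delta = r_antiparallel - r_parallel
    return r_parallel + delta * (1.0 - math.cos(theta)) / 2.0


def linearized_resistance(theta, r_parallel, r_antiparallel):
    """First-order expansion of the same law around theta = 90 degrees."""
    r_mid = 0.5 * (r_parallel + r_antiparallel)
    slope = 0.5 * (r_antiparallel - r_parallel)  # dR/dtheta at 90 deg (sin 90 = 1)
    return r_mid + slope * (theta - math.pi / 2.0)


R_P, R_AP = 100.0, 110.0  # illustrative values in ohms
for offset_deg in (-10, -5, 0, 5, 10):
    theta = math.radians(90 + offset_deg)
    exact = spin_valve_resistance(theta, R_P, R_AP)
    linear = linearized_resistance(theta, R_P, R_AP)
    print(f"theta = {90 + offset_deg:3d} deg: exact {exact:.4f}, linear {linear:.4f}")
```

Around θ = 0° or 180° the slope of the same law vanishes, so a sensor biased there would respond only quadratically to small rotations of the free layer.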


Shields In the case of high-density sensing, the GMR or TMR structure must be
inserted between two soft magnetic shields which screen stray magnetic fields
coming from regions away from the region to be read (Fig. 14.32). These shields
are based on Permalloy (Fe20Ni80) or Sendust (Fe84Si10Al6). Shields can be
laminated with oxide layers (2-4 nm SiO2, for example) to reduce eddy currents,
since hard disk drive sensing operates at frequencies well above 100 MHz. One
shield of the sensor also acts as a shield for the write head, which is deposited
next to the read head.
GMR and TMR structures are more complicated than AMR structures. Because
of their larger responses (GMR 20%, TMR > 50% compared to AMR ≈ 2%)





Fig. 14.32 Two magnetic shields determine the high spatial resolution of the read head. Figure labels: top shield (soft magnet), data, MR multilayer, bottom shield (soft magnet).
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they have been implemented in high-density recording read heads for hard disk 
drives. The TMR effect is even larger (≈ 200%) when MgO barriers are used instead
of alumina barriers. When the full structure is taken into account (buffer layer,
capping layer, pinning layer, current leads, etc.), the MR ratio is much lower than
that of the best trilayers because of the contribution of the non-MR layers.

An example of a TMR sensor structure is: Ta (3 nm)/ Irz2,Mnz4 (25 nm)/
CogFeıg (4 nm) reference layer/ Al2O3 (1 nm)/ CogFeıs (3 nm)/ Ni80Fe20
(7 nm) free layer/ Ta (9 nm).


14.4.2.5 Magnetic RAM 
Once uniaxial anisotropies have been induced and an electrode has been pinned, 
GMR and TMR junctions possess two equilibrium positions with well-defined re- 
sistances (high state and low state). This is the basis for magnetic RAM (MRAM). 
Such a memory element can be inserted into a matrix architecture similar to that 
of semiconducting RAM. A word-line and a bit-line address each memory cell, 
which is enough to read the magnetic state (parallel or antiparallel). A transistor 
controls the opening of each line. To write a bit, the first generation of MRAM 
used two extra lines. Two current pulses create two magnetic field pulses around 
the lines. Only at the cross-point between the lines would the total applied mag- 
netic field be sufficient to reverse the free-layer magnetization (Fig. 14.33). 
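
The cross-point selection described above can be sketched as follows. This is a deliberately simplified illustration: the two field pulses are added as scalars and compared with a single switching threshold, whereas a real free layer responds to the vector sum of two roughly orthogonal fields. The function name, the grid size, and the field values (in arbitrary units) are all hypothetical.

```python
def write_bit(cells, row, col, value, h_word, h_bit, h_switch):
    """Return a copy of `cells` after a write attempt at (row, col).

    A cell on the selected word line sees h_word, a cell on the selected
    bit line sees h_bit, and only the cell at the crossing sees both.
    A cell is written only if its total field exceeds h_switch.
    """
    updated = [list(line) for line in cells]
    for i in range(len(cells)):
        for j in range(len(cells[i])):
            field = (h_word if i == row else 0.0) + (h_bit if j == col else 0.0)
            if field > h_switch:
                updated[i][j] = value
    return updated


# Each pulse alone (0.6) stays below the switching field (1.0);
# only their sum at the cross-point (1.2) exceeds it.
memory = [[0] * 4 for _ in range(3)]
memory = write_bit(memory, row=1, col=2, value=1,
                   h_word=0.6, h_bit=0.6, h_switch=1.0)
for line in memory:
    print(line)
```

Cells that share only one line with the selected cell are "half-selected"; the margin between their field and the switching field is what limits the cell selectivity of this writing scheme.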

Advantages favoring the development of MRAM are the nonvolatile character of
magnetic configurations (unlike DRAM, which suffers from electrical leakage), the
intrinsically fast write time (magnetization reversal of a nanomagnet takes less
than 1 ns, faster than flash memory), the unlimited cyclability of GMR and TMR,
and the possibility of downscaling the junction while keeping its resistance value
and its MR ratio. Magnetism is also less sensitive to radiation, especially
ionizing radiation, than semiconducting (electrostatic) memories. MRAM has been
called the universal memory. However, prototype capacities are still in the
megabyte range, so for the moment MRAM does not compete with multi-gigabyte memories.

A first commercial MRAM was proposed by Freescale in June 2006. It is a 4 Mb 
(= 512 kB) unit with one tunnel junction and one transistor per memory cell. A 
180 nm CMOS (complementary metal-oxide-semiconductor) process has been 
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Fig. 14.33 MRAM memory cell. Two lines make it possible to measure 
the junction resistance state (high or low); two lines make it possible to 
send current pulses creating magnetic field pulses larger than the free 
layer’s coercive field. 
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used and the read and write cycles are 35 ns long. The price is similar to that 
of DRAM or flash memories, so the price/bit is larger by a factor of 1000 than 
that of gigabyte semiconducting memories. To improve the cell selectivity, next- 
generation MRAM concepts include thermally assisted writing where a current 
is passed through the selected junction during writing to reduce the free layer’s 
coercive field. It has also been proposed that the spin torque mechanism be used. 
A large spin-polarized current pulse has been shown to be able to reverse 
the magnetization of a small element using a spin torque. No write lines would 
be necessary any more, improving the cell selectivity and the form factor of one 
cell. 


14.4.3 
Outlook 


Spin electronics is a very active research field due to the application potential of 
spin-dependent transport effects. Electron mean free path is at the nanometer 
scale, so the development of controlled nanostructures is required in order to 
enhance these spin-dependent effects. The mechanism details are not fully un- 
derstood and new effects such as spin injection and spin torque are still between 
theoretical proposals and experimental validation. The improvement in the fabri- 
cation of structures, the use and discovery of new materials (fully spin-polarized 
metals, room-temperature ferromagnetic semiconductors) lead to the conclu- 
sion that spin electronics has a great future both in research and in application 
laboratories. 


Further Reading 


Rossiter, P.L. (1987), The Electrical Resistivity of Metals and Alloys, Cambridge
University Press, Cambridge, UK.
Ziese, M. (Ed.) (2001), Spin Electronics, Lecture Notes in Physics Series,
Springer, Berlin.

For an introduction to quantitative treatment of GMR structures:
Valet, T., Fert, A. (1993), Phys. Rev. B, 48, 7099.


14.5 
Phase-Change Media 
Takeo Ohta 


14.5.1 
Electrically and Optically Induced Writing and Erasing Processes 


Great advances have been made in memory devices such as magnetic tapes, 
floppy disks, hard disk drives (HDDs), and semiconductor memories. Hard disks 
have high data transfer rates but generally do not have read-only functionality like 
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OTS Model: Carrier double injection 
(electronic process; Ovonic switch)

OMS Model: Percolation of crystalline state (thermal process)
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Fig. 14.34 Reversible electric current-voltage switching (I-V) model for 
an amorphous semiconductor switching device. OTS (Ovonic threshold
switch): switches back from the low-resistance (crystalline) state to the
high-resistance (amorphous) state below V_h. OMS (Ovonic memory
switch): the resistance state remains after the current is removed and
depends on the amount of current passed through the device.


that found in optical disks. With the increasing use of multimedia applications, 
phase-change rewritable optical disks are becoming more popular due to their 
flexibility of use and compatibility with pre-programmed read-only optical disks. 

Ovshinsky (1968) discovered the switching and memory effect in amorphous
semiconductors (the “Ovonic” effect). “Ovonic”1) is a name derived from the
Ovshinsky effect, which is the underlying characteristic of a broad class of
disordered and amorphous materials (Evans et al. 1970). It was named after
Stanford R. Ovshinsky (born 1922), an American inventor. His inventions opened a
new field of physics and materials research and, in 1971, laser-recorded
rewritable phase-change memory devices were announced (Feinleib et al. 1971).

I-V characteristics of two types of electronic switching phenomena in
chalcogenide materials are shown in Fig. 14.34. When a voltage pulse exceeding a
threshold value (V_th) (determined by the material and its thickness) is applied
to an Ovonic threshold switch (OTS) in the amorphous state, the device
instantly switches to a low-resistive state by a phase transformation from an
amorphous to a crystalline state, and the current flow increases dramatically.
When the voltage is decreased to a value less than the holding voltage (V_h), the
device returns to its high-resistive, amorphous state as indicated by the arrow in


1) The American Heritage Dictionary of the English Language: Ovonic: adj, of or
relating to a device whose operation is based on the Ovshinsky effect; Ovshinsky
effect: n, The effect by which a specific glassy thin film switches from a
nonconductor to a semiconductor upon application of a minimum voltage.
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Fig. 14.34. The basic disordered structure is maintained in the material in both 
states when appropriate chalcogenide alloy compositions are used to display the 
Ovonic threshold switching phenomena. 

An Ovonic memory switch (OMS) can be made using different compositions 
and bonding configurations. This switch also displays the threshold switching 
phenomenon of the OTS but, at specific current levels, a low-resistive state re- 
mains after the applied pulse is terminated. The decrease in resistance results 
from atomic restructuring into a crystalline configuration. The desired amor- 
phous or crystalline structure is obtained by varying the pulse amplitude and du- 
ration. The low-resistive crystalline state can be induced by applying a pulse with 
moderate amplitude and long duration (20-50 ns) (Lai and Lowrey 2001). The 
amorphous state is programmed using a pulse with a large amplitude and short 
duration (20 ns). The ability to retain the desired resistive state after the current 
has been removed allows the OMS to be used as nonvolatile memory (NVM). 

Whereas state changes in the OTS and OMS were induced with voltage pulses, 
the structure of the chalcogenide material in phase-change optical data storage 
devices is altered using a laser pulse. Changes in the atomic configuration state 
for optical memory and electronic memory (OMS) are shown in Fig. 14.35. 
Figure 14.35(a) illustrates atomic order (crystalline) and disorder (amorphous) 
phase-change states; Fig. 14.35(b) gives the temperature dependence of the en- 
thalpy of the crystalline and amorphous disordered states. A short, high-intensity 
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Fig. 14.35 Model of phase-change memory. (a) Schematic 
representation of the change between amorphous and crystalline
atomic configuration; (b) enthalpy change between the crystalline and 
amorphous states. 
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laser pulse increases the temperature of the crystalline film past the melting tem- 
perature Tm, starting at state A and passing through states B, C, and D (solid ar- 
rows in Fig. 14.35b). When the laser pulse is removed, the material cools rapidly 
without time for atomic rearrangement and ends up in the disordered amor- 
phous state, F. A longer, low-intensity laser pulse applied to an amorphous film, 
F, will raise the temperature above the glass transition temperature, T_g (E), and
crystallize the material (dotted arrows). 

When the chalcogenide alloy is switched from the amorphous to the crystalline
structure, the electrical resistance becomes similar to that of a metallic state;
it is around 1000 times lower than in the amorphous state. The optical
characteristics change as the optical absorption edge of the material shifts to a longer
wavelength. This is accompanied by a change in the complex refractive index 
N =n + ik (n is the refractive index and k is the extinction coefficient), which 
leads to a difference in reflectivity. The laser records low-reflective amorphous 
data bits in the high-reflective crystalline background. The data bits can be recrys- 
tallized (erased) for rewritable data storage (Ohta et al. 1989a). 

In the differential scanning calorimeter (DSC) scan for Ge2Sb2Te5 (Fig. 14.36)
an exothermic crystallization transition is visible at 130 °C and the endothermic
melting transition occurs at 610 °C (Yamada et al. 1991). The phase diagram of
the pseudo-binary GeTe-Sb2Te3 system is shown in Fig. 14.37 (Abrikosov and
Danilova-Dobroyakova 1965; Yamada et al. 1987; Suzuki et al. 1988; Ohta et al.
1989b). The melting temperatures for all the compounds in this system are
around 600 °C and the initial crystalline structure for all is face-centered cubic
(fcc). The optical constants and conductivities of the two structures are shown in
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Fig. 14.36 DSC results for Ge2Sb2Tes at a heating rate of 10 °C min™!. 
Taken from Yamada et al. (1991). Latent heat of crystallization at
130 °C = 8.5 kcal kg⁻¹; latent heat of crystalline-liquid transition at
610 °C = 16.3 kcal kg⁻¹.
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Fig. 14.37 Phase diagram for the GeTe-SbTe3 system (Abrikosov and 
Danilova-Dobroyakova 1965). 


Table 14.13 Optical constants and change in resistance of Ge2Sb2Te5 phase-change material.

                               Amorphous           Crystalline
Refractive index n             small, n_a = 4.9    large, n_c = 5.7
Extinction coefficient k       small, k_a = 1.4    large, k_c = 3.4
Electrical resistance R [Ω]    high (1E+6)         low (5E+3)
Chemical and physical characteristics: remarkably different





Table 14.13. Another phase-change optical disk material of the AgInSbTe system 
has been proposed for CD-RWs (rewritable compact disks) (Iwasaki 1997). 
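
As a rough illustration of how the optical constants in Table 14.13 translate into a reflectivity contrast, the sketch below evaluates the Fresnel reflectivity at normal incidence for a thick, bare film in air. This is a simplifying assumption made only for illustration: in an actual disk the phase-change layer is thin and embedded in a multilayer stack, so the real reflectivities differ. The function name is hypothetical.

```python
def normal_incidence_reflectivity(n, k):
    """Fresnel reflectivity |(N - 1)/(N + 1)|^2 for N = n + ik, from air."""
    refractive = complex(n, k)
    amplitude = (refractive - 1.0) / (refractive + 1.0)
    return abs(amplitude) ** 2


# Optical constants of Ge2Sb2Te5 from Table 14.13.
r_amorphous = normal_incidence_reflectivity(4.9, 1.4)
r_crystalline = normal_incidence_reflectivity(5.7, 3.4)

print(f"amorphous:   R = {r_amorphous:.3f}")
print(f"crystalline: R = {r_crystalline:.3f}")
print(f"difference:  {r_crystalline - r_amorphous:.3f}")
```

Even in this crude estimate the crystalline state reflects more strongly than the amorphous state, in line with the description of low-reflective amorphous marks on a high-reflective crystalline background.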


14.5.2 
Phase-Change Dynamic Model 


As storage densities increase, higher data transfer rates are desired. The crystalli- 
zation speed of the two structural changes in a material (amorphous to crystalline 
and crystalline to amorphous) determines the rate at which data can be over- 
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(a) Nucleation type: GeTe-Sb,Te, (b) Growth type: Sb, Tes, 
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Fig. 14.38 Nucleation and growth rates obtained by TEM (Nishi et al. 
2002) for two material systems with different crystallization
mechanisms: (a) GeTe-Sb2Te3; (b) Sb70Te30.


written and thus the speed of data transfer. There are two mechanisms for
crystallization in phase-change materials. Materials such as those of the
GeTe-Sb2Te3 pseudo-binary system undergo crystallization through a
nucleation-dominated process, where crystalline growth proceeds from crystalline
nuclei. Compositions based on the Sb70Te30 eutectic crystallize through a
growth-dominated process, where crystalline growth continues from existing
crystalline areas.

The nucleation and growth rates of both material systems are shown in Fig.
14.38 (Nishi et al. 2002). The GeTe-Sb2Te3 system (Fig. 14.38a) has a high
nucleation rate peaking at a temperature lower than that of the growth rate. The
nucleation and growth rates in this system are about the same. The Sb70Te30 system
(Fig. 14.38b) has a very low nucleation rate while the growth rate is much greater.
The peaks of both of these rates occur around the same temperature, just below
400 °C.

The speed of crystallization and consequently the overwrite speed can be
altered through changes in the structure of the material layers and through
changes in the laser modulation, or the write strategy. Recording simulations
are important in optimizing the desired effect (Nishi et al. 2002). The
Kolmogorov-Johnson-Mehl-Avrami (KJMA) equation [Eq. (39)] (see Chapter 7)
is often used to calculate the crystalline fraction in thermal modeling (Avrami
1939, 1940, 1941; Senkader and Wright 2004; Wright et al. 2004):


x = 1 - \exp(-k(T)\, t^{n})    (39)


Here x is the transformed crystal fraction, k(T) the rate constant, n the Avrami 
exponent relating to growth dimensions, t the time, and T the temperature. A 
schematic plot of a KJMA calculation of the crystalline fraction as a function of 
time is shown in Fig. 14.39(a). 
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Fig. 14.39 Crystallization analysis obtained with the KJMA equation 
[Eq. (39)]: (a) crystallized fraction as calculated with KJMA Eq. (39);
(b) KJMA plot [Eq. (40)] yielding the Avrami exponent.


Equation (39) can be rearranged into Eq. (40), which is useful for determining
the Avrami exponent n and the rate constant k (Avrami plot, Fig. 14.39b).


\ln(-\ln(1 - x)) = n \ln(t) + \ln k    (40)


The slope of the line in Fig. 14.39(b) yields the Avrami exponent, n, and the rate
constant, k, is obtained from the intercept.
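
The procedure of Eqs. (39) and (40) can be sketched numerically. In the example below, synthetic isothermal data are generated from Eq. (39) with hypothetical values of k and n, and a least-squares line through the points (ln t, ln(−ln(1 − x))) recovers both parameters. The function names and the numerical values are illustrative assumptions, not data from the measurements discussed in the text.

```python
import math


def kjma_fraction(t, k, n):
    """Crystallized fraction x = 1 - exp(-k t^n), Eq. (39), at fixed temperature."""
    return 1.0 - math.exp(-k * t ** n)


def avrami_fit(times, fractions):
    """Fit Eq. (40) by least squares; return (n, k) from slope and intercept."""
    xs = [math.log(t) for t in times]
    ys = [math.log(-math.log(1.0 - x)) for x in fractions]
    count = len(xs)
    mean_x = sum(xs) / count
    mean_y = sum(ys) / count
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx                      # Avrami exponent n
    intercept = mean_y - slope * mean_x    # ln k
    return slope, math.exp(intercept)


# Synthetic data with hypothetical parameters (time in arbitrary units).
K_TRUE, N_TRUE = 0.05, 2.5
times = [0.5 + 0.25 * i for i in range(12)]
fractions = [kjma_fraction(t, K_TRUE, N_TRUE) for t in times]

n_fit, k_fit = avrami_fit(times, fractions)
print(f"fitted n = {n_fit:.3f}, fitted k = {k_fit:.4f}")
```

With measured data the points scatter around the line, and fractions very close to 0 or 1 are usually excluded because the double logarithm amplifies their uncertainty.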

The Avrami exponent, n, and the rate constant, k(T), have been obtained by
reflectivity measurement on a nucleation-type Ge2Sb2Te5 phase-change film during
isothermal heating using a heater stage. With R(t) the reflectance measured at
time t of heating at temperature T, R_cry the reflectance of the crystalline state,
and R_am the reflectance of the amorphous state, the crystallized fraction x of
Eq. (39) can be written as the fraction of reflectance change [Eq. (41)].


x(t) = \frac{R(t) - R_{am}}{R_{cry} - R_{am}}    (41)


The measured results are shown in Fig. 14.40(a,b). The plot of Fig. 14.40(a)
includes the incubation time of the initial nucleation process, whereas in the
plot of Fig. 14.40(b) the incubation time is subtracted as t′ = t − t_inc. The
Avrami exponent of the Ge2Sb2Te5 film obtained with the incubation time included
is n = 5.8. The value obtained from the plot with the incubation time subtracted
is n = 2.5, which corresponds to nearly three-dimensional crystalline growth.
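
The effect of the incubation time on the apparent exponent can be reproduced with a small sketch. Reflectance is converted into a crystallized fraction with Eq. (41), and the slope of the Avrami plot is evaluated between two points, once against t and once against t′ = t − t_inc. All numerical values below (reflectances, rate constant, sampling times) are hypothetical; only the incubation time of 14.3 min and the exponent 2.5 are borrowed from Fig. 14.40 to make the example concrete.

```python
import math


def crystallized_fraction(r_t, r_am, r_cry):
    """Eq. (41): fraction of the total reflectance change reached at time t."""
    return (r_t - r_am) / (r_cry - r_am)


def avrami_ordinate(x):
    """Left-hand side of Eq. (40)."""
    return math.log(-math.log(1.0 - x))


def avrami_slope(t1, x1, t2, x2):
    """Slope of the Avrami plot between two points (apparent exponent n)."""
    return (avrami_ordinate(x2) - avrami_ordinate(x1)) / (math.log(t2) - math.log(t1))


R_AM, R_CRY = 0.10, 0.30                   # hypothetical reflectances
T_INC, K_TRUE, N_TRUE = 14.3, 0.02, 2.5    # minutes, arbitrary, dimensionless


def synthetic_reflectance(t):
    """Reflectance that stays amorphous until T_INC, then follows Eq. (39)."""
    if t <= T_INC:
        return R_AM
    x = 1.0 - math.exp(-K_TRUE * (t - T_INC) ** N_TRUE)
    return R_AM + x * (R_CRY - R_AM)


t1, t2 = 16.0, 20.0
x1 = crystallized_fraction(synthetic_reflectance(t1), R_AM, R_CRY)
x2 = crystallized_fraction(synthetic_reflectance(t2), R_AM, R_CRY)

n_raw = avrami_slope(t1, x1, t2, x2)
n_corrected = avrami_slope(t1 - T_INC, x1, t2 - T_INC, x2)
print(f"apparent n with incubation time included:   {n_raw:.2f}")
print(f"apparent n with incubation time subtracted: {n_corrected:.2f}")
```

Plotting against the uncorrected time compresses the logarithmic time axis and therefore inflates the slope, which is the same trend as the difference between n = 5.8 and n = 2.5 reported above.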

Introducing in Eq. (39) a nucleation factor I(t) and a growth factor U(t) (as there
is three-dimensional volume growth of (4π/3)R³), an extended KJMA equation,
Eq. (42), results (Davies 1976, Wright et al. 2004).


x(t) = 1 - \exp\left(-\frac{4\pi}{3} \int I\, R^{3}\, dt\right)    (42)
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(a) t(min) 
[Fig. 14.40 axes: ln{−ln(1 − x(t))} versus t (min); panels (a) and (b)]
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Fig. 14.40 Measurement of the Avrami coefficient n with a KJMA plot: 
(a) KJMA plot including the incubation time: t_inc = 14.3 min; n = 5.8;
(b) KJMA plot with the incubation time subtracted: t′ = t − t_inc; n = 2.5.


Equation (42) can be used to calculate the nucleation and growth rates for
Ge2Sb2Te5. The temperature dependences of the nucleation rate and the growth
rate are shown in Fig. 14.41(a,b), respectively (Nishi et al. 2002). The results
correlate well with the experimental data for the nucleation rate and the
crystalline growth rate in Fig. 14.38(a). This correspondence is evidence that the
extended KJMA equation can be used in thermal modeling of amorphous mark formation.
An example of such a calculation is shown in Fig. 14.42 (Nishi et al. 2002). The
laser recording strategy (laser modulation pattern) is shown in Fig. 14.42(a). Here
the laser pulse width used is t_w = 17.1 ns, the linear disk velocity LTV = 8.2
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Fig. 14.41 (a) Nucleation rate I(T) and (b) growth rate U(T) of 
Ge2Sb2Te5 as calculated by the extended KJMA equation [Eq. (42)]. The
parameters on the curves indicate the radius (nm) of the clusters.


m s⁻¹, the laser wavelength λ = 650 nm, and the numerical aperture of the lens
NA = 0.6. The formation of an amorphous mark for the nucleation-type material
GeSbTe is shown in Fig. 14.42(b) for t = 0-4.5 t_w in steps of 0.5 t_w, as
calculated with the above formalism. This modeling shows that the trailing edge of
the amorphous mark is influenced by thermal diffusion, which indicates how the
write strategy can be modified to control mark formation and position. Thermal modeling
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Fig. 14.42 Simulation of amorphous mark formation for GeSbTe as 
calculated by the extended KJMA model: (a) laser write strategy (laser
modulation pattern) to write a 3T mark (T = t_w = 17.1 ns, pulse width);
(b) amorphous mark formation for times t = 0-4.5 t_w.
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Fig. 14.43 Increasing crystallization of phase-change materials by 
addition of elements (Jiang and Okuda 1991): (a) time-temperature
transformation curve; (b) variation of critical cooling rate with reduced
temperature, T_g/T_m.


results for a growth-type material, Ag_xIn_y(Sb70Te30)_z, show that the mark edge
shrinks much less in these materials (Nishi et al. 2002).

The speed of crystallization of growth-type materials can be affected by the
addition of transition and main group metals. A temperature-time-transformation
(T-T-T) plot for various additional elements is shown in Fig. 14.43(a) (Jiang and
Okuda 1991). The curves represent a crystallized volume fraction of 10⁻⁶. The
point on each curve indicated with a full circle (T_n, t_n) corresponds to the
shortest time, t_n, for crystallization (fraction of 10⁻⁶), occurring at temperature T_n.

The critical cooling rate, R_c, can be calculated by Eq. (43) using t_n and T_n and
the melting temperature T_m.


R_c = \frac{T_m - T_n}{t_n}    (43)


The relationship between R_c and T_g/T_m for some elements is shown in Fig.
14.43(b). The elements at the top of the curve, Sb, Ag, and Cu, are predicted to
contribute to fast crystallization (Okuda et al. 1992). Table 14.14 shows the
critical parameters related to log R_c for these elements. From Table 14.14, the
critical cooling rate of Sb can be determined as R_c = 4.1 × 10⁹ K s⁻¹, a value
close to the measured critical cooling rate of GeSbTe, R_c = 3.4 × 10⁹ K s⁻¹
(Ohta et al. 1989a).
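
The values of log R_c in Table 14.14 can be checked against Eq. (43), taken here in the form R_c = (T_m − T_n)/t_n. The sketch below does this for all listed elements; the input numbers are copied from the table, while the function and variable names are illustrative.

```python
import math

# Element: (T_m [K], T_n [K], t_n [s], tabulated log10 R_c) from Table 14.14.
TABLE_14_14 = {
    "Sb": (895, 600, 7.41e-8, 9.61),
    "Ag": (1235, 800, 2.98e-7, 9.16),
    "Cu": (1357, 900, 3.38e-7, 9.13),
    "Co": (1768, 1200, 6.54e-7, 8.94),
    "Pb": (601, 375, 7.07e-7, 8.50),
    "Te": (722, 500, 5.75e-6, 7.54),
    "Ge": (1212, 945, 8.96e-4, 5.47),
}


def critical_cooling_rate(t_melt, t_nose, time_nose):
    """Eq. (43): cooling rate needed to pass the nose of the T-T-T curve [K/s]."""
    return (t_melt - t_nose) / time_nose


computed_logs = {}
for element, (t_melt, t_nose, time_nose, log_table) in TABLE_14_14.items():
    rate = critical_cooling_rate(t_melt, t_nose, time_nose)
    computed_logs[element] = math.log10(rate)
    print(f"{element}: R_c = {rate:.2e} K/s, log10 R_c = {computed_logs[element]:.2f} "
          f"(table: {log_table:.2f})")
```

The computed and tabulated logarithms agree to within a few hundredths for every element, which supports reading Eq. (43) in this form.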

Phase-change optical disks are manufactured with a four-layer structure on 
a polycarbonate substrate (bottom dielectric layer ~ 100 nm; phase-change 
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Table 14.14 Estimated values of T, and T,/T, and calculated values of 
T_n, t_n and log R_c of certain elements.

Element   T_m [K]   T_g [K]   T_g/T_m   T_n [K]   t_n [s]    log10 R_c [K s⁻¹]
Sb        895       182       0.20      600       7.41E-8    9.61
Ag        1235      250       0.20      800       2.98E-7    9.16
Cu        1357      298       0.22      900       3.38E-7    9.13
Co        1768      445       0.25      1200      6.54E-7    8.94
Pb        601       152       0.25      375       7.07E-7    8.50
Te        722       285       0.39      500       5.75E-6    7.54
Ge        1212      750       0.62      945       8.96E-4    5.47





layer ~ 20 nm; upper dielectric layer ~ 20 nm; metal reflection layer ~ 100 nm). 
The dielectric layers serve two purposes in controlling the temperature of the 
phase-change material. First, they offer thermal protection by the dissipation of 
heat. Second, they are used in the optical design of the layer structure to optimize 
laser absorption efficiency (contrast between the amorphous and crystalline 
reflectance). The metal reflection layer is necessary for the quenching process 
involved in creating amorphous marks. The optical, mechanical, and thermal 
characteristics of optical disk layers are listed in Table 14.15. 

Early in their development, phase-change rewritable disks had limited overwrite
cycle performance. Amorphous marks are formed by laser heating above
the melting temperature, 600 °C. Erasing of marks through crystallization
requires temperatures of at least about 400 °C. When optical disks were first being
developed in the 1980s, the cycle life was projected to be 10-100 cycles. But there
are essential differences between the phase-change optical disk, with its thin
layered structure, and the usual metallurgical melt solidification process. The
dimensions on a disk are localized: time in nanoseconds and space in nanometers.
Diffusion of materials and segregation of elements are confined to nanoscale
dimensions. Crystallization and melt-quenching on these scales are very different
from common metallurgical processes, which are measured in hours and tons. A
breakthrough in cycle performance was made by looking closely at the two main
cycle degradation mechanisms: first, the increase in noise, and second, the signal
pulse dropout with increasing number of write-erase cycles.

The first degradation mechanism was identified as grain growth in the protective
dielectric layers (bottom and upper layers). The original dielectric layer
material was ZnS; after overwrite cycling the grain size increased
and therefore the noise level also increased. A new dielectric protection layer was
found in a mixture of ZnS and SiO2 with a composition of (ZnS)80(SiO2)20. A
comparison of the grain structures of ZnS and ZnS-SiO2 films is shown in Fig.
14.44. The grains of ZnS-SiO2 after annealing at 700 °C for 5 min were very
small, ~2 nm, while the grains of ZnS became rather large (about 50 nm) and
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Table 14.15 The optical, mechanical, and thermal properties of materials. 





Material                        Refractive index   Density    Young's modulus   Poisson's   Specific heat    Thermal conductivity   Coeff. of linear
                                (λ = 830 nm)       [kg m⁻³]   [N m⁻²]           ratio       [J kg⁻¹ K⁻¹]     [W m⁻¹ K⁻¹]            expansion [K⁻¹]
GeTe-Sb2Te3-Sb (amorphous)      4.9 + 1.4i         6150       5.49 × 10¹⁰       0.33        0.209 × 10³      0.581                  1.1 × 10⁻⁵
GeTe-Sb2Te3-Sb (crystalline),   5.7 + 3.4i         6150       5.49 × 10¹⁰       0.33        0.209 × 10³      0.581                  1.1 × 10⁻⁵
  2:1:0.5 mol. ratio
ZnS-SiO2, 4:1 mol. ratio        2.0                3650       7.81 × 10¹⁰       0.2         0.563 × 10³      0.657                  7.4 × 10⁻⁶
SiO2                            1.46               2202       7.81 × 10¹⁰       0.2         0.753 × 10³      0.313                  5.5 × 10⁻⁷
Al alloy                        2.2 + 7.5i         2750       7.03 × 10¹⁰       0.345       0.892 × 10³      0.215 × 10³            2.2 × 10⁻⁵
Polycarbonate                   1.58               1200       2.26 × 10⁹        0.3         0.126 × 10⁴      0.223                  7.0 × 10⁻⁵

Latent heat: crystal-liquid: 0.682 × 10⁵ J kg⁻¹; amorphous-liquid:
0.356 × 10⁵ J kg⁻¹.

Fig. 14.44 TEM of grain structure of (a) ZnS and (b) ZnS-SiO2 layers after
annealing for 5 min at 700 °C (Ohta et al. 1990).

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Fig. 14.45 The effect of an additional SiO protection layer: 2 x 106 
overwrite cycles of phase-change optical disk (Ohta et al. 1989b).


were still growing to 100 nm and more with longer annealing time (Inoue et al.
1992). Grain growth in ZnS-SiO2 was not observed after annealing at 700 °C
for 5 min and the cycle characteristics were improved for more than 1000
cycles.

The second mechanism of overwrite cycle life degradation concerns the
decrease in the absorbed laser pulse during write-erase cycles. This is related to
a change in layer thickness along the track. The reason for this is a
displacement of the phase-change layer material by deformation of the disk layers,
which is due to thermal expansion and contraction during the recording and erasing
process. This deformation can be reduced by adding a layer which has a small
thermal expansion coefficient. The additional layer is inserted between the
phase-change and the upper dielectric layers. The thermal expansion coefficient
of SiO2 (5.5 × 10⁻⁷ K⁻¹) is less by a factor of more than 10 than that of ZnS-SiO2
(6.1 × 10⁻⁶ K⁻¹). The addition of an SiO2 layer extended the overwrite cycle life
from about 10⁵ to over 2 × 10⁶ cycles, as shown in Fig. 14.45 (Ohta et al. 1989b).


14.5.3 
Alternative Functions 


Phase-change optical CDs and DVDs are widely used for data storage; they
include the next-generation products which are increasing their share of the
market, namely Blu-ray Disc (BD) with a capacity of 50 GB and high-density DVD
(HD-DVD) with a capacity of 36 GB. Phase-change materials are also used more
and more as electrical memory, for example as nonvolatile memory in place of
flash memory and other memories such as SRAM (static random access memory)
and DRAM (dynamic random access memory).


Fig. 14.46 Operation of Ovonic unified memory (OUM): oscilloscope
traces of the voltage drop across an OUM cell during repetitive cycling
(5 MHz programming). Read pulse voltage V = 0.2 V (Gill et al. 2002).

To use phase-change materials as electrical memory, a simple device structure
is preferred (Lai and Lowrey 2001). Operation characteristics of the so-called
Ovonic Unified Memory (OUM) are shown in Fig. 14.46 (Gill et al. 2002). A set
pulse of V = 0.6 V, t = 85 ns, is used to change the material from high
resistance (85 kΩ) to low resistance (2 kΩ). A reset pulse of V = 0.8 V,
t = 8 ns, changes the material back to high resistance. The energy consumption
is calculated to be 2.6 pJ, which is much less than the 0.8 nJ consumed by
phase-change material in an optical disk.
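
To put the OUM numbers quoted above side by side, the following short Python
sketch computes the resistance contrast between the two states and the ratio of
the optical to the electrical switching energy. The variable names are chosen
here for illustration only; the input values are the ones given in the text.

```python
# Values quoted in the text for the OUM cell and for optical recording.
RESET_RESISTANCE_OHM = 85e3   # high-resistance (amorphous) state, 85 kOhm
SET_RESISTANCE_OHM = 2e3      # low-resistance (crystalline) state, 2 kOhm
OUM_ENERGY_J = 2.6e-12        # energy quoted for the electrical cell, 2.6 pJ
OPTICAL_ENERGY_J = 0.8e-9     # energy quoted for the optical disk, 0.8 nJ

# Ratio of the two resistance states (read-out contrast).
resistance_contrast = RESET_RESISTANCE_OHM / SET_RESISTANCE_OHM

# How many times less energy the electrical cell needs than the optical disk.
energy_ratio = OPTICAL_ENERGY_J / OUM_ENERGY_J

print(f"resistance contrast: {resistance_contrast:.1f}")
print(f"optical / electrical energy: {energy_ratio:.0f}")
```

With these inputs the two states differ in resistance by a factor of about 40,
and the electrical cell needs roughly 300 times less energy than the optical
disk.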

In the Ovonic memory switching (OMS) process, the current flow heats the 
high-resistance amorphous material sufficiently to crystallize it, giving a low- 
resistance state. A short, high-intensity pulse melts the crystalline filament, 
which then rapidly cools back to the amorphous state. These devices will cycle 
more than $10^{13}$ times. The electrical memory has a much longer cycle life than
the optical memory because of the fixed operation of the phase-change material.
Movement of atoms is minimal in electrical switching, while the dynamic scan-
ning of the laser in optical memory induces large movement, and thus potential
for failure in the material.


Fig. 14.47 Resistance characteristics of phase-change RAM (PRAM). Set
pulse: 80 ns; reset pulse: 20 ns.


An example of a change in resistance by a supplied current pulse for phase- 
change RAM (PRAM) is shown in Fig. 14.47. When the pulse amplitude is lower 
than the set current, the device does not change state and the resistance remains 
constant. At the set current, the resistance drops to a low-resistance state. 

What is the speed limit of the phase change between the amorphous and crys- 
talline states? Applications requiring high-speed switching include fiber commu- 
nications and high-resolution laser processing. Lasers having pulse widths on the 
order of pico- and femtoseconds have enabled new studies of materials to be 
carried out (Miura et al. 1997). The recording speed of the phase-change optical 
memory has been increased from 10 Mb s$^{-1}$ to 140 Mb s$^{-1}$ (Kato et al. 2001).
Now femtosecond laser pulses will provide an opportunity to develop materials 
and devices for even higher-speed phase-change optical memory. 

Due to thermal diffusion, the region of phase-change material heated by a laser
is wider than the laser spot. The size and position of a mark are determined
not only by the diffraction-limited spot size ($\lambda/2\mathrm{NA}$), but also
by the pulse duration and thermal characteristics of the disk. The response of
phase-change memory to femtosecond laser pulses was measured (Morilla et al.
1997; Ohta et al. 2001). Marks formed with 60 ns and 120 fs pulses are shown in
Fig. 14.48. The main difference between the conventional 60 ns pulse and the
120 fs pulse is the recrystallized band around the amorphous mark. The longer
pulse, Fig. 14.48(a), shows a large-grain crystalline band around the mark,
while the mark made with the short pulse, Fig. 14.48(b), does not. The data
rate of phase-change optical memory recorded with femtosecond laser pulses is
expected to be more than 1 Tb s$^{-1}$.


Fig. 14.48 Comparison by TEM of marks made with (a) nanosecond
and (b) femtosecond pulses (Wang et al. 2003): (a) conventional laser
recording: λ = 780 nm, NA = 0.5, t = 60 ns; (b) femtosecond pulse
laser recording: λ = 780 nm, NA = 0.95, t = 120 fs.


Fig. 14.49 Change in reflected intensity as a function of delay time (t)
after exposure to a femtosecond laser pulse (pulse width 120 fs): mean
fluence = 30 mJ cm$^{-2}$, $\Delta I = I(t) - I_a$ ($I_a$ is the reflection
intensity of the as-deposited amorphous state).


Fig. 14.50 Behavior of an Ovonic electrical cognitive device (Ovshinsky
2003; Ovshinsky and Pashmakov 2004): change in resistance versus
number of set pulses.
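
As a rough illustration of the diffraction-limited spot size
$\lambda/2\mathrm{NA}$, the following Python sketch evaluates it for the two
recording conditions given for Fig. 14.48 (λ = 780 nm with NA = 0.5 and
NA = 0.95). The function name is chosen here for illustration; the result
ignores pulse duration and thermal diffusion, which also affect the mark size.

```python
def diffraction_limited_spot_nm(wavelength_nm, numerical_aperture):
    """Return the spot size lambda / (2 NA) in nanometers."""
    if wavelength_nm <= 0 or numerical_aperture <= 0:
        raise ValueError("wavelength and numerical aperture must be positive")
    return wavelength_nm / (2.0 * numerical_aperture)


# Optical parameters quoted for the two recordings compared in Fig. 14.48.
spot_conventional_nm = diffraction_limited_spot_nm(780.0, 0.5)
spot_femtosecond_nm = diffraction_limited_spot_nm(780.0, 0.95)

print(f"NA = 0.50: {spot_conventional_nm:.0f} nm")
print(f"NA = 0.95: {spot_femtosecond_nm:.0f} nm")
```

The higher numerical aperture alone reduces the optical spot from 780 nm to
about 410 nm; any further difference between the two marks has to come from the
pulse duration and the thermal response of the disk.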


T. C. Chong and his group investigated crystallization using a femtosecond
laser (Wang et al. 2003). The reflectivity change measured by time-resolved
microscopy after a 130 fs pulse is shown in Fig. 14.49. The results show that
the reflectivity continues to change long after the end of the pulse.

Ovshinsky recently proposed another use for phase-change materials: cognitive
switching (Ovshinsky 2003; Ovshinsky and Pashmakov 2004). The behavior of an
Ovonic electrical cognitive device is shown in Fig. 14.50. No change is obtained
until a certain number of pulses have been applied, after which the device
switches to a low-resistance state. The device “remembers” previous pulses
applied and only switches once the threshold pulse number is reached. A
comparison of Ovonic switching memory and Ovonic cognitive response is shown in
Fig. 14.51. A possible explanation of the cognitive function is that nucleation
and grain growth begin during the application of the initial pulses. At the
threshold pulse, percolation occurs and the device switches to low resistance.
This provides, in a single device, functionality similar to that of a
neurosynaptic cell in the brain.


Fig. 14.51 Comparison of an Ovonic memory device and an Ovonic
cognitive device: (a) PRAM (OUM): binary memory; (b) cognitive
function: processing and memory.
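
The accumulate-then-switch behavior described above can be mimicked with a very
small toy model. The Python sketch below is not a physical simulation of
nucleation or percolation; the class name, the threshold of 5 pulses, and the
two resistance values are all hypothetical and serve only to show the logic of
a device that counts pulses and switches once a threshold is reached.

```python
class ToyCognitiveCell:
    """Toy model: resistance stays high until a threshold pulse count is hit."""

    def __init__(self, threshold_pulses, high_ohm, low_ohm):
        if threshold_pulses < 1:
            raise ValueError("threshold_pulses must be at least 1")
        self.threshold_pulses = threshold_pulses
        self.high_ohm = high_ohm
        self.low_ohm = low_ohm
        self.pulses_applied = 0  # the cell's "memory" of earlier pulses

    @property
    def resistance(self):
        """High resistance below the threshold, low resistance from it onward."""
        if self.pulses_applied >= self.threshold_pulses:
            return self.low_ohm
        return self.high_ohm

    def apply_pulse(self):
        """Register one set pulse and return the resulting resistance."""
        self.pulses_applied += 1
        return self.resistance


# Hypothetical parameters, chosen only for illustration.
cell = ToyCognitiveCell(threshold_pulses=5, high_ohm=85e3, low_ohm=2e3)
history = [cell.apply_pulse() for _ in range(6)]
print(history)
```

In this sketch the first four pulses leave the resistance unchanged and the
fifth switches the cell, which is the qualitative shape of the curve in
Fig. 14.50.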


14.5.4 
Outlook 


Phase changes have been a well-known phenomenon for a long time (see Chapters
1 and 7). After Ovshinsky discovered the memory effect now named the Ovonic
effect, it took around 20 years, through the proposal of new materials and
developments in device structure, to establish phase-change rewritable optical
disks such as CD-RWs, DVD-RAMs, BDs, and HD-DVDs. Laser marks on the disks are
getting smaller and smaller, and are today about 100 nm in size, i.e., in the
nanometer range. Research in the field of phase-change electronic memory
(PRAM) is very exciting at the moment with respect to the development of
nonvolatile memory: the flash memory market, for example, is more than
doubling every year. PRAM promises a higher data storage density than flash
memory and also has overwrite characteristics at DRAM speed; it is therefore
a candidate for replacing DRAM.

The main pathway of scientific research and technical development in the field
of phase-change media will involve:

• gaining a more detailed understanding of nucleation and growth parameters
  during the phase change
• increasing the phase-change speed limit
• exploring the threshold switching mechanisms
• searching for corresponding phase changes which do not need
  crystallographic long-range order.


In any case, further research in the field of materials physics with respect to 
phase-change mechanisms and size limitations will be necessary to secure an im- 
portant place in the nanostructure device market for phase-change media in the 
future.


14.6

Superconductors 


Harald W. Weber 


14.6.1 
Fundamentals 


The discovery of superconductivity by Heike Kamerlingh Onnes (in 1911) was 
prompted by two major developments at the beginning of the 20th century: firstly 
the successful liquefaction of the “last gas,” i.e., helium, by Onnes himself in 
1908, which opened a new field of research at very low temperatures (4.2 K and 
below); and secondly, the quest, mainly by solid-state theorists, to understand the 
temperature dependence of the electrical resistivity, especially at temperatures 
close to absolute zero. Consequently the metal that could be prepared with the 
smallest amount of impurities at that time, Hg, was chosen by Onnes for his
investigations of the resistivity as a function of temperature for $T \to 0$.
The result, i.e., the complete disappearance of dc resistivity at a certain
(low) temperature, the transition temperature $T_c$, set the stage for a
fascinating new field of research that has been keeping theorists and
experimentalists busy for nearly 100 years
now. The final breakthrough onto the market in the electric power industry, med- 
ical diagnostics, metrology, and electronics (“key technology of the 21st century”) 
is expected in the very near future. 

It soon turned out that ideal conductivity was accompanied by perfect
diamagnetism ($\chi = -1$, the “Meissner effect”), i.e., the complete expulsion
of magnetic flux from the interior of the superconductor. These two unique
properties of the solid state are linked, as already indicated by the first
phenomenological theory of the electrodynamics of superconductivity in the
early 1930s by the London brothers, who showed that “supercurrents” flowing in
a thin surface layer (of dimension $\lambda$, the magnetic penetration depth,
~100 nm) were able to shield the interior completely from flux penetration.
Inserting typical numbers, we find current densities, $J_c$, of these shielding
currents of the order of $10^{11}$ A m$^{-2}$, but, because the current flow is
restricted to these thin surface layers, currents of only around 70 A (e.g.,
through a “massive” wire with a diameter of 1 mm). Another issue refers to the
stability of the superconducting phase. It was found very early on that both
the ideal conductivity and the perfect diamagnetism could be suppressed at
temperatures well in the superconducting regime ($T < T_c$) either by a certain
current, the critical current $I_c$, or by a certain magnetic field, the
critical field $H_c$, which are again linked through a simple relation,
$H_c = I_c/2\pi r$ (Silsbee), where $r$ is the wire radius, i.e., a phase
transition back into the normal conducting state can be induced at any
temperature in the superconducting phase (Fig. 14.52a). The temperature
dependence of these critical parameters follows a parabolic law.
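
A minimal Python sketch of these two relations follows. It assumes the usual
empirical form of the parabolic law, $H_c(T) = H_c(0)\,[1 - (T/T_c)^2]$, and
Silsbee's rule written as $I_c = 2\pi r H_c$ for a round wire of radius $r$.
The function names and all numerical inputs ($H_c(0)$, $T_c$, wire radius) are
hypothetical values chosen for illustration, not data for a specific material.

```python
import math


def critical_field(h_c0, temperature, t_c):
    """Parabolic law H_c(T) = H_c(0) * (1 - (T/T_c)**2); zero at and above T_c."""
    if t_c <= 0 or temperature < 0:
        raise ValueError("t_c must be positive and temperature non-negative")
    if temperature >= t_c:
        return 0.0
    return h_c0 * (1.0 - (temperature / t_c) ** 2)


def silsbee_critical_current(h_c, wire_radius):
    """Silsbee's rule: current whose self-field at the wire surface equals H_c."""
    return 2.0 * math.pi * wire_radius * h_c


# Hypothetical inputs, for illustration only.
H_C0 = 6.0e4       # critical field at T = 0, in A/m
T_C = 7.0          # transition temperature, in K
RADIUS = 0.5e-3    # wire radius, in m (1 mm diameter)

h_c_42 = critical_field(H_C0, 4.2, T_C)
i_c_42 = silsbee_critical_current(h_c_42, RADIUS)

print(f"H_c(4.2 K) = {h_c_42:.0f} A/m")
print(f"I_c(4.2 K) = {i_c_42:.1f} A")
```

Because $I_c$ is proportional to $H_c$ in this picture, the critical current
inherits the same parabolic temperature dependence as the critical field.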

Finding a suitable explanation for the occurrence of superconductivity in
metals, alloys, and compounds proved to be one of the most demanding challenges
for the community of solid-state theorists and took nearly 50 years in the end.
The fundamental idea put forward by Bardeen, Cooper, and Schrieffer (“BCS
theory,” in 1957) is based on a mechanism that can be summarized as follows. At
sufficiently low temperatures, a quantum mechanical exchange interaction
mediated by the lattice vibrations (phonons) becomes strong enough to establish
an attractive interaction between two valence electrons (with opposite wave
vector and spin) which overcomes the (partly strongly reduced) Coulomb
repulsion between these charge carriers in the lattice and leads to a definite
correlation between them (Cooper pair formation) over a certain distance (BCS
coherence length $\xi_0$, 10-100 nm), which can be understood as the spatial
extension of this new particle (Cooper pair). In view of this “huge” size on
the scale of the crystal lattice (~0.1 nm), these new charge carriers
(responsible for the current transport in the superconducting state) are
strongly correlated and strongly overlapping. Moreover, due to the spin
selection condition, the Cooper pair has a total spin of 0, and therefore
escapes the limitations set up by the Fermi-Dirac statistics and the Pauli
exclusion principle, thus rendering the usual solutions for the density of
states at the Fermi level unstable and opening an energy gap $\Delta(T)$ at the
Fermi energy $E_F$. Furthermore, all Cooper pairs are allowed to attain the same
quantum mechanical state and can be described by a single wavefunction with an
amplitude that corresponds to the density of Cooper pairs in the metal, and
with a well-defined phase, thus making superconductivity a “macroscopic quantum
phenomenon” or a “phase coherent state” with wide consequences. Firstly, being
in the same quantum mechanical state, individual Cooper pairs cannot interact
with the crystal lattice or any of its defects, i.e., scattering processes by
impurities are impeded and the resistance drops to zero. Secondly, this
mechanism prevails only so long as any energy transferred to the superconductor
does not exceed the “binding energy” of the Cooper pairs, i.e., the strength of
the attractive interaction provided by the electron-phonon mechanism
(“condensation energy”, $\mu_0 H_c^2/2$ per unit volume). In other words, if
this energy is exceeded, e.g., by transport currents or external fields, the
Cooper pairs break up and the normal conducting state is re-established (even
at temperatures well below $T_c$), thus explaining the existence of the critical
parameters ($I_c$, $J_c$, $H_c$) introduced above ($J_c$ is actually called the
“depairing” critical current density $J_{c,\mathrm{dep}}$). Thirdly, the
electron-phonon coupling is directly reflected by the magnitude of the energy
gap $\Delta(T)$ at the Fermi energy $E_F$ (a few milli-electronvolts at typical
Fermi energies of a few electronvolts) and its influence on the density of
states in the superconducting phase $N_s(E)$. $\Delta(T)$ is of course
correlated with the condensation energy (e.g., at $T = 0$,
$N(E_F)\Delta(T)^2/2$, where $N(E_F)$ denotes the density of states at the Fermi
energy) and is assessed best by tunnel experiments between two thin
superconducting films separated by an insulating layer (Giaever et al. in 1961).
An outstanding result related to the theoretical analysis of such tunnel
contacts should be mentioned here, namely the prediction of tunneling Cooper
pairs by Brian Josephson in 1962 (the “Josephson effect”), i.e., the flow of
phase-coherent supercurrents across such barriers. This paved the way for the
development of one of the most important fields for applications of
superconductivity (SQUIDs, superconducting quantum interference devices), which
are based on a phase modulation of the macroscopic phase-coherent wavefunction
by the vector potential.


Fig. 14.52 Magnetization curves of superconductors: (a) type I
superconductor; (b) type II superconductor; (c) type II superconductor
with flux pinning.

All of these fundamental predictions (and later extensions of the basic ideas)
were subsequently confirmed by experiment and therefore provide us with a solid
knowledge of the microscopic mechanisms leading to superconductivity. However,
gaining a detailed understanding of the behavior of superconductors in magnetic
fields was a slow process. It was soon shown that the simple picture of the
Meissner effect dominating the magnetic phase diagram could not be confirmed by
experiment, because magnetization curves demonstrated that magnetic flux had to
be able to penetrate the superconductor in some way under some circumstances.
Using Landau’s theory of second-order phase transitions as a basis, Ginzburg
and Landau (in 1950) introduced a phenomenological theory for local variations
of the “order parameter” $\psi(r)$ (whose square turned out to be the “local”
density of Cooper pairs) and the “local field” within the superconductor (GL
equations), which led to a classification of their magnetic properties
according to the relative size of these characteristic variation lengths, i.e.,
the spatial variation distance of the Cooper pair density, $\xi_{GL}$, the GL
coherence length, and the variation distance of the local field, $\lambda$, the
magnetic field penetration depth. Their ratio, $\kappa = \lambda/\xi_{GL}$, the
GL parameter, proved decisive for the energy provided by phase boundaries
between the normal and superconducting phase in the material, which is positive
for $\kappa < 1/\sqrt{2}$ and negative for $\kappa > 1/\sqrt{2}$. Accordingly,
type I superconductors, i.e., those with $\kappa < 1/\sqrt{2}$, will generally
not allow such phase boundaries to be formed (because they would enhance the
total free energy of the system) and complete flux expulsion determines their
magnetic phase diagram. On the other hand, type II superconductors, i.e., those
with $\kappa > 1/\sqrt{2}$, may take advantage of the negative phase boundary
energy to reduce their total free energy. This situation was investigated in
detail by the theorist A. A. Abrikosov (in 1957), who found “spectacular”
solutions for the current and field distributions in these materials, as
follows. At a certain critical field (Fig. 14.52b), the lower critical field
$H_{c1}$, the formation of field-containing areas becomes energetically
favorable for the first time, i.e., the loss of condensation energy by the
formation of these field-containing areas is compensated by the energy gain
provided by the negative phase boundary energy. The field penetrates in the
form of so-called “flux lines,” i.e., normal conducting spots, surrounded by
circulating supercurrents (“vortices”), which build up the local field and form
a regular hexagonal lattice throughout the superconductor. Due to its special
spatial configuration, each of the flux lines carries exactly one magnetic flux
quantum, $\phi_0 = h/2e = 2.067 \times 10^{-15}$ Wb. With increasing field, the
lattice spacing of the flux line lattice becomes smaller, and the normal
conducting cores finally overlap at a certain critical field, the upper
critical field $H_{c2}$, where a second-order phase transition into the normal
conducting state occurs. This field,
$H_{c2} = \sqrt{2}\,\kappa H_c = \phi_0/2\pi\mu_0\xi_{GL}^2$, can obviously
become very much larger than the thermodynamic critical field $H_c$, depending
on the magnitude of $\kappa$ or $\xi_{GL}$, thus extending the stability range
of superconductivity in this so-called “mixed state” to very high magnetic
fields. The final step was achieved by Gor’kov (in 1958 and 1959), who showed
that the order parameter $\psi(r)$ of the phenomenological GL theory was
proportional to the gap parameter $\Delta(r)$ of BCS theory, thus linking GL
theory to the microscopic theory of superconductivity, the proportionality
being governed by the so-called Gor’kov function $\chi(\alpha)$, where
$\alpha = 0.882\,\xi_0/l$ is the “impurity” parameter, $\xi_0$ denotes the BCS
coherence length at $T = 0$ in the “clean limit,” and $l$ the mean free electron
path. Since $\xi_{GL}$ decreases with decreasing $l$, we immediately find that
small GL parameters $\kappa$ (i.e., type I superconductivity) will always occur
when $l \to \infty$ (i.e., mostly in pure metals), whereas type II
superconductivity prevails for $l \to 0$, i.e., in alloys and compounds.
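
The GL classification and the upper critical field can be made concrete with a
few lines of Python. The sketch below assumes the standard relations
$\kappa = \lambda/\xi_{GL}$ with the type I/type II boundary at
$\kappa = 1/\sqrt{2}$, and $\mu_0 H_{c2} = \phi_0/2\pi\xi_{GL}^2$. The function
names and the example lengths (100 nm, 5 nm) are hypothetical and chosen only
for illustration.

```python
import math

PHI_0 = 2.067833848e-15  # magnetic flux quantum h/2e, in Wb


def gl_parameter(penetration_depth_m, coherence_length_m):
    """GL parameter kappa = lambda / xi_GL."""
    return penetration_depth_m / coherence_length_m


def classify_superconductor(kappa):
    """Type I below kappa = 1/sqrt(2), type II otherwise (boundary counted as II)."""
    return "type I" if kappa < 1.0 / math.sqrt(2.0) else "type II"


def upper_critical_induction_tesla(coherence_length_m):
    """mu_0 * H_c2 = phi_0 / (2 pi xi_GL^2), returned in tesla."""
    return PHI_0 / (2.0 * math.pi * coherence_length_m ** 2)


# Hypothetical example: lambda = 100 nm, xi_GL = 5 nm.
kappa = gl_parameter(100e-9, 5e-9)
b_c2 = upper_critical_induction_tesla(5e-9)

print(f"kappa = {kappa:.0f} -> {classify_superconductor(kappa)}")
print(f"mu_0 H_c2 = {b_c2:.1f} T")
```

For these example lengths the material is far on the type II side, and the
short coherence length alone gives an upper critical induction of about 13 T,
which illustrates why reducing $\xi_{GL}$ raises $H_{c2}$ so strongly.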

It took several years for the community to realize the real impact of this set
of theories, for two reasons. The most important was that Abrikosov’s paper was
published in the same year as the BCS theory, which attracted worldwide
attention. Furthermore, magnetization curves such as those schematically
depicted in Fig. 14.52b had not been confirmed by experiment; measured curves
instead showed a rather broad and hysteretic behavior (Fig. 14.52c) which, at
the time, was analyzed (again) in terms of phenomenological models, e.g., the
Bean model, where a “material parameter,” the critical current density, $J_c$,
was introduced to explain the hysteretic magnetization and the loss-free
current transport in magnetic fields. Indeed, according to Abrikosov’s picture,
loss-free current transport cannot exist in the mixed state, since the Lorentz
forces set up by the field and the circulating supercurrents would immediately
start to drive the flux lines (with their “normal conducting” cores) through
the superconductor and thus create an electric field $E$, which would lead to
energy dissipation $W = |E|\,|J|$, unless this motion could be prevented in
some way. This subject represents the most important issue for applications of
superconductors in magnetic fields and is commonly referred to as the “flux
pinning problem.” It can be considered on a global basis, i.e., by taking the
volume-averaged action of such “pinning forces” over the entire volume of the
superconductor into account (the critical state equation:
$F_L = J_c \times B = -F_p$), or by modeling the individual pinning forces,
$f_p$, between a single flux line and a certain defect in the superconductor
matrix and then trying to sum them appropriately. The principle of flux pinning
is easily explained. Since the normal conducting flux line core represents a
volume where condensation energy is lost compared to the undisturbed
superconducting state, this energy loss can be avoided by placing the flux line
core, e.g., onto a normal conducting precipitate of appropriate size (i.e.,
with a diameter of ~$2\xi_{GL}$), which would make it energetically favorable
for the flux line to remain at this particular position, thus establishing the
pinning force $f_p$ of this defect. In practice, the optimization of defect
structures suitable for flux pinning turned out to be one of the most difficult
tasks and it is still ongoing today, even in the “best known technical”
superconductors.
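
To give a feeling for the magnitudes involved in the critical state equation,
the Python sketch below evaluates the volume pinning force density
$|F_p| = J_c B$ (current perpendicular to the field) and the dissipation
$W = |E|\,|J|$ that would arise once flux lines move. The function names and
all input values are hypothetical and purely illustrative.

```python
def pinning_force_density(j_c, b):
    """|F_p| = J_c * B in N/m^3, for current density perpendicular to the field."""
    return j_c * b


def dissipation_density(e_field, j):
    """W = |E| * |J| in W/m^3, the power dissipated per unit volume."""
    return abs(e_field) * abs(j)


# Hypothetical inputs, for illustration only.
J_C = 1.0e9       # critical current density, in A/m^2
B_FIELD = 12.0    # magnetic induction, in T
E_FIELD = 1.0e-4  # electric field caused by moving flux lines, in V/m

f_p = pinning_force_density(J_C, B_FIELD)
w = dissipation_density(E_FIELD, J_C)

print(f"required pinning force density: {f_p:.2e} N/m^3")
print(f"dissipation if flux moves:      {w:.2e} W/m^3")
```

In this example the defect structure has to supply a pinning force density of
the order of $10^{10}$ N m$^{-3}$, and even a very small electric field leads
to a large dissipated power density, which is why flux motion must be
suppressed.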

To sum up this section, we have learned that tailoring superconductors
appropriately is most important for applications. Firstly, we need to adjust
the electron mean free path $l$ in such a way that type II superconductivity
and, hence, high upper critical fields are achieved. Secondly, we need to
establish an optimized network of metallurgical defects in the superconductor,
each on the scale of the coherence length, i.e., of a few to a few tens of
nanometers, in order to take advantage of the enhanced field range for
loss-free current transport. The limit for the latter task will always be set
by the intrinsic superconducting properties of the material, i.e., the
condition $J_c < J_{c,\mathrm{dep}}$.


14.6.2
Superconducting Materials 


Beginning from Hg with its transition temperature of 4.15 K, the highest known
$T_c$ values developed slowly but steadily (Fig. 14.53), initially among the
metallic elements, then among alloys and compounds. They reached quite a stable
plateau in the class of so-called A15 superconductors slightly above 20 K, the
boiling temperature of liquid hydrogen, in the early 1970s. It came as a big
and unexpected surprise, therefore, when in 2001 Akimitsu’s group reported on
superconductivity in the “very simple” compound MgB2 with a $T_c$ of nearly
40 K (“How could we miss it?”). In the meantime, several classes of material
with highly interesting and exciting properties in the superconducting state
had been discovered, such as the “heavy fermion” superconductors (e.g.,
CeCu2Si2, UBe13, UPt3), the ruthenate superconductors (Sr2RuO4), the “magnetic
superconductors” (e.g., the quaternary boron carbide compounds of the type
RNi2B2C, where R stands for a rare earth element), the organic superconductors,
or even the fullerenes, i.e., (alkali metal)-doped C60 molecules, but none of
them were able to participate in the “race” for the highest known $T_c$, maybe
with the exception of charge-doped pure C60, which, however, disappeared again
in the context of the fraud scandal around J. H. Schön.


Fig. 14.53 Development of the highest known transition temperatures
since the discovery of superconductivity in 1911.

The most outstanding event during the nearly 100 years of superconductor
research was certainly the discovery of “high-temperature superconductivity”
(HTS) by Bednorz and Müller in 1986. In the course of research on ceramic
oxides (some of which had been known to become superconducting at temperatures
around 10 K), they started to investigate cuprate-based compounds and found
evidence for superconductivity above 30 K in (La,Ba)2CuO4, a discovery that led
not only to a “gold rush” atmosphere among scientists, but also attracted the
attention of politicians and the public in general. The highest known $T_c$
values rocketed to temperatures well above the boiling temperature of liquid
nitrogen (77 K), thus revolutionizing the issue of cryogenics for applications.
Among all the cuprates, YBa2Cu3O7-δ (YBCO) and the Bi-based compounds, with
transition temperatures of ~93 and up to ~110 K, respectively, play the
dominant role.

Today, nearly 20 years after its discovery, HTS is still far from being fully
understood. Although we know that Cooper pairs are again formed (in most cases,
however, with a different pairing symmetry), the pairing mechanism itself is
still under intensive discussion and major modifications to the original BCS
concepts developed for metals will certainly be required. Concerning their
physical properties, the cuprates offer an enormous variety of challenges for
physicists, crystallographers, metallurgists, and engineers. Due to their unit
cell structure with relatively small a,b-plane but “very large” c-axis
dimensions (typically 0.4 versus 3.5 nm), and due to the periodic formation of
CuO2 layers along the c axis, extraordinary properties occur in the normal as
well as in the superconducting state, the most prominent being the high
anisotropy between the basal planes and the c axis. Since the Cooper pairs are
mainly formed within the CuO2 planes, which, in some extreme cases, may be
separated by several layers of “nonsuperconducting” material,
superconductivity itself becomes two-dimensional (2D), rather than
three-dimensional (3D) as is the case in conventional superconductors. As a
consequence, the standard flux line lattice structure in the mixed state of
these extreme type II superconductors becomes “interrupted” if the field is
applied along the c direction, and breaks up into a loosely connected array of
flux line sections (“pancakes”) which are extremely difficult to pin by
metallurgical defects, especially at the high temperatures where the
superconductor is supposed to operate. This is because of the constant supply
of thermal energy $k_B T$, which is about 20 times higher at 77 K than at
4.2 K, and counteracts the pinning energy. Moreover, the coherence lengths are
extremely small (just slightly larger than the unit cell dimensions) and highly
anisotropic (with factors of 7 up to a few hundred between the basal plane and
the c axis), thus rendering all other mixed state parameters (including the
upper critical fields) anisotropic in the same way. As a consequence, sintered
ceramics (i.e., the original material form of the cuprates) are completely
unsuitable for carrying loss-free supercurrents of reasonable magnitude, since
the current would have to cross a huge number of grain boundaries between
grains of different orientations and thus of hugely different superconducting
properties. Complex texturing techniques are therefore required as a way out,
2D texturing (i.e., the alignment of the grains’ c axis) being sufficient for
the Bi-based materials, whereas 3D texturing (i.e., the additional alignment of
the a,b planes) is needed for YBCO. Returning to the flux pinning issue, the
small coherence length makes nanoengineering of the defect structures
mandatory, since very small defects (“point defects”) can only act collectively
and usually lead to very small pinning forces (at least at high temperatures),
whereas large defects (i.e., those that often occur during processing) are far
too large for efficient pinning. Excellent examples of efficient, appropriately
sized defects include fast-neutron-induced collision cascades (with a diameter
of ~6 nm, almost exactly $2\xi$ in YBCO at 77 K) or the recently reported
addition of nanoparticles of a second phase to YBCO bulk materials before
processing.
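
The statement that the thermal energy $k_B T$ is about 20 times higher at 77 K
than at 4.2 K is easy to check. The Python sketch below evaluates $k_B T$ in
milli-electronvolts at both temperatures, using the Boltzmann constant in
eV K$^{-1}$; the function name is chosen here for illustration.

```python
K_B_EV_PER_K = 8.617333262e-5  # Boltzmann constant in eV/K


def thermal_energy_mev(temperature_k):
    """Return k_B * T in milli-electronvolts."""
    return K_B_EV_PER_K * temperature_k * 1.0e3


e_nitrogen = thermal_energy_mev(77.0)  # liquid nitrogen
e_helium = thermal_energy_mev(4.2)     # liquid helium
ratio = e_nitrogen / e_helium

print(f"k_B T at 77 K:  {e_nitrogen:.2f} meV")
print(f"k_B T at 4.2 K: {e_helium:.3f} meV")
print(f"ratio: {ratio:.1f}")
```

The ratio is simply 77/4.2, i.e., roughly 18, consistent with the factor of
about 20 quoted in the text.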

In summary, a few thousand metals, alloys, compounds, ceramics, or organic 
substances are known today to become superconducting at temperatures between 
a few microkelvins and ~135 K. Among this enormous variety — not only with 
regard to their material form, but also concerning their spectrum of physical 
properties in the normal and superconducting state — fewer than ten are suitable 
for applications (“technical superconductors”) either in the high-current sector or 
for thin-film applications, an alarmingly small number that meet the four basic 
application requirements: high $T_c$, high $H_c$, high $J_c$, and mechanical stability
against thermal cycling as well as against deformation. 


14.6.3 
Technical Superconductors 


Imagine you have identified a material that fulfills all the four requirements just 
mentioned at the end Section 14.6.2, and further, that you have managed to pro- 
duce it in the form of a wire with a length of few hundred meters and that you 
have successfully wound a superconducting magnet coil. From the critical mate- 
rial parameters and your design of the coil, you would certainly expect to be able 
to operate the magnet safely, e.g., at 4.2 K and at a load current of e.g., 120 A, to 
produce a magnetic field of, e.g., 8 T. However, this was not the case initially, i.e., 
the magnet “quenched” (turned normal conducting) at currents that were lower 
than the “short sample” critical currents by a factor of 5-10. This effect, called 
“degradation,” represented the last obstacle to high-current applications and was 
only solved by a careful analysis of the operating conditions within a typical 
“densely packed” wire configuration in a coil. It turns out that heat may be pro- 
duced locally (either by less efficient exposure to the cooling agent or by mechan- 
ical energy transferred by vibrations or Lorentz forces), thus locally increasing the 
temperature of this coil segment, which in turn results in a decrease of the pin- 
ning forces and the initiation of flux movement. In this way a “vicious circle” is 
initiated, since flux movement again increases the local temperature, further re- 
ducing pinning there, and so on, until the normal state is reached at that spot. 
Due to the sudden appearance of the full normal-state resistance under full
current load, an “enormous” amount of heat is generated, which quickly spreads
through the rest of the coil and turns it entirely normal conducting. This can
be avoided by “stabilizing” the superconducting material, i.e., surrounding it with a
good conductor (Cu) that provides an alternative, low-dissipation path for the
current and thus enables the superconductor to return to its regular
operating conditions. This basic idea has been refined over the past decades
and has led to the concept of “multifilamentary wires,” where hundreds of tiny 
superconducting wires with diameters in the micrometer range are embedded in 
a Cu matrix (Fig. 14.54a) and arranged according to a sophisticated scheme that
allows transposition and twisting of the entire wire ensemble with a twist pitch of a
few centimeters, in order to keep the wires decoupled under moderate field
sweep rates. This concept has led to advanced high-tech processes, by which
wires are commercially produced in kilometer lengths and assembled into 
strands, cables, and finally suitable conductors, depending on the desired applica- 
tion (Fig. 14.54b). 
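
The benefit of the Cu stabilizer can be illustrated with a rough estimate of the Joule heating per unit length once a wire segment has turned normal. The sketch below compares the same superconductor cross-section without and with a parallel Cu path. Only the 120 A load current is taken from the example above; the wire diameter, the 1:1 area ratio, and both resistivities are illustrative assumptions of typical order of magnitude, not values from this chapter.

```python
import math

# Illustrative assumptions (not from the text), orders of magnitude only
RHO_SC_NORMAL = 6e-7   # ohm m, assumed normal-state resistivity of the superconductor
RHO_CU_COLD = 2e-10    # ohm m, assumed resistivity of Cu at liquid-helium temperature
WIRE_DIAMETER = 1e-3   # m, hypothetical wire diameter
CU_FRACTION = 0.5      # hypothetical Cu share of the cross-section (1:1 ratio)
CURRENT = 120.0        # A, load current used as an example in the text


def resistance_per_meter(resistivity, area):
    """Resistance per unit length (ohm/m) of a uniform conductor."""
    return resistivity / area


def parallel(r1, r2):
    """Equivalent resistance of two resistances connected in parallel."""
    return r1 * r2 / (r1 + r2)


total_area = math.pi * (WIRE_DIAMETER / 2) ** 2
area_cu = CU_FRACTION * total_area
area_sc = total_area - area_cu

r_sc = resistance_per_meter(RHO_SC_NORMAL, area_sc)
r_cu = resistance_per_meter(RHO_CU_COLD, area_cu)

# Joule heating per meter if the full current must pass the normal superconductor
p_bare = CURRENT ** 2 * r_sc
# Joule heating per meter if the current can share the parallel Cu path
p_stabilized = CURRENT ** 2 * parallel(r_sc, r_cu)

print(f"without Cu: {p_bare:.0f} W/m, with Cu: {p_stabilized:.2f} W/m")
print(f"reduction factor: {p_bare / p_stabilized:.0f}")
```

With these assumed numbers the heating drops by roughly three orders of magnitude, which is the sense in which the Cu gives the superconductor a chance to cool down again. The estimate ignores heat transfer to the coolant and the temperature dependence of the resistivities.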

At present, the materials for high-current applications are dominated by two
metallic low-temperature superconductors, the alloy Nb-Ti (with T_c values
between 8.5 and ~10 K, depending on composition) and the compound Nb3Sn
(T_c ≈ 18 K).

Fig. 14.54 (a) Multifilamentary superconductor: Nb3Sn strand made by
the internal tin process (19 × 219 filaments, single Ta barrier, Cu/non-Cu
= 1:1, wire diameter: 0.81 mm, J_c > 10^9 A m^-2 at 12 T and 4.2 K);
(b) Nb3Sn conductors manufactured for the toroidal field model coil
(TFMC) of ITER for operation at a current of 80 kA and a field of 9.7 T
(4.5 K). 720 Nb3Sn strands are arranged in a stainless steel jacket
(cable diameter: 37.5 mm).

The “workhorse” for superconducting magnets is certainly Nb-Ti, which is
produced in quantities of roughly 1000 tons per year, mainly for the magnets of
magnetic resonance imaging (MRI) devices. The success of Nb-Ti is due to its
combination of excellent strength and ductility with high current-carrying
capacity at magnetic fields sufficient for most applications (i.e., ~8 T at 4.2 K
and ~10 T at 1.9 K). Extensive work on this alloy system has demonstrated that both the
transition temperature and the upper critical field are optimal near ~40 at.% Ti,
a composition at which, according to the phase diagram, a significant fraction
of the normal conducting α-Ti phase precipitates under suitable
processing conditions and acts as pinning centers together with the dislocation
cell structure and other crystalline defects. After nearly 40 years of development,
significant progress has only recently been achieved by optimizing all processing
steps with regard to the defect structure on a nanometer scale at the final wire diameter.
Critical current densities of (4-4.6) × 10^9 A m^-2 (4.2 K, 5 T) have been reported,
and there is still room for further improvement.
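
To relate the current densities quoted for such strands to actual currents, the numbers in the caption of Fig. 14.54 can be combined in a short calculation. The sketch below takes the caption's lower bound on J_c as 10^9 A m^-2 and assumes that it refers to the non-Cu cross-section and that the cable current is shared equally among the strands; neither assumption is stated in the caption. Because the strand value holds at 12 T and 4.2 K while the cable operates at 9.7 T and 4.5 K, the comparison is indicative only.

```python
import math


def non_cu_area(wire_diameter, cu_to_non_cu):
    """Non-Cu cross-section (m^2) of a round strand with a given Cu/non-Cu ratio."""
    total = math.pi * (wire_diameter / 2) ** 2
    return total / (1.0 + cu_to_non_cu)


def critical_current(jc, area):
    """Critical current (A) for a uniform critical current density jc (A/m^2)."""
    return jc * area


# Values quoted in the caption of Fig. 14.54
WIRE_DIAMETER = 0.81e-3   # m
CU_TO_NON_CU = 1.0        # Cu/non-Cu = 1:1
JC_LOWER_BOUND = 1e9      # A/m^2 at 12 T and 4.2 K (assumed: per non-Cu area)
CABLE_CURRENT = 80e3      # A, operating current of the TFMC conductor
N_STRANDS = 720           # Nb3Sn strands in the cable

area = non_cu_area(WIRE_DIAMETER, CU_TO_NON_CU)
ic_strand = critical_current(JC_LOWER_BOUND, area)

# Assumption: the operating current is shared equally among all strands
current_per_strand = CABLE_CURRENT / N_STRANDS

print(f"non-Cu area per strand: {area * 1e6:.3f} mm^2")
print(f"strand critical current (lower bound): {ic_strand:.0f} A")
print(f"operating current per strand: {current_per_strand:.0f} A")
```

Under these assumptions each strand would carry a little over 100 A in operation, well below the lower bound of about 260 A estimated for its critical current at 12 T.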

Unfortunately, the intrinsic material parameters limit the range of applicability
of Nb-Ti to magnetic fields of ~8 T at 4.2 K. Higher fields can only be achieved by turning
to the intermetallic compound Nb3Sn, which is much less convenient to handle
during magnet assembly because of its brittleness (small strain tolerance of about
0.2-0.4%). Consequently, a “wind and react” concept has to be employed for the
magnet assembly in most cases, i.e., the entire wire and cable production and the
winding of the magnet (or magnet sections) have to be done with a “precursor”
material. The “final product” is then subjected to a heat treatment at fairly
elevated temperatures (600-700 °C), which are needed to form the
superconducting A15 phase. Unfortunately, these temperatures are far too high
for one essential component of the magnet, namely the conductor insulation, which
typically consists of glass-fiber-reinforced plastics (usually epoxies). Therefore,
after the reaction the windings have to be carefully separated (without bending
them too much) to provide room for the application of the glass fiber tapes
(Fig. 14.55). After reassembly, the entire winding pack is subjected to a
“vacuum impregnation” process in which the epoxy is introduced and cured at a
certain temperature (~120 °C), thus providing the winding
pack with the required electrical insulation as well as with sufficient strength
against shear and compression loads exerted by the Lorentz forces.

In order to actually produce Nb3Sn multifilamentary wires, a variety of
techniques have been devised, among them the “bronze process” and the “internal
tin process,” which are outlined briefly here. In the bronze process, Nb
rods are placed in a bronze (Cu-Sn alloy) cylinder and arranged there in the
desired configuration. After extrusion and repeated drawing and annealing steps,
the Cu stabilizer and diffusion barrier materials are added, and a second billet
configuration is assembled, again subjected to similar mechanical and thermal
cycles, and finally twisted. The decisive heat treatment step then allows Sn to
diffuse from the bronze into the Nb rods and to react there to form the desired A15
phase. The internal tin process relies on a different way of supplying Sn to the
superconductor. In this case, Nb rods are again arranged in the desired form,
but now in a hollow Cu cylinder, forming the initial billet, which is extruded.
Insertion of a Sn rod in the central hole is followed by drawing and annealing
cycles, in much the same way as described above. In all cases, great care must
be taken to achieve the best possible degree of reaction, i.e., to ensure that most
of the Nb volume is indeed transformed into Nb3Sn by sufficient Sn uptake.
Further improvements of the performance have been achieved by adding small
amounts of Ti or Ta to the Nb rods, which in the end act as additional scattering
centers for the charge carriers in the A15 phase, enhancing H_c2 and thus the
critical current densities at 4.2 K. Depending on design details and application
requirements (stabilization, ac losses), J_c values of (1-4) × 10^9 A m^-2 (4.2 K,
12 T) over lengths of several kilometers, and of ~10^8 A m^-2 even at 19 T (and
4.2 K), have been reported.

Fig. 14.55 Production step of the TFMC: application of the winding
insulation (white glass-fiber tape, arrow) after the reaction heat
treatment.

Turning now to the “emerging” conductors, comparatively rapid progress has
been achieved in the fabrication of long conductors based on the cuprate HTS
Bi2Sr2Ca2Cu3O10+δ (BiSCCO-2223, T_c ≈ 110 K). As mentioned in Section 14.6.2,
texturing is required to align the c-axis orientation of the grains. The route to
success is based on the “powder-in-tube” (PIT) technique employing silver as the
tube material. The principle is as follows. BiSCCO powder or rods are inserted
in a silver tube (or several such tubes are arranged in a suitable form in a silver
block for the production of multifilamentary conductors) and subjected to
cold working steps (pressing, drawing, rolling) similar to those mentioned above for the
metallic conductors, until a tape with typical dimensions of 3 mm × 0.3 mm is
obtained. The tape is wound onto a drum and then subjected to a heat treatment
cycle to form the superconducting phase. The temperature window for this
process is extremely narrow (around 835 °C) and also depends on the sheath
material (i.e., whether or not alloyed silver is used to increase the mechanical strength
of the conductor). Great care has to be taken to optimize the density of the
superconductor within the filaments as well as to achieve the highest possible volume
fraction of the 2223 phase. At present, commercial processes are in place for the
production of tapes in kilometer lengths offering critical currents in the range of
70 kA (at 77 K in self-field). The problem with this material lies in its very low
current-carrying capability in external magnetic fields at this temperature
(because of weak flux pinning), i.e., J_c drops to zero at around 400 mT at 77 K. This
imposes severe limitations on its application range, restricting it, e.g., to power cables or
current leads, where the magnetic field at the conductor remains small. Of course,
these restrictions no longer apply if lower temperatures are considered, because
flux pinning increases drastically at temperatures below ~40 K. Interestingly
enough, recent development rather favors the sister compound Bi-2212 (T_c ≈ 85
K) for applications at temperatures between 4.2 and 20 K, e.g., as insert coils for
enhancing the ultimate magnetic field produced by a “hybrid” magnet.
Returning to applications at temperatures around 77 K, higher magnetic fields
can be achieved only by employing YBCO. In this case, however, the intrinsic
physical properties of this material require “full” texturing, i.e., the additional
alignment of the basal planes, a task that can be accomplished only by highly
sophisticated techniques (“coated conductors,” Fig. 14.56) as follows. The RABiTS
technique (Rolling-Assisted Biaxially Textured Substrate) is based on texture
formation by cold-working Ni or Ni alloy tapes, leading (after a recrystallization heat
treatment) to a biaxially textured grain structure with typical grain misorientation angles
of the order of 10°. As the next step, a buffer layer (e.g., yttrium oxide,
yttria-stabilized zirconia (YSZ), ceria, or a combination of these) with a thickness
of several hundred nm is deposited, which
grows epitaxially on top of the substrate and serves as a protection layer against
interdiffusion between the substrate metal and the superconductor. The YBCO film
is deposited next (by any suitable deposition
technique); the alignment transferred to the YBCO grains via the buffer
layer is of the order of 5-6°, and the thickness of the superconductor is around 1.5
µm. The second technique, IBAD (Ion Beam Assisted Deposition), starts from
an untextured metallic substrate (either of the same materials as above or
stainless steel) and induces an in-plane texture of the buffer layer (initially YSZ)
during its deposition by an assisting ion beam. The degree of texture is again of the
order of 10° or less, and the deposition of the superconductor proceeds as above.
In this way, coated conductors with critical current densities exceeding 10^10
A m^-2 at 77 K and self-field have been achieved and maintained at a significant
level in fields up to several tesla, owing to the much more favorable flux pinning
conditions in YBCO at high temperatures. The present challenge consists mainly
in developing these techniques for long-length production and in devising
suitable conductor architectures for high current transport.

Fig. 14.56 Layout of a coated conductor.

Fig. 14.57 Summary of J_c-B data for technical superconductors at 4.2 K
(unless otherwise stated).
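
The need for suitable conductor architectures becomes clear when the film values are converted into a current. The sketch below takes the critical current density as 10^10 A m^-2 and the film thickness as 1.5 µm, as quoted above, and assumes that J_c is uniform across the film. The 1 cm tape width is a hypothetical value chosen only for illustration.

```python
def current_per_width(jc, thickness):
    """Critical current per unit tape width (A/m) for a film with uniform jc."""
    return jc * thickness


JC_FILM = 1e10        # A/m^2, order of magnitude quoted for 77 K and self-field
THICKNESS = 1.5e-6    # m, YBCO film thickness quoted in the text
TAPE_WIDTH = 1e-2     # m, hypothetical tape width (not from the text)

ic_per_meter_width = current_per_width(JC_FILM, THICKNESS)
ic_tape = ic_per_meter_width * TAPE_WIDTH

print(f"critical current per cm of width: {ic_per_meter_width * 1e-2:.0f} A")
print(f"critical current of the assumed tape: {ic_tape:.0f} A")
```

Despite the very high current density, the thin film limits the current of such a tape to the order of 100 A, so that high-current applications require many tapes in parallel or thicker films.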

The “new” superconductor MgB2 has a certain application potential. It
competes with both the metallic and the ceramic superconductors, mainly because
of its very high upper critical fields and the very low cost of the materials, but
only at temperatures below ~28 K. It has already been developed in the form of
multifilamentary wires (again based on the PIT technique) and can be produced
in lengths of several hundred meters, but a significant improvement in its
current-carrying capability is clearly necessary. A summary of the present
situation, in the form of a J_c-B diagram, is given in Fig. 14.57.

To conclude this section, a few remarks on materials for small-scale
applications seem appropriate. The “workhorse” here is the elemental
superconductor Nb (T_c = 9.2 K), which is used almost exclusively for SQUID
magnetometers and electronic devices, especially because of its robustness against
thermal cycling and its moisture tolerance. NbN (T_c = 16 K) is certainly a
candidate material for SQUID applications (because of the reduced cooling
requirements compared to Nb, especially for mobile magnetometers), but also for
integrated circuit technology. The same holds for YBCO thin films on
single-crystalline substrates or bicrystals, as long as the advantage of the high operating
temperature (extremely favorable cooling requirements) is not counterbalanced
by the unavoidable enhancement of the noise level at these temperatures.


14.6.4 
Applications 


In 1915, Onnes predicted that applications of superconductivity were “not far
away,” an error that can obviously be made even by one of the greatest and most
ingenious scientists. The short review of the development of superconductivity
presented in the previous sections should, however, enable the reader to
appreciate the numerous problems and obstacles, posed by the intrinsic physics of the
superconducting state as well as by technological issues, that had to be overcome in
order to pave the way to success. Indeed, the development of superconducting
magnets revolutionized the measuring techniques in solid-state, nuclear, and
low-temperature physics and chemistry laboratories worldwide, as did the successful
development of SQUID magnetometers based on the Josephson effect. Huge
scientific projects in high-energy physics, such as the accelerators at the Fermi
National Accelerator Laboratory (Batavia, IL, USA: Tevatron) or at CERN (Geneva, Switzerland:
Large Hadron Collider), could not have been built without superconducting
magnets. The same holds for the largest international project to be undertaken in the
decade beginning in 2006, i.e., the construction of the nuclear fusion device
ITER (at Cadarache, France), which is designed to demonstrate the feasibility of
nuclear fusion for electric power generation.

Whereas all of these applications are in some way related to a scientific or
technological environment, the largest and commercially most successful
application of superconductivity has occurred in medicine, where superconducting
magnets of unique field homogeneity form the basis for magnetic
resonance imaging, a novel diagnostic tool with unprecedented resolution, especially
in its latest versions employing 7 T magnets. In the same discipline, Josephson
junctions arranged in the form of multiarray sensors have opened a new field of
research and diagnostics, “biomagnetism,” i.e., a new way of detecting (without
any contacts) certain functions of our body, thus complementing the
traditional electrocardiogram or encephalogram techniques.

Most of these applications do not really suffer severely from the requirement
of cooling the superconductor with liquid helium. However, a much broader
range of applications could easily be imagined if liquid nitrogen cooling or the
much more readily available cooling capacity of cryocoolers at temperatures
in that range could be employed. Indeed, the first power cables based on the
high-temperature superconductor Bi-2223 are already being installed, YBCO thin-film
filters for cellphone base stations have undergone successful field tests, and fault
current limiters for the protection of the power grid, motors, and transformers are in
their final development stages. At the same time, the programs for
optimization of materials mentioned in Section 14.6.3 are progressing rapidly, thus
making the prediction of a real breakthrough into the market in the near future much
less risky than a few years ago.
correlation functions 528, 533, 543, 662
correlations 363 
— short-range 363 
corrosion 449, 874 
Cottrell-Stokes law 293, 325 
coverings 54 
CPA (coherent potential approximation) 
893, 894 
creation (of phonons) 140, 157 ff
critical cluster 388
critical cooling rate 930
critical current 940
critical current density 943
critical gradient concept 843
critical pressure 891
critical radius 385, 386
critical temperature 361, 364, 373, 380
cross-slip 305, 332
crystal rotation 801
crystal structure 19 ff
crystal-to-amorphous 452
crystalline anisotropy 863
cubic lattice (crystal, structure) 119, 127 f,
135 ff, 146, 159 ff
Curie temperature 866, 870, 871, 874, 891,
898
Curie/Néel temperature 887
Cu-Al 407
Cu-Al-Ni 410
Cu-Co 384
Cu-Mo 444
Cu-W 449
Cu-Zn 407
Cu-Zn system 643
- short-range order 644
- Warren-Cowley parameters 644
Cu-Zn-Al 410
CVM 572, 580
CVM-PFM hybrid model 573, 577
cyclic response 454
cyclotron magnetoresistance 915 ff


d
D-sorbitol 483
D03 alloys Fe3Si 745
D019 (Ni3Sn) structure 207
D03 (Fe3Al) structure 207
DAC (diamond anvil cell) 731 
Darken equation 260, 261 
Debye approximation 612 
Debye frequency 145, 153, 569 
Debye temperature 145 ff, 153, 569 
Debye wave vector 153 
Debye’s model 144 ff, 149 
Debye-Grüneisen model 569
Debye-Scherrer rings 519 
decagonal quasicrystals 53 
decomposition of alloys 837 
defect annihilation 377 
defect concentrations 209, 210 
— B2 ordered alloys 210 
- L10 ordered alloys 209
defects in metals and alloys 63, 173 
deformation 124 ff, 129 ff, 154, 167 
degradation mechanism 931 
delay line detector 826 
demagnetizing curve 874, 875 
demagnetizing field 902 
dendrite kinetics 78-82 
dendrite periodicity 83-85 
dendrites 74 
— intensive treatment 74-86 
density (ensemble) 668 
density functional theory 590 ff, 591 
- effective potential 592 
— exchange-correlation 591 
- generalized gradient approximation 
(GGA) 593 
— ground-state electron density 592 
— Kohn-Sham orbitals 592 
— local density approximation (LDA) 
593 
— total energy of the ground state 
591 
density of states 27, 28, 29, 36 
detailed balance 235, 425, 450, 457, 671, 
737 
detrending fluctuation analysis 728 
diagonalize free energy matrix 549 
diamond 446 
diamond anvil cell 731 
diatomic linear chain 136 
differential anomalous scattering 758
differential dilatometry 179 
differential thermal analysis 495 
diffracted beam 795 
diffuse scattering 549, 643, 683, 756 
diffusion 198, 217, 259, 263, 265, 266, 268,
270, 371, 616 ff, 673, 847
— diffusion barrier 617 
- diffusion mechanisms 617 
- diffusion paths 617 
—dislocation-core 266 
— grain-boundary 268 
- intermetallic compounds 198 
— nanocrystalline materials 847 
—nonreciprocal 259 
—pipe 266 
— processes at an atomistic level 617 
— short-circuit 265 
— suppressed 371 
- surface 270 
- uphill 263 
diffusion coefficient 227, 236, 237, 259, 261, 
495, 619, 621, 638 
—chemical 259 
—interdiffusion 259, 261 
- intrinsic 259 
- random walk 227 
— thermodynamic factor 236 
-tracer 237 
diffusion couple 257 
diffusion equation 256, 264, 402, 554 
diffusion of solute atoms 291 
diffusion potentials 433, 435, 436 
diffusion theory of thermal activation 221 
diffusion, ab-initio methods 617 ff 
— activated complex Eyring 618 
— attempt frequency 620 
—chain-of-states method 618 
— diffusion coefficient 619 
- diffusion path 618 
- minimum energy paths 617 
— molecular statics 617 
— nudged elastic band method 618 
— phonon dispersions of the transition 
state 620 
— slowest ascent 617 
— the transition state 618 
- transition state theory 617 
— Wert and Zener 618 
diffusion-controlled glide 291 
diffusionless transformation 406 
diffusivity 550 
disclination 799 
- dipole 800 
— partial 799 
dislocation 95, 125, 166 ff, 175, 266, 281, 
449, 799 

- climb 309 

- diffusion pathway 266 

— dipole 799 

— exhaustion 320 

— line tension 286 

- multiplication 317 

— patterning 447 

— sources 318 

- source/sink of point defects 175 
dislocation-dislocation interaction 321 
dislocation-solute interaction 285 
disorder-to-order transition 694 
disordered phase 533, 548 
disordering 444 
displacement cascades 441, 468, 479 

— cascade sizes 479 

— disordered zones 479 
displacive transformation 412 


dissipative systems 443 

dissolution of ordered precipitates 444 
divacancy 174 

divergence theorem 553 

DLM (disordered local moment) 893 
domain wall 899 

Doppler shift 739 


double well potential 558 

DQCs (decagonal quasicrystals) 53 

driven alloys 438 

droplet algorithm 723 

Drude formula 912 

3DTAP 437, see also three-dimensional 
tomographic atom probe 

Dulong and Petit (Law of) 143 ff, 153
dynamic light-scattering (DLS) 724
dynamic recovery 329
dynamic strain aging 291, 331
dynamic structure factor 663
dynamical equilibrium phase diagram 450
- control parameters 450
dynamical equilibrium phase diagram 471,
479
dynamical matrix 134 ff, 138, 163
dynamical phase transitions 453
dynamical sources of entropy 363
dynamical structure factor 158
dynamical theories of thermal activation 221


e 

EAM (embedded atom method) 657
effective anisotropy 872
effective cluster interaction energy 534, 564,
565, 570
effective cluster interactions 636
effective diffusion coefficient 480
effective free energy 447, 465
effective Hamiltonian 457
effective interactions 477, 483
effective pair interaction (EPI) energies 204
effective potentials 121, 165
effective stress 283
effective surface energy 477
effective temperature 462, 470, 476, 483
EFTEM (energy filtered TEM) 792
eigenvalue, eigenvector, eigenstate 135, 137,
141, 149, 163, 549
elastic constant (modulus) 125 ff, 128 ff,
146, 161, 166 ff
elastic modulus 521
elastic properties 598 ff
- elastic constants from ab-initio
methods 598
- elastic constants of Ti, Al, TiAl, NiAl
600
- elastic constants of Quartz and Schorl
601
- elastic energy density 598
- elastic moduli 600
- Hooke’s law 598
- number of elastic constants for crystal
systems and point groups 599
- polycrystalline material 600
- strain tensor 598
- stress tensor 598
- Voigt notation 598
elastic strain 393
elastic strain energy 701
elastic waves 146
elasticity 124 ff, 130, 161, 166 ff
electrical resistometry 182, 238
- kinetics of ordering 238
- point defects 182
electron 20, 21, 24, 27, 28, 29, 33, 34, 362
- concentrations 20, 21, 24, 27, 28, 29
- counts 27, 33, 34
- exchange and correlation 362
electron energy-loss spectroscopy (EELS)
791
electron localizability indicator 34
electron localization function 34
electron tomography 856
electron-to-atom ratio 367
electronegativity 20, 21, 23, 24, 27, 32, 367
electronic memory 923
electronic structure calculation 418
electronic total energy 569
electron-phonon interactions
ELF (electron localization function) 34, 36
ELI (electron localization indicator) 34 
Elinvar 887 

elongation, elastic 127 

embedded atom method (EAM) 657 


embedding function 657 
embryo, subcritical fluctuation 386
energy 349, 362
- configurational 349
- exchange and correlation 362
energy density 873, 874
energy-dispersive X-ray spectroscopy (EDXS)
791
energy-filtered TEM 792
enthalpy of formation 606, 607
- calorimetry 607
- DFT enthalpy Miedema’s model 608
- magnetic compounds 609
entropy 349, 350, 363, 366, 536, 672
- and CVM 536
- atom vibrations 363
- configurational 349
- dynamical 363
- electronic excitations 363
- logarithmic singularity 350
- magnetic spins 363
- phonon 366
environment 37
- atomic 37





epitaxial softening function 626 

equation of motion 130, 133 ff, 137, 146 

equiaxed dendrite 77, 81-82 

equiaxed structure 101-103 

equilibrium constituent strain energy 628, 
633 

equilibrium state of order 380 

Euler-Lagrange equation 402 

eutectic spacing 93 

eutectic structures 90-93 

eutectic temperature suppression 375, 496 

eutectoid phase diagram 358 

evaporation rates 430 

excess free energy 583 

exchange bias, see exchange field 919 

exchange field 918 

exhaustion hardening 322 

exhaustion of dislocations 317 

experimental investigation of phonons 156 ff 

explicit Euler technique 692 

external forcing 438 

external strain 126 ff, 134 

extraction electrode 823 

extrinsic dynamics 438 

extrusions 335 
f
faceted interface 63
far-from-equilibrium transition 552 
fast variable 464 
— adiabatic elimination 464 
fatigue 333, 454 
fcc-equivalent 564
Fe 865, 866, 867 
Fe-Si 867 
Fe-Zr-Cu-B 872 
Fe3Pd (L12) 564 
Fe-Al 182, 448, 452, 744 
FeCr alloy 838 
femtosecond laser 855
Fe-Ni 409
Fe-Ni L10 570
FePd 244
FePd (L10) 564
FePd, FePd3 136, 142 ff, 162 f
FePd3 (L12) 564
FePt 244
ferrimagnetism 908
Ferrites 866, 870
ferromagnetic state 564, 570
Fibonacci sequence 50 ff
Fick’s laws 256
field evaporation 822
field ion microscope (FIM) 818, 819, see also
FIM
field variables 552
FIM 821, 829
- best imaging field 821
- magnification 829
Finemet 872
finite-range ballistic jumps 476
finite-size effects 674
finite-size scaling 674
first-nearest-neighbors (nearest neighbour
atoms) 348
first-order transition 549
first-principles calculation 576
Fisher model 848
FLAPW (full-potential linear augmented
plane wave) method 561
flash memory 934
fluctuation kinetics 550
fluctuation-dissipation 475
fluctuations 384, 429, 433, 442, 459, 474,
479, 546
- gas of clusters 433
- source 433
- subcritical 429
flux lines 942
flux pinning 943
fluxes 445
— chemical 445 
— point defects 445 
focused ion beam 837 
Fokker-Planck equation 265, 389 
force constants (matrix) 133 ff, 137, 140, 164 
force matching method 658 
forced atomic mixing 441 
forced magnetostriction 892 
forcing intensity 482 
forest dislocations 321 
forest mechanism 292, 325 
forging 373 
formation energy of a compound 606 
formation enthalpy of a compound 631 
Fourier spectral method 692 
Fourier transformation, prediction of 
fluctuations 549 
Fowler-Guggenheim correction factor 364
fracture 335, 338
frame-by-frame analyses 815
free energy 354, 362, 381, 542, 554
- minimize 354, 362
- surface 381
- versus composition 354
free energy function 362
free-electron laser 750, 764
free-electron-gas 26
freezing 373
- dendritic 373
Frenkel defect 173 
Frenkel pairs 440 


full-potential linearized augmented plane- 
wave (FLAPW) 596 

full-potential linearized muffin-tin orbital 
(FLMTO) approach 596 


g
GB (grain boundary) diffusion 848
Gear algorithm 662
(generalized) chemical potential 691
generalized gradient approximation (GGA)
561
(generalized) gradient energy coefficient 580
generalized V-matrix 579
genetic algorithm 459
genetic algorithm for the selection of
geometrical figures 631
geometrical condition 528, 532, 533
giant magneto resistance (GMR) 850
giant magnetoresistance 915 ff
Gibbs coefficient 69
Ginzburg-Landau equations 254
Ginzburg-Landau free energy 552
Ginzburg-Landau type 554
Ginzburg-Landau type of free energy 572,
579
glass 374
glass transition 522
glass transition temperature 499
Glauber 675
glide of dislocations 282
global stability analysis 473


GMR (giant magnetoresistance) 919 ff 
GMR sensors 850 
— thermal stability 850 
golden mean 50 
Gorter model 877 
gradient energy coefficient 402, 554, 558 


grain boundary 98-100, 384, 395 
—low-angle 384 
- surface energy 395 





grain boundary (as diffusion pathway) 268 
grain boundary diffusion 852 

grain boundary energies 664 

grain boundary grooving 787 

grain boundary, impurity effects 622 ff 


—ab-initio modelling 622 
— boron strengthener 623 
— cleavage energy 623 
— cleavage of 623 
— cracking resistance 623 
- critical energy release rate of a crack 
622 
- embrittling agents 623 
- grain boundary embrittlement 622
- Griffith criterion 622
- Griffith energy 622
- hydrogen embrittlement 623
- hydrogen recombination poison 623
- hydrogen uptake 623
- theoretical fracture energy 623
grain growth 698
Grain structure 101-103
- important sternum 77
grandcanonical ensemble 669
graphite 446
grazing-incidence reflection 743
Green’s function of diffusion equation
225
ground state search 639 ff
ground-state 525
ground-state analysis 561
ground-state diagram 640
- configuration space 641
- random alloy 641
ground-state structures 639
- united-space cluster expansion 639





groundstate diagram of a binary alloy 632 
group velocity 142 

growth of precipitates 265 

Grüneisen parameter 151

Grüneisen behavior 885

Gummelt decagons 54 


h 
habit plane 407 
half-metallicity 914 
Hall-Petch law 337 
hard disk drive (HDD) 895, 896, 920, 921
hard magnetic materials 861, 862, 863, 873
hardening 316
hardening stages 323
hardness measurement 788
- local 788
harmonic approximation 132 ff, 137, 139 ff,
147
- beyond... 149 ff
harmonic oscillator 780
heat capacity 143, 145, 150 ff
heat flow 152 ff
Helmholtz free energy 349, 542, 568, 672
heterodyne X-ray studies 725
heterogeneous nucleation 383
heterophase fluctuations 432
HfCr2: phase stability 607 ff
- bulk modulus 607
- enthalpy of formation 608
- structural parameters 608
hierarchy of energies 545
high-density DVD (HD-DVD) 14
high resolution electron microscopy 519
high-energy ball milling 438, 448, 480, 483
high-energy electron irradiation 467
high-field susceptibility 892
high-resolution transmission electron
microscopy (HRTEM) 791
Hohenberg and Kohn (1964) 591
homodyne X-ray studies 723
homogeneous nucleation 383
homogeneous state 578
homogeneous system 552
Hooke’s law 126, 133, 369
HRTEM 791, 793, 796, 797
- hot-stage 797
- image resolution 791
- in-situ 797
- microscope parameters 796
- practical aspects 796
- principles of image formation 793
Hume-Rothery phases 20, 27, 33, 53
Hume-Rothery rules 367
hybrid model CVM-PFM 573, 577
hydrogen interstitial 193 
hypercrystals 48ff, 51 

hysteresis curve 790, 791 
hysteresis loop 861, 863 


i
icosahedral quasicrystals 55
imaging compression factor 829
imaging gas 820
impingement rates 430
importance sampling 670
impurity diffusion 232
- five-frequency model 232
in-situ microscopy 793
incubation time 391
indomethacin 483
induced anisotropy 864
industrial permanent magnets 874
inelastic neutron scattering 731
inelastic X-ray scattering 730
infinite-temperature dynamics 443
information resolution limit 795
inhomogeneity 552
inhomogeneous CVM 578
interaction forces 656
interaction parameters 504, 535
interatomic force constants 368
interchange energy 349
%/Ym interface 812


interface 384, 802, 806, 807, 810, 811, 813, 
815, 851 
— chemical sharpness 851 
— coherent 384, 802 
—- dynamics 815 
— incoherent 384, 802, 811 
-kink 810 


- migration 802 ff 
— order-disorder 806 
- oscillatory motion 815 
— partly coherent 802, 807 
—semicoherent 384 
- velocity 815 
— wave-like fluctuation 813 
Interface (Solid-Liquid) 63-64 
— detailed sternum 63-70 
Interface curvature 70 
interface fluctuation 806 
Interface kinetics 65 
Interface stability 71 
interfacial energy 384, 432, 496, 689 
interfacial free energy 429 
interfacial width 689 
intermediate scattering function 735 
intermetallic alloys 20, 37, 330, 332, 736 
intermetallics 736, 907 ff 
internal stress 282, 321 
interphase boundaries 802 
interphase boundary 803, 807 
- diffuse 803 
— position 807 
— thickness 807 
interstitial solute atom 178, 195 
— migration 178 
—sites 195 
interstitialcy mechanism 229, 230 
— correlation effects 230 
interstitials 449 
intrinsic dynamics 438 
intrinsic stability 544 
Invar 885 
inverse Kirkendall effect 445, 461, 463, 465 
inverse Monte Carlo simulation 683, 757 
inversion method of Connolly and Williams
630 
ion trajectory 827 
ion-beam mixing 477 
irradiation 438, 444, 460, 478 
— electron 444 
irradiation-induced precipitation 463 
irradiation-induced segregation 463 
Ising model 679 
isobaric-isothermal ensemble 668 
isomorphous substitution 758 
isothermal-isobaric ensemble 669 
isothermal martensite 408 
isothermal section 359 
isotopic substitution 758 
isotropic magnetic materials 864, 874, 881 
isotropic system (Ginzburg-Landau approach) 553
IXS (inelastic X-ray scattering) 730


J 

Jackson parameter 65 
Jagodzinski symbol 40
Jagodzinski-Wyckoff notation 40 
jogs 309, 310, 311 

Josephson effect 941 

jump diffusion 734 

jump frequency 379 


K

k-space formulation 549
Kagome nets 44
Kear-Wilsdorf lock 332
Kerr effect 903, 907 ff


Kinchin-Pease formula 440 


kinetic master equation 379 
kinetic Monte-Carlo simulations 247, 424,
456, 458, 637, 679
— aging of coherent precipitates 637
— diffusion 637
— growth 637
— microstructure evolution 637
— precipitation 637
kinetic path 380
kinetic processes 371
— deviations from equilibrium 371

kinetics, approach to equilibrium 371 
kinetics of concentration fields 436, 460 
kink-pair mechanism 293 
Kirkendall effect 259 
KMC see kinetic Monte Carlo 430 
Kohn and Sham (1965) 591 
Kolmogorov-Johnson-Mehl-Avrami (KJMA)
growth equation 397, 926
Kurdjumov-Sachs relationship 409 


L
L10 565
L10 (CuAu-I) structure 209, 244
— diffusion properties 244
— point defect concentrations 209
L10 (CuAu) structure 207
L10 ordered phase 542, 571, 572, 573, 574
L10 structure 562
L10-disorder phase boundary 570
L12 ordered phase 564, 565, 567
L12 (Cu3Au) structure 207, 242
— jump types 242
L12 structure 562
lactose 483
La2Zr2O7 446
Lagrange multiplier 542
laminated media 907
Landau-type series expansion 573
Langevin equation 474, 475
laser-assisted atom probe tomography 854
latent heat (freezing) 372
lattice gas 426
lattice phase field 437
lattice specific heat 611
lattice vibration effect 569
Laves phases 20, 21, 37, 43, 44, 46, 606
ledge motion 809
length scales 441, 468
— displacement cascades 441
— self-organization during external
forcing 468
Lennard-Jones (LJ) potential 534, 656
Lever rule 68, 351
Lewis theory 34 
Li-Al alloy 618
— diffusion paths 618
— energy barrier 618
— LiAl3 618


limited vacancy mobility 228
limits in MD 655
linear magnetostriction 864
linear stability analysis 463, 472
liquid phase 372
— undercooled liquid 372
liquidus (liquidus line) 356
lithography 837
local atomic displacement 569
local chemical analysis 817
local density approximation 362
locking-unlocking mechanism 295, 297
logarithmic singularity 350
long-period superlattice 727
long-range order (LRO) 197, 206, 240, 264,
359, 364
— Bragg-Williams theory 204
— correlation effects 240
— parameter 206, 360, 547
— point defects 197
— spinodal ordering 264
longitudinal recording 902 
Lorentz microscopy 792 
losses 865, 866, 868
— core losses 868
— current 866
— magnetic 865
LRO parameters 574
LSW (Lifshitz-Slyozov-Wagner) theory 265

Lyapunov function 462 





M

machinability 875 

Mackay cluster 56 
Macro-segregation 110-113 
MAGNEQUENCH 88 

magnetic bit 897, 902, 904 ff, 910 
magnetic Compton scattering 720 
magnetic fluctuations 890 
magnetic force microscopy (MFM) 782, 789
magnetic ground state 564
magnetic neutron scattering 710
magnetic RAM (MRAM) 896, 920
magnetic recording 896 ff
magnetic resonance imaging 953
magnetic storage 897, 906, 910
magnetic X-ray scattering 715
magnetization 897 ff
— remanent magnetization 899
— spontaneous magnetization 898
magneto-optics 897, 903, 907 ff 
magnetocrystalline anisotropy 906 
magnetoplumbite structure 878 
magnetoresistance 903, 915 ff 

— anisotropic-AMR 915
— cyclotron 915
— giant-GMR 915
— tunnel-TMR 915
Magnetostriction 864 
magneto-volume coupling 887, 889, 890 
magnons 363 
many-body interactions 525 
many-body potentials 657 
Markov chains 667, 671, 675 
Markov process 735 
martensite 406, 413, 415
— characteristics 406
— crystallography 413
— models 415
martensite finish temperature 408
martensite midrib 407
martensite plate 407
martensite start temperature 408
martensitic transformation 406, 407
massive transformation 411, 811
master equation 253, 430, 667, 735
master equation method 253
Matano plane 258
mean curvature 560
mean free path 913, 917
mean-field approximations 253
— crystal energetics 199
— kinetic models 253
mean-square displacement 673
mechanical activation 483
mechanical milling 799
mechanical stability 546
media noise 904
melting temperature depression 505
memory device 921
Mendeleev number 24
Meso-segregation 110-113
metallic glass 86, 375, 745, 757
metallic radius 367
metastable states 382, 549
method of inequality 536
method of overlapping distributions 664 
Metropolis-type algorithms 675, 677 
MgB2 944, 952

micro-segregation 85 

microbeams 726 

microcanonical ensemble 659, 668, 669 
microscopic master equation 456
microscopic methods 774 
microstructure 347, 372, 373, 864-866, 873,
880
— dendritic 372
— evolution 347
— of magnetic materials 864-866, 873
migration profiles 618
milling intensity 448, 452
mixed-space cluster expansion 633
mixing (forced mixing) 481
— superdiffusive 481
Molecular dynamics (MD) 221, 266, 441,
455, 654
— atom jump 221
— special diffusion paths 266
moment-volume instabilities 889
Monte Carlo (MC) simulation 247, 419, 458,
635, 667, 744, 838
— bridging the time scales 637
— coherent phase boundaries 635
— compared to other descriptions of
kinetics 255
— correlated jump cycles 247
— phase transition 636
— specific heat 635, 636
Morse potential 569
Mössbauer effect 244, 738
Mössbauer spectroscopy 244, 251
— quasielastic 251, 738
motion, oscillatory of ledges 809
multibody interaction energy 561
multichannel plate 823
multichannel plate (MCP) 824
multifilamentary wires 947
multiple anode array 824
multiply twinned structure 520
multiscale calculation 526, 560
multiscale modeling 625 ff
— bridging the length scales 625
multivalue function 546

Mumetal 865 

mushy zone (or mush) 110 
N

8-N rule 32, 33 


n-point cluster 531 

n-point correlation function 531 
n-state Potts model 698 

n°-fold symmetry 49 
nanoclusters 644 
nanocrystalline alloys 336, 866, 872, 880
nanohardness 789
nanomagnets 789
nanostructured materials 419 

nanostructures 480 

natural iteration method 542 

Nb 952 

Nb3Sn 406, 948 

Nb-Ti 947 

Nd-Fe-B 879 

near-equilibrium transition 552 

nearest-neighbor pair interaction 525, 
536 

nearest-neighbor pair probability 527, 
533 

neutron depolarization 714 


neutron source 367 
— pulsed 367 
— reactor 367 


neutron spin echo 714, 740 
neutron weight 367 
— correction 367 
Newton-Raphson procedure 542 
Newton’s equations of motion 655 
Ni-Al 182, 197, 214, 444, 469 
— diffusion 197
NiAl 216
Ni3Al 197, 208
— diffusion 197
Ni-Fe 868 
Ni2Ga3 216
Ni3Ga4 216 
Ni4Mo 444 
Ni3Sb 197, 216
— diffusion 197
nickel Σ5{001} twist grain boundary 622 ff
— boron strengthens 622
— grain boundary cohesion 624
— grain boundary segregation 624
— impurity elements He, Li, S, H, C, Zr,
P, Fe, Mn, Nb, Cr, and B 622
— segregation preference 624
— surface segregation 624
NiGa 214, 744 
Ni-Si 444 
Ni-Zr 448, 452 
noise from fluctuations 442
non-equilibrium 371
— compositions 371
— phase fractions 371
nonconservative field variable 555, 556
noncrystallographic symmetry 49
nonpolarized calculations 562
nonresonant magnetic scattering 717
non-volatile memory (NVM) 923, 924
normalization condition 532, 542
Nosé-Hoover thermostat 660
NPT ensemble (number of particles, pressure,
temperature) 659

nuclear collisions 440 

nuclear inelastic scattering 732 

nuclear resonant scattering 741 

nuclear waste 446 

nucleation 264, 373, 382, 384, 387, 391, 392, 
395, 396 


— and growth rate 926
— coherent 383
— Gibbs droplet 387
— grain boundary 396
— heterogeneous 395
— heterogeneous, incoherent 384
— homogeneous 395
— homogeneous, coherent 384
— incoherent 383
— kinetics 373
— rate 387
— time-dependent 391, 392
nucleation, growth, and coarsening 436 
nucleation of product phase 843 
nucleation rate 388 

— steady state 388 
nucleation theory 386, 466 

— irradiation 466
nucleation time lag 391 
nucleus 386, 393 

— strain energy 393 
nudged elastic band method 222 

— atom jump 222
— dislocation core diffusion 267
null cluster 536 





O
occupation operator 528
octahedron cluster 541
octet rule 25, 26
Olson-Cohen model 416
Onsager coefficients 436, 461
Onsager’s transport coefficients 435
operating temperature (permanent magnets)
875
optical disk 922
optical memory 923
order parameter 360, 547, 552
order-disorder 206, 452
order-disorder interphase boundary 803
order-disorder transition 206, 478, 575
ordered alloys 197, 238
— atom migration 238
— point defects 197
ordered arrangement 531
ordered phase 533
ordering 350, 752